Troubleshooting#
Common issues and solutions for the AI-Q blueprint.
Installation Issues#
Issue |
Cause |
Fix |
|---|---|---|
|
Package not installed in editable mode |
|
|
Using system |
Use |
NeMo Agent Toolkit plugins not found |
Plugins not installed |
|
Pre-commit hook failures |
Missing pre-commit setup |
|
|
Version conflict with LangGraph |
|
API Key Issues#
Issue |
Cause |
Fix |
|---|---|---|
|
Invalid or expired NVIDIA API key |
Regenerate key at build.nvidia.com |
|
Model endpoint overloaded or unavailable |
Retry, or switch to a different model in config |
Tavily search returns empty |
Invalid |
Verify key at tavily.com |
Serper search fails |
Missing |
Set key or remove |
Runtime Issues#
Issue |
Cause |
Fix |
|---|---|---|
Agent hangs on deep research |
LLM timeout or rate limit |
Set |
Shallow research returns generic answers |
Insufficient tool calls |
Increase |
Clarifier keeps asking questions |
Too many clarification turns |
Reduce |
SSE stream disconnects |
Network timeout |
Client auto-reconnects using |
Job status stuck on RUNNING |
Dask worker crashed |
Check Dask logs; the ghost job reaper will eventually mark it FAILURE |
Knowledge Layer Issues#
Issue |
Cause |
Fix |
|---|---|---|
|
Adapter module not imported |
Ensure backend package is installed: |
Empty retrieval results |
Collection is empty or wrong name |
Run ingestion first; verify |
Foundational RAG connection refused |
RAG Blueprint not running |
Start the RAG Blueprint server; verify |
|
Missing dependency |
|
Docker / Deployment Issues#
Issue |
Cause |
Fix |
|---|---|---|
Container fails to start |
Missing environment variables |
Check |
Port already in use |
Another service on port 3000/8000 |
Set |
UI shows “Backend unavailable” |
Backend not healthy |
|
VM / Remote Development#
If you are running the AI-Q blueprint on a remote VM (cloud instance, WSL, SSH server) and accessing it from your local browser, localhost:3000 and localhost:8000 will not resolve because the services are listening on the VM — not your local machine.
SSH Port Forwarding#
Forward the required ports through your SSH connection:
# Forward both the frontend and backend ports
ssh -L 3000:localhost:3000 -L 8000:localhost:8000 user@your-vm-host
Then open http://localhost:3000 on your local machine as usual.
To forward ports to an already-active SSH session, you can also use ~C (SSH escape sequence) to open the SSH command line and type the following on a single line (press Enter at the end):
-L 3000:localhost:3000 -L 8000:localhost:8000
VS Code Remote SSH#
If you are using VS Code Remote-SSH, ports are typically forwarded automatically when the server starts listening. If not, open the Ports panel (Ctrl+Shift+P → “Ports: Focus on Ports View”) and add ports 3000 and 8000 manually.
Common Symptoms#
Symptom |
Cause |
Fix |
|---|---|---|
“This site can’t be reached” on |
Ports not forwarded from VM to local machine |
Use SSH port forwarding (see above) |
Connection refused after forwarding |
Service not running on the VM |
SSH into the VM and verify with |
Port forwarding conflicts |
Local port already in use |
Use alternate local ports: |
Note
Docker Compose deployments on the VM handle container-to-host port mapping automatically. The SSH forwarding described here is for making the VM’s ports accessible on your local machine.
Debugging Tips#
Enable Verbose Logging#
# In your config YAML
workflow:
_type: chat_deepresearcher_agent
verbose: true
Or through CLI: ./scripts/start_cli.sh --verbose
Phoenix Tracing#
For full setup instructions covering Phoenix, LangSmith, and other tracing backends, see Observability.
Start a Phoenix server and enable tracing in config:
general:
telemetry:
tracing:
phoenix:
_type: phoenix
endpoint: http://localhost:6006/v1/traces
project: dev
Then open http://localhost:6006 to inspect traces, token usage, and latency.
Check Registered Components#
# List registered NeMo Agent Toolkit plugins
.venv/bin/nat info components