Troubleshooting#
This page lists common symptoms when you run nemotron steps run translate/nemo_curator and shows the field, flag, or environment variable to inspect first.
Each table pairs a symptom with a concrete remedy.
For stage flow and design rationale, see the explanation pages linked from Concepts.
Authentication and Credentials#
Symptom |
What to do |
|---|---|
HTTP 401 or 403 from the chat-completions endpoint, or a Curator log line about a missing API key |
Confirm the variable named in |
FAITH scoring fails with a credentials error even though |
FAITH always uses the large language model (LLM) client under |
Google backend rejects the request with a permission or project error |
Confirm application default credentials are present in the environment that runs the step. Do not paste secrets into |
Model and Endpoint Configuration#
Symptom |
What to do |
|---|---|
HTTP 404 or a “model not found” message from the LLM endpoint |
Hosted catalogs retire identifiers frequently. List the models your tenant currently exposes and pin |
Google translation rejects the request because |
API version |
NMT requests time out before the service responds |
Raise |
Throttling and Concurrency#
Symptom |
What to do |
|---|---|
HTTP 429 responses, bursty failures, or sustained slowdowns from a hosted LLM endpoint |
Lower |
A self-hosted NMT service returns errors under load |
Reduce |
Inputs and Output Layout#
Symptom |
What to do |
|---|---|
Reader errors about mixed file types when |
Curator readers expect one record format per directory. Split the inputs into separate directories for JSON Lines (JSONL) and Parquet, or set |
Ray worker logs show |
Export |
Empty JSONL input fails with |
The reader found no records. Treat the run as an empty-input validation failure, confirm the input path is correct, and rerun with a non-empty file or directory. |
Output shards do not appear under |
The writer emits partitioned files, not a single merged file. Inspect the shard pattern under |
FAITH Evaluation#
Symptom |
What to do |
|---|---|
Every translated row is dropped after FAITH runs |
The |
FAITH scores look inconsistent across runs of the same data |
Pin both |