Run FAITH Evaluation#
Use this guide when you need to tune thresholds, filtering, or scorer models for FAITH inside nemotron steps run translate/nemo_curator without a separate evaluation command.
FAITH runs inside TranslationStage whenever faith_eval.enabled is true. The default.yaml starter profile ships with FAITH enabled. For how FAITH couples to non-LLM backends, see FAITH Evaluation Inside Translation.
Essential Knobs#
YAML path |
Purpose |
|---|---|
|
Master toggle. Set |
|
Average score floor. Rows failing the threshold are dropped when |
|
Enables or disables removing low-scoring rows. |
|
Overrides |
|
Optional OpenAI-compatible generation settings for the FAITH scorer. |
|
Optional scorer-side concurrency limit. |
LLM Credentials#
FAITH always requires the OpenAI-compatible server configuration. Set NVIDIA_API_KEY first, for example by replacing <api-key> below.
export NVIDIA_API_KEY="<api-key>"
uv run nemotron steps run translate/nemo_curator -c default \
faith_eval.enabled=true \
faith_eval.threshold=3.0 \
faith_eval.filter_enabled=true \
server.model=YOUR_LLM_MODEL_ID \
input_path=/path/to/chat.jsonl \
output_dir=/path/to/out \
source_language=en \
target_language=hi
FAITH scoring is part of Curator’s translation stage and is aligned to the translated segments produced by the stage. There is no separate faith_eval.segment_level switch in this Nemotron config.
Disable FAITH Temporarily#
uv run nemotron steps run translate/nemo_curator -c default \
faith_eval.enabled=false \
input_path=/path/to/chat.jsonl \
output_dir=/path/to/out \
source_language=en \
target_language=hi \
server.model=YOUR_LLM_MODEL_ID
Even with FAITH off, backend=llm still needs server.model for translation itself.