Run FAITH Evaluation#

Use this guide when you need to tune thresholds, filtering, or scorer models for FAITH inside nemotron steps run translate/nemo_curator without a separate evaluation command.

FAITH runs inside TranslationStage whenever faith_eval.enabled is true. The default.yaml starter profile ships with FAITH enabled. For how FAITH couples to non-LLM backends, see FAITH Evaluation Inside Translation.

Essential Knobs#

YAML path

Purpose

faith_eval.enabled

Master toggle. Set false when you only need raw translation.

faith_eval.threshold

Average score floor. Rows failing the threshold are dropped when filter_enabled is true.

faith_eval.filter_enabled

Enables or disables removing low-scoring rows.

faith_eval.model_name

Overrides server.model for scoring-only workloads.

faith_eval.generation_config

Optional OpenAI-compatible generation settings for the FAITH scorer.

faith_eval.max_concurrent_requests

Optional scorer-side concurrency limit.

LLM Credentials#

FAITH always requires the OpenAI-compatible server configuration. Set NVIDIA_API_KEY first, for example by replacing <api-key> below.

export NVIDIA_API_KEY="<api-key>"
uv run nemotron steps run translate/nemo_curator -c default \
  faith_eval.enabled=true \
  faith_eval.threshold=3.0 \
  faith_eval.filter_enabled=true \
  server.model=YOUR_LLM_MODEL_ID \
  input_path=/path/to/chat.jsonl \
  output_dir=/path/to/out \
  source_language=en \
  target_language=hi

FAITH scoring is part of Curator’s translation stage and is aligned to the translated segments produced by the stage. There is no separate faith_eval.segment_level switch in this Nemotron config.

Disable FAITH Temporarily#

uv run nemotron steps run translate/nemo_curator -c default \
  faith_eval.enabled=false \
  input_path=/path/to/chat.jsonl \
  output_dir=/path/to/out \
  source_language=en \
  target_language=hi \
  server.model=YOUR_LLM_MODEL_ID

Even with FAITH off, backend=llm still needs server.model for translation itself.