Skip Stages When Iterating#

Both generate and translate honor skip_until, a string that names an enum entry on the internal stage list. Stages whose enum value is less than the named stage are skipped as long as the expected Parquet already exists.

Generation enum names#

From McqGenerationStage in runtime/benchmark_families/mcq/pipeline.py, valid names include:

GENERATION, JUDGEMENT, SEMANTIC_DEDUPLICATION, DISTRACTOR_EXPANSION, COVERAGE_CHECK, DISTRACTOR_VALIDITY_CHECK, SEMANTIC_OUTLIER_DETECTION, HALLUCINATION_EASINESS_DETECTION, FINAL_OUTPUT

Translation enum names#

From McqTranslationStage:

TRANSLATION, BACKTRANSLATION, QUALITY_METRICS, FINAL_OUTPUT

CLI usage#

Pass the resume point as a dotlist override:

uv run nemotron steps run byob/mcq -c /path/to/generate.yaml skip_until=JUDGEMENT
uv run nemotron steps run byob/mcq -c translate stage=translate skip_until=BACKTRANSLATION

Preconditions#

Skipping only works when the Parquet file produced by the previous stage is already on disk under output_dir/expt_name/stage_cache/. Otherwise the next stage reads missing input and fails.

For other common failure modes, see Troubleshooting.