Concepts#
These pages explain how the mcq family inside src/nemotron/steps/byob prepares data, runs each generation stage, and optionally translates benchmarks.
Architecture#
Pipeline overview
Prepare, generate, translate, and the Parquet stage cache.
Core processes#
Data preparation
Seeds from Hugging Face plus local corpus chunks.
Mapping targets to sources
source_subjects, weights, and optional tags.
Question generation
Data Designer batched calls from prepared seeds.
Quality assurance#
Validation stack
Judgement, deduplication, distractors, coverage, outliers.
Filtering
Easiness and hallucination scores with removal flags.
Translation#
Translation
Curator translation, backtranslation, metrics, final schema.
Next steps#
Hands-on first run: Getting Started with Building MCQ Benchmarks
YAML tables: Reference