Concepts

These pages explain how the mcq family inside src/nemotron/steps/byob prepares data, runs each generation stage, and optionally translates benchmarks.

Architecture

Pipeline overview

Prepare, generate, translate, and the Parquet stage cache.

Pipeline Overview

Core processes

Data preparation

Seeds from Hugging Face plus local corpus chunks.

Data Preparation for Multiple-Choice Question Benchmarks

Mapping targets to sources

source_subjects, weights, and optional tags.

Getting the Right Questions From the Source Benchmark

Question generation

Data Designer batched calls from prepared seeds.

Question Generation

Quality assurance

Validation stack

Judgement, deduplication, distractors, coverage, outliers.

Quality Validation

Filtering

Easiness and hallucination scores with removal flags.

Easiness and Hallucination Filtering

Translation

Translation

Curator translation, backtranslation, metrics, final schema.

Translation

Next steps