> For clean Markdown of any page, append .md to the page URL.
> For a complete documentation index, see https://docs.nvidia.com/nemo/automodel/llms.txt.
> For AI client integration (Claude Code, Cursor, etc.), connect to the MCP server at https://docs.nvidia.com/nemo/automodel/_mcp/server.

# nemo_automodel.components.datasets.llm

## Subpackages

* **[`nemo_automodel.components.datasets.llm.megatron`](/nemo-automodel/nemo_automodel/components/datasets/llm/megatron)**

## Submodules

* **[`nemo_automodel.components.datasets.llm.agent_chat`](/nemo-automodel/nemo_automodel/components/datasets/llm/agent_chat)**
* **[`nemo_automodel.components.datasets.llm.chat_dataset`](/nemo-automodel/nemo_automodel/components/datasets/llm/chat_dataset)**
* **[`nemo_automodel.components.datasets.llm.column_mapped_text_instruction_dataset`](/nemo-automodel/nemo_automodel/components/datasets/llm/column_mapped_text_instruction_dataset)**
* **[`nemo_automodel.components.datasets.llm.column_mapped_text_instruction_iterable_dataset`](/nemo-automodel/nemo_automodel/components/datasets/llm/column_mapped_text_instruction_iterable_dataset)**
* **[`nemo_automodel.components.datasets.llm.delta_lake_dataset`](/nemo-automodel/nemo_automodel/components/datasets/llm/delta_lake_dataset)**
* **[`nemo_automodel.components.datasets.llm.eagle3`](/nemo-automodel/nemo_automodel/components/datasets/llm/eagle3)**
* **[`nemo_automodel.components.datasets.llm.eagle3_cache`](/nemo-automodel/nemo_automodel/components/datasets/llm/eagle3_cache)**
* **[`nemo_automodel.components.datasets.llm.formatting_utils`](/nemo-automodel/nemo_automodel/components/datasets/llm/formatting_utils)**
* **[`nemo_automodel.components.datasets.llm.hellaswag`](/nemo-automodel/nemo_automodel/components/datasets/llm/hellaswag)**
* **[`nemo_automodel.components.datasets.llm.length_grouped_sampler`](/nemo-automodel/nemo_automodel/components/datasets/llm/length_grouped_sampler)**
* **[`nemo_automodel.components.datasets.llm.megatron_dataset`](/nemo-automodel/nemo_automodel/components/datasets/llm/megatron_dataset)**
* **[`nemo_automodel.components.datasets.llm.mock`](/nemo-automodel/nemo_automodel/components/datasets/llm/mock)**
* **[`nemo_automodel.components.datasets.llm.mock_iterable_dataset`](/nemo-automodel/nemo_automodel/components/datasets/llm/mock_iterable_dataset)**
* **[`nemo_automodel.components.datasets.llm.mock_packed`](/nemo-automodel/nemo_automodel/components/datasets/llm/mock_packed)**
* **[`nemo_automodel.components.datasets.llm.mock_prefix_tree`](/nemo-automodel/nemo_automodel/components/datasets/llm/mock_prefix_tree)**
* **[`nemo_automodel.components.datasets.llm.mock_seq_cls`](/nemo-automodel/nemo_automodel/components/datasets/llm/mock_seq_cls)**
* **[`nemo_automodel.components.datasets.llm.nanogpt_dataset`](/nemo-automodel/nemo_automodel/components/datasets/llm/nanogpt_dataset)**
* **[`nemo_automodel.components.datasets.llm.neat_packing`](/nemo-automodel/nemo_automodel/components/datasets/llm/neat_packing)**
* **[`nemo_automodel.components.datasets.llm.packed_sequence`](/nemo-automodel/nemo_automodel/components/datasets/llm/packed_sequence)**
* **[`nemo_automodel.components.datasets.llm.prefix_tree`](/nemo-automodel/nemo_automodel/components/datasets/llm/prefix_tree)**
* **[`nemo_automodel.components.datasets.llm.retrieval_collator`](/nemo-automodel/nemo_automodel/components/datasets/llm/retrieval_collator)**
* **[`nemo_automodel.components.datasets.llm.retrieval_dataset`](/nemo-automodel/nemo_automodel/components/datasets/llm/retrieval_dataset)**
* **[`nemo_automodel.components.datasets.llm.retrieval_dataset_inline`](/nemo-automodel/nemo_automodel/components/datasets/llm/retrieval_dataset_inline)**
* **[`nemo_automodel.components.datasets.llm.seq_cls`](/nemo-automodel/nemo_automodel/components/datasets/llm/seq_cls)**
* **[`nemo_automodel.components.datasets.llm.squad`](/nemo-automodel/nemo_automodel/components/datasets/llm/squad)**
* **[`nemo_automodel.components.datasets.llm.xlam`](/nemo-automodel/nemo_automodel/components/datasets/llm/xlam)**

## Package Contents

### Data

[`__all__`](#nemo_automodel-components-datasets-llm-__all__)

### API

```python
nemo_automodel.components.datasets.llm.__all__ = ['NanogptDataset', 'make_squad_dataset', 'make_retrieval_dataset', 'make_xlam_da...
```