> For clean Markdown of any page, append .md to the page URL.
> For a complete documentation index, see https://docs.nvidia.com/nemo/automodel/llms.txt.
> For AI client integration (Claude Code, Cursor, etc.), connect to the MCP server at https://docs.nvidia.com/nemo/automodel/_mcp/server.

# nemo_automodel.components.speculative.dflash.registry

Dispatch registry mapping target architecture -> DFlash draft model.

Mirrors the EAGLE registry (`components/speculative/eagle/registry.py`). The
DFlash draft is a non-causal Qwen3-style stack and is config-driven, so adding a
Qwen3-shaped architecture is a one-line append.

## Module Contents

### Classes

| Name                                                                                        | Description                                                             |
| ------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------- |
| [`DFlashDraftSpec`](#nemo_automodel-components-speculative-dflash-registry-DFlashDraftSpec) | How to build a DFlash draft model for a particular target architecture. |

### Functions

| Name                                                                                                            | Description                                                                          |
| --------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------ |
| [`resolve_dflash_draft_spec`](#nemo_automodel-components-speculative-dflash-registry-resolve_dflash_draft_spec) | Return the first registered DFlash draft spec matching any architecture in the list. |

### Data

[`DFLASH_DRAFT_REGISTRY`](#nemo_automodel-components-speculative-dflash-registry-DFLASH_DRAFT_REGISTRY)

[`_QWEN3_ARCHITECTURES`](#nemo_automodel-components-speculative-dflash-registry-_QWEN3_ARCHITECTURES)

### API

```python
class nemo_automodel.components.speculative.dflash.registry.DFlashDraftSpec(
    draft_cls: type[transformers.PreTrainedModel]
)
```

Dataclass

How to build a DFlash draft model for a particular target architecture.

```python
nemo_automodel.components.speculative.dflash.registry.resolve_dflash_draft_spec(
    architectures: list[str]
) -> nemo_automodel.components.speculative.dflash.registry.DFlashDraftSpec
```

Return the first registered DFlash draft spec matching any architecture in the list.

```python
nemo_automodel.components.speculative.dflash.registry.DFLASH_DRAFT_REGISTRY: dict[str, DFlashDraftSpec] = {arch: (DFlashDraftSpec(draft_cls=Qwen3DFlashDraftModel)) for arch in _QWEN3_ARC...
```

```python
nemo_automodel.components.speculative.dflash.registry._QWEN3_ARCHITECTURES: tuple[str, ...] = ('Qwen3ForCausalLM', 'Qwen3MoeForCausalLM')
```