nemo_automodel.components.speculative.dflash.registry
nemo_automodel.components.speculative.dflash.registry
Dispatch registry mapping target architecture -> DFlash draft model.
Mirrors the EAGLE registry (components/speculative/eagle/registry.py). The
DFlash draft is a non-causal Qwen3-style stack and is config-driven, so adding a
Qwen3-shaped architecture is a one-line append.
Module Contents
Classes
Functions
Data
API
Dataclass
How to build a DFlash draft model for a particular target architecture.
draft_cls
Return the first registered DFlash draft spec matching any architecture in the list.