nemo_automodel.components.speculative.dspark

View as Markdown

DSpark speculative-decoding draft model and training objective.

A semi-autoregressive parallel drafter: a parallel backbone produces every position of a block in one pass, a lightweight serial Markov head injects intra-block token dependency, and a confidence head predicts per-position acceptance probability for scheduled verification.

Submodules

Package Contents

Data

__all__

API

nemo_automodel.components.speculative.dspark.__all__ = ['DSparkForwardOutput', 'Qwen3DSparkModel', 'build_draft_config', 'compute_dspar...