nemo_automodel.components.models.deepseek_v4.kernels.tilelang_sparse_mla_bwd
nemo_automodel.components.models.deepseek_v4.kernels.tilelang_sparse_mla_bwd
Module Contents
Functions
API
Backward interface for V4 sparse MQA attention.
Parameters:
q
[B, S, H, D] bf16
kv
[B, S_kv, D] bf16
attn_sink
[H] fp32
o
[B, S, H, D] bf16 (forward output)
do
[B, S, H, D] bf16 (grad of output)
topk_idxs
[B, S, topk] int32
lse
[B, S, H] fp32 (log-sum-exp from forward)
sm_scale
float or None
Returns:
[B, S, H, D] bf16