Discussions#
In-depth technical discussions and optimization guides:
Optimizing DeepSeek-V3 Training on GB200 NVL72 - Achieving 970 TFLOPS/GPU with MXFP8, kernel optimizations, and HybridEP
In-depth technical discussions and optimization guides:
Optimizing DeepSeek-V3 Training on GB200 NVL72 - Achieving 970 TFLOPS/GPU with MXFP8, kernel optimizations, and HybridEP