Skip to yearly menu bar Skip to main content


Oral Wed, May 20, 2026 • 9:15 AM – 9:30 AM PDT

MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training

Wenxuan Li ⋅ Chengruidong Zhang ⋅ Huiqiang Jiang ⋅ Yucheng Li ⋅ ⋅ Lili Qiu

Abstract

Log in and register to view live content