Skip to yearly menu bar Skip to main content


Poster 11

On the Diminishing Returns of Expert Load Balancing in MoE LLM Serving

Hanfei Yu ⋅ Jinru Duan ⋅ Jiabin Luo ⋅ Hao Wang

Log in and register to view live content