Poster

MegaBlocks: Efficient Sparse Training with Mixture-of-Experts

Trevor Gale ⋅ Deepak Narayanan ⋅ Cliff Young ⋅ Matei Zaharia

