Poster

MegaBlocks: Efficient Sparse Training with Mixture-of-Experts

Trevor Gale · Deepak Narayanan · Cliff Young · Matei Zaharia
