Poster
Ballroom B - Position 1
PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices
Kazuki Osawa · Shigang Li · Torsten Hoefler
[Slides] [Poster]
Poster
Ballroom B - Position 3
Breadth-First Pipeline Parallelism
Joel Lamy-Poirier
[Paper] [Slides] [Poster]
Poster
Ballroom B - Position 2
Tutel: Adaptive Mixture-of-Experts at Scale
Changho Hwang · Wei Cui · Yifan Xiong · Ziyue Yang · Ze Liu · Han Hu · Zilong Wang · Rafael Salas · Jithin Jose · Prabhat Ram · HoYuen Chau · Peng Cheng · Fan Yang · Mao Yang · Yongqiang Xiong
[Paper] [Slides]