Poster

Seesaw: High-throughput LLM Inference via Model Re-sharding

Qidong Su · Wei Zhao · Xin Li · Muralidhar Andoorveedu · Chenhao Jiang · Zhanda Zhu · Kevin Song · Christina Giannoula · Gennady Pekhimenko

Abstract

