Poster

Seesaw: High-throughput LLM Inference via Model Re-sharding

Qidong Su · Wei Zhao · Xin Li · Muralidhar Andoorveedu · Chenhao Jiang · Zhanda Zhu · Kevin Song · Christina Giannoula · Gennady Pekhimenko
Outstanding Paper Honorable Mention
