Skip to yearly menu bar Skip to main content


Poster

Seesaw: High-throughput LLM Inference via Model Re-sharding

Qidong Su ⋅ Wei Zhao ⋅ Xin Li ⋅ Muralidhar Andoorveedu ⋅ Chenhao Jiang ⋅ Zhanda Zhu ⋅ Kevin Song ⋅ Christina Giannoula ⋅ Gennady Pekhimenko
Outstanding Paper Honorable Mention Outstanding Paper Honorable Mention

Abstract

Video

Chat is not available.