Skip to yearly menu bar Skip to main content


Poster 11

FaaScale: Unlocking Fast LLM Scaling for Serverless Inference

Minchen Yu ⋅ Rui Yang ⋅ Chaobo Jia ⋅ Zhaoyuan Su ⋅ Sheng Yao ⋅ Tingfeng Lan ⋅ Yuchen Yang ⋅ Zirui Wang ⋅ Yue Cheng ⋅ Wei Wang ⋅ Ao Wang ⋅ Ruichuan Chen

Abstract

Log in and register to view live content