Skip to yearly menu bar Skip to main content


Oral

FaaScale: Unlocking Fast LLM Scaling for Serverless Inference

Minchen Yu ⋅ Rui Yang ⋅ ⋅ Zhaoyuan Su ⋅ Sheng Yao ⋅ Tingfeng Lan ⋅ ⋅ Zirui Wang ⋅ Yue Cheng ⋅ Wei Wang ⋅ ⋅ Ruichuan Chen

Abstract

Chat is not available.