Skip to yearly menu bar Skip to main content


Oral Wed, May 20, 2026 • 9:45 AM – 10:00 AM PDT

FaaScale: Unlocking Fast LLM Scaling for Serverless Inference

Minchen Yu ⋅ Rui Yang ⋅ ⋅ Zhaoyuan Su ⋅ Sheng Yao ⋅ Tingfeng Lan ⋅ ⋅ Zirui Wang ⋅ Yue Cheng ⋅ Wei Wang ⋅ ⋅ Ruichuan Chen

Abstract

Log in and register to view live content