Oral

FaaScale: Unlocking Fast LLM Scaling for Serverless Inference

Minchen Yu · Rui Yang · · Zhaoyuan Su · Sheng Yao · Tingfeng Lan · · Zirui Wang · Yue Cheng · Wei Wang · · Ruichuan Chen

Abstract
