Poster

DiffServe: Efficiently Serving Text-to-Image Diffusion Models with Query-Aware Model Scaling

Sohaib Ahmad ⋅ Qizheng Yang ⋅ Haoliang Wang ⋅ Ramesh Sitaraman ⋅ Hui Guan

Abstract

