

Oral Wed, May 20, 2026 • 9:00 AM – 9:15 AM PDT

BOute: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via Multi-Objective Bayesian Optimization

Youhe Jiang ⋅ Fangcheng Fu ⋅ Eiko Yoneki

Abstract
