Skip to yearly menu bar Skip to main content


Poster

BOute: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via Multi-Objective Bayesian Optimization

YOUHE JIANG ⋅ Fangcheng Fu ⋅ Eiko Yoneki

Abstract

Chat is not available.