Skip to yearly menu bar Skip to main content


Poster

BOute: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via Multi-Objective Bayesian Optimization

YOUHE JIANG · Fangcheng Fu · Eiko Yoneki

Abstract

Chat is not available.