Skip to yearly menu bar Skip to main content


Oral

TriInfer: Hybrid EPD Disaggregation for Efficient Multimodal Large Language Model Inference

Xianzhe Dong ⋅ Tongxuan Liu ⋅ Yuting Zeng ⋅ Weizhe Huang ⋅ ⋅ Siyu Wu ⋅ ⋅ Liu Yang ⋅ ⋅ Hailong Yang ⋅ ⋅ Jing Li

Abstract

Chat is not available.