Skip to yearly menu bar Skip to main content


Poster 12

TriInfer: Hybrid EPD Disaggregation for Efficient Multimodal Large Language Model Inference

Xianzhe Dong ⋅ Tongxuan Liu ⋅ Yuting Zeng ⋅ Weizhe Huang ⋅ Xiaoyang Zhao ⋅ Siyu Wu ⋅ Liangyu Liu ⋅ Liu Yang ⋅ Yu Wu ⋅ Hailong Yang ⋅ Ke Zhang ⋅ Jing Li

Abstract

Log in and register to view live content