Skip to yearly menu bar Skip to main content


Poster

TriInfer: Hybrid EPD Disaggregation for Efficient Multimodal Large Language Model Inference

Xianzhe Dong · Tongxuan Liu · Yuting Zeng · Weizhe Huang · · Siyu Wu · · Liu Yang · · Hailong Yang · · Jing Li

Abstract

Chat is not available.