Skip to yearly menu bar Skip to main content


Poster

RagInfer: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval

Chien-Yu Lin · Keisuke Kamahori · Yiyu Liu · Xiaoxiang Shi · Madhav Kashyap · Yile Gu · Rulin Shao · Zihao Ye · Kan Zhu · Rohan Kadekodi · Stephanie Wang · Arvind Krishnamurthy · Luis Ceze · Baris Kasikci

Abstract

Chat is not available.