Skip to yearly menu bar Skip to main content


Oral Tue, May 19, 2026 • 3:15 PM – 3:30 PM PDT

RagInfer: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval

Chien-Yu Lin ⋅ Keisuke Kamahori ⋅ Yiyu Liu ⋅ Xiaoxiang Shi ⋅ Madhav Kashyap ⋅ Yile Gu ⋅ Rulin Shao ⋅ Zihao Ye ⋅ Kan Zhu ⋅ Rohan Kadekodi ⋅ Stephanie Wang ⋅ Arvind Krishnamurthy ⋅ Luis Ceze ⋅ Baris Kasikci

Abstract

Log in and register to view live content