

Invited Talk Mon, May 18, 2026 • 9:50 AM – 10:15 AM PDT Grand Ballroom 1

LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference

Yuhan Liu

Speaker

Yuhan Liu

Yuhan Liu is a fifth-year PhD candidate at the University of Chicago, co-advised by Junchen Jiang and Shan Lu. Her research focuses on building efficient, large-scale systems and networking support for ML model inference. She has been named an MIT EECS Rising Star and has received a EuroSys Best Paper Award and UChicago’s Neubauer PhD Fellowship for her research. She also leads two open-source projects that build large-scale KV caching layers for efficient LLM inference; they are used in production at more than 30 companies, including Google Cloud, Amazon AWS, NVIDIA, and IBM.
