Poster 15

OPKV: A High-Throughput Plugin-Driven Framework for Recallable Sparsity in Paged KV Cache Systems

Huazheng Lao ⋅ Xiaofeng Li ⋅ Rui Xu ⋅ Long Chen ⋅ Xia Zhu ⋅ Jinquan Zhang

Abstract
