Skip to yearly menu bar Skip to main content


Poster

FlexiCache: Leveraging Temporal Stability of Attention Heads for Efficient KV Cache Management

Nazmul Takbir ⋅ Hamidreza Alikhani Koshkak ⋅ Nikil Dutt ⋅ Sangeetha Abdu Jyothi

Abstract

Log in and register to view live content