

Poster

FlexAttention: A Programming Model for Generating Fused Attention Variants.

Juechu Dong ⋅ Boyuan Feng ⋅ Driss Guessous ⋅ Yanbo Liang ⋅ Horace He

Abstract
