Skip to yearly menu bar Skip to main content


Poster

FlexAttention: A Programming Model for Generating Fused Attention Variants.

Juechu Dong · BOYUAN FENG · Driss Guessous · Yanbo Liang · Horace He

Abstract

Video

Chat is not available.