Poster 52

FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling

Ted Zadouri ⋅ Markus Hoehnerbach ⋅ Jay Shah ⋅ Vijay Thakkar ⋅ Tri Dao

Abstract
