Skip to yearly menu bar Skip to main content


Poster

AWQ: Activation-aware Weight Quantization for On-Device LLM Compression and Acceleration

Ji Lin ⋅ Jiaming Tang ⋅ Haotian Tang ⋅ Shang Yang ⋅ Wei-Ming Chen ⋅ Wei-Chen Wang ⋅ Guangxuan Xiao ⋅ Xingyu Dang ⋅ Chuang Gan ⋅ Song Han
Best Paper Award Best Paper Award
2024 Poster

Abstract

Video

Chat is not available.