Skip to yearly menu bar Skip to main content


Poster

Atom: Low-Bit Quantization for Efficient and Accurate LLM Serving

Yilong Zhao ⋅ Chien-Yu Lin ⋅ Kan Zhu ⋅ Zihao Ye ⋅ Lequn Chen ⋅ Size Zheng ⋅ Luis Ceze ⋅ Arvind Krishnamurthy ⋅ Tianqi Chen ⋅ Baris Kasikci
2024 Poster

Abstract

Video

Chat is not available.