Skip to yearly menu bar Skip to main content


Poster

Atom: Low-Bit Quantization for Efficient and Accurate LLM Serving

Yilong Zhao · Chien-Yu Lin · Kan Zhu · Zihao Ye · Lequn Chen · Size Zheng · Luis Ceze · Arvind Krishnamurthy · Tianqi Chen · Baris Kasikci
2024 Poster

Abstract

Video

Chat is not available.