Toggle Poster Visibility
Poster
Tue May 14 09:00 AM -- 09:20 AM (PDT) @ Poster Position Number 15
AWQ: Activation-aware Weight Quantization for On-Device LLM Compression and Acceleration
Poster
Tue May 14 09:20 AM -- 09:40 AM (PDT) @ Poster Position Number 31
QMoE: Sub-1-Bit Compression of Trillion Parameter Models
Poster
Tue May 14 09:40 AM -- 10:00 AM (PDT) @ Poster Position Number 13
Atom: Low-Bit Quantization for Efficient and Accurate LLM Serving
[
Slides]
Successful Page Load