Toggle Poster Visibility
Poster
Tue May 14 01:30 PM -- 01:50 PM (PDT) @ Poster Position Number 26
Q-Hitter: A Better Token Oracle for Efficient LLM Inference via Sparse-Quantized KV Cache
Poster
Tue May 14 01:50 PM -- 02:10 PM (PDT) @ Poster Position Number 11
Fine-Tuning Language Models Using Formal Methods Feedback: A Use Case in Autonomous Systems
[
Slides]
Poster
Tue May 14 02:20 PM -- 02:40 PM (PDT) @ Poster Position Number 34
Punica: Multi-Tenant LoRA Serving
[
Slides]
Poster
Tue May 14 02:40 PM -- 03:00 PM (PDT) @ Poster Position Number 9
SLoRA: Scalable Serving of Thousands of LoRA Adapters
Successful Page Load