Timezone: America/Los_Angeles
Filter Events
Registration Desk
8:00 AM - 4:00 PM
Remarks
8:30 AM - 8:45 AM
Poster
8:45 AM - 10:00 AM
5 Events in this session
Invited Talk
... more
10:30 AM - 11:30 AM
Large language models (LLMs) have taken the world by storm—enabling new applications, intensifying GPU shortages, and raising concerns about the accuracy of their outputs. In this talk, I will present several projects I have worked on to address these challenges. Specifically, I will focus on: (i) Ray, a distributed framework for scaling AI workloads; (ii) vLLM and SGLang, two high-throughput inference engines for LLMs; and (iii) Chatbot Arena, a platform for accurate LLM benchmarking. I will conclude with key lessons learned and outline directions for future research.
Speaker Bio
My area of research is at the intersection between AI and systems, cloud computing, and distributed systems. I am equally interested in designing algorithms and systems with strong theoretical foundations, and in providing practical implementations that are deployable in the real world.
... more
Poster
1:15 PM - 2:40 PM
Poster
2:40 PM - 4:00 PM
Poster
4:45 PM - 6:00 PM
5 Events in this session
Poster
6:00 PM - 8:00 PM
Successful Page Load