Registration Desk: Registration and coffee Mon 12 May 08:30 a.m.
Invited Talk: Soumith Chintala
In this talk, we’ll explore how cutting-edge users are pushing PyTorch to its limits—from planetary-scale training on interactive supercomputers to ultra-efficient, real-time inference on exotic hardware. These power users give us a unique window into today’s most demanding ML systems challenges. We’ll also examine a bold idea that's on top of everyone's mind at this conference: using AI agents to automate a big chunk of work that these cutting-edge users currently do. This vision is far from realized. We’ll outline the open challenges in building such agents, and share concrete opportunities for open collaboration toward making SysML AI agents a reality.
Bio :
Invited Talk: Tim Dettmers
Lessons Learned from Successful PhD Students
This talk is for young PhD students and would be PhD students to help them understand how to have a satisfying and successful PhD in Machine Learning Systems. While it draws on my own experience in my PhD, creating such things as bitsandbytes and developing QLoRA, I will mainly draw on research about academic success and general patterns that I saw in successful PhD students to provide perspective. Key questions I will talk about: What is my research style? What should I work on? How should I work on it? How do I get the resources that I need for my work? How do I stay motivated?
Bio :Invited Talk: Wei-Lin Chiang
LMArena: An Open Platform for Crowdsourced AI benchmarks
Recent advance in AI has unlocked new capabilities and applications; however, its evaluation still poses significant challenges. We introduce LMArena, an open platform for evaluating AI based on human preferences. Our methodology employs a pairwise comparison approach and leverages input from a global user base through crowdsourcing. The platform has been operational for over two years, collecting ~3 million community votes. LMArena has emerged as one of the most popular LLM leaderboards, widely referenced by leading LLM developers and companies. Our website is publicly available at https://lmarena.ai
Bio :Invited Talk: Simran Arora
Designing Models from the Hardware Up
This talk presents systems-level techniques for designing language models that are both high quality and highly efficient. I’ll introduce ThunderKittens, a GPU programming library that simplifies the development of hardware-friendly models, and show how it enabled BASED—an attention-free architecture built from simple, throughput-oriented components. These innovations made it possible to train state-of-the-art 8B–405B parameter attention-free models on academic resources and have influenced emerging approaches across research, industry, and open-source.
Bio :Session: Industry Lightning Talks Mon 12 May 01:30 p.m.
Speaker Schedule
Time | Company | Speaker | Role |
---|---|---|---|
1:30 - 1:34pm | Waymo | Xinru Hua | Software Engineer |
1:34 - 1:38pm | Cloud Native Computing | Jeffrey Sica | Head of Projects |
1:38 - 1:42pm | Meta | Mark Saroufim | Software Engineer |
1:42 - 1:46pm | Turing | Ilya Shadfar | Technical Project Manager |
1:46 - 1:50pm | Character AI | Meng Zhu | Member of Technical Staff |
1:50 - 1:54pm | Megagon Labs | Eser Kandogan | Principal Research Engineer |
1:54 - 1:58pm | AWS | Yida Wang | Principal Scientist |
1:58 - 2:02pm | Qualcomm | Paul Whatmoug | Senior Director, Engineering |
2:02 - 2:06pm | Ant Research | Da Zheng | Senior Staff Scientist |
2:06 - 2:10pm | Google Research | Martin Maas | Staff Research Scientist |
2:10 - 2:14pm | Databricks | Qi Zheng | Engineering Manager |
2:14 - 2:18pm | Together AI | Yucheng Lu | Researcher |
2:18 - 2:22pm | PDT Partners | Rushi Nadimpally | Head of Research Engineering |
2:22 - 2:26pm | Lambda | Jessica Nicholson | Machine Learning Engineer |
2:26 - 2:30pm | PyTorch | Matt White | Executive Director |