Skip to yearly menu bar Skip to main content


Keynote Talk

When AI Starts Writing Systems Code

May 18, 3:30 PM - 4:30 PM Grand Ballroom 1
Systems are increasingly being written and optimized by AI systems. This talk focuses on kernel LLMs: models that generate GPU kernels. GPU kernels are a strong target for AI-driven optimization because they are verifiable and commercially interesting to optimize. But despite promising demos, very few AI-generated kernels are reliable enough to be used in production without significant human supervision. We will go through examples of how we made LLM kernel evaluation more robust through open benchmarks, community feedback loops, and infrastructure built in public through GPU MODE. We will close with some thoughts on where ML systems are going, where junior researchers should spend their time, and how to build systems that last in a world where the cost of writing code is approaching zero.
Speaker
Mark Saroufim

Mark Saroufim

Mark Saroufim is a member of technical staff at Core Automation, co-founder of GPU MODE and was formerly a systems researcher at Meta working on PyTorch. His work focuses on AI infrastructure, GPU kernels, open-source systems, and kernel LLMs. He cares about both building better AI systems and building the open communities and benchmarks that make progress possible.
View full details
Keynote Talk

The Next Horizon of Systems: From MLSys to System Intelligence

May 19, 10:30 AM - 11:30 AM Grand Ballroom
MLSys showed how systems can accelerate AI. The next shift is broader: AI is beginning to reshape the practice of systems itself. This emerging paradigm, which we call system intelligence, goes beyond automating programming tasks. It enables new forms of reasoning, design, validation, and evolution for complex systems while preserving rigor. In this talk, I will argue that system intelligence changes not only what systems we can build, but also how we understand systems as a discipline. It pushes us to rethink systems principles and methodology, shifting attention from code-level complexity to greater rigor in specification, design, and validation. Through our experiences with system verification, I will discuss how this shift may help give systems a stronger scientific foundation.
Speaker
Lidong Zhou

Lidong Zhou

Dr. Lidong Zhou is Corporate Vice President at Microsoft, Chief Scientist of the Microsoft Asia Pacific R&D Group, and Managing Director of Microsoft Research Asia. He has held research and leadership roles across Microsoft’s Silicon Valley, Redmond, and Asia labs. His work focuses on scalable, reliable, and trustworthy distributed systems, and has award papers at SOSP, OSDI, and USENIX ATC. He has also contributed to the design of large-scale systems that power Microsoft’s search, big data, cloud, and AI infrastructure.
View full details
Keynote Talk

Keynote - Luke Zettlemoyer

May 21, 10:30 AM - 11:30 AM Grand Ballroom
Speaker
Luke Zettlemoyer

Luke Zettlemoyer

I am a Professor in the Paul G. Allen School of Computer Science & Engineering at the University of Washington, and a Senior Research Director at Meta. My research focuses on empirical methods for natural language semantics, and involves designing machine learning algorithms, introducing new tasks and datasets, analyzing model performance, and, most recently, studying how to best develop self-supervision signals for pre-training. My honors include being elected as ACL President (2024), named an ACL Fellow (2021), as well as winning a PECASE award (2016), an Allen Distinguished Investigator award (2014), and many best paper awards. I was an undergraduate at NC State, recieved my PhD from MIT, and was a postdocal researcher at the University of Edinburgh.
View full details
Keynote Talk

Keynote - Amin Vahdat

May 20, 10:30 AM - 11:30 AM Grand Ballroom
Speaker
Amin Vahdat

Amin Vahdat

Amin Vahdat is a Fellow and Chief Technologist for AI Infrastructure at Google, where his team is responsible for delivering industry-leading infrastructure which spans custom silicon, data centers, network, and supply chain and operations. This infrastructure serves Alphabet, Google and the world, and Artificial Intelligence technologies that empower ML developers and solve customers’ most pressing business challenges. In the past, he was Vice President and General Manager for Google's compute, storage, and network hardware and software infrastructure. Until 2019, he was the Technical Lead and Vice President for the Networking organization at Google.
View full details
Keynote Talk

Keynote - Christos Kozyrakis

May 22, 9:45 AM - 10:45 AM Grand Ballroom
Speaker
Christos Kozyrakis

Christos Kozyrakis

Christos Kozyrakis is a Professor of Electrical Engineering and Computer Science at Stanford University. He leads the multi-scale architecture & systems team (MAST), a research group that investigates hardware architectures, runtime management environments, system software, and programming models for systems ranging from cellphones to warehouse-scale datacenters. His current research focuses on resource efficient cloud computing, energy efficient compute and memory systems for emerging workloads, and scalable operating systems. Christos joined Stanford in 2002 after receiving a PhD in Computer Science from UC Berkeley. His alma mater is the University of Crete in Greece. He is a fellow of the ACM and the IEEE.
View full details

No Events Found

Try adjusting your search terms