Beyond Model Serving: Cross-Stack Co-Design for Agentic Systems
Esha Choukse
Abstract
AI is moving from single-model inference to interactive, multimodal, and agentic systems. In this new regime, performance depends on co-design across the full stack, not on models or hardware alone. This talk argues for rethinking the boundary between machine learning and computer systems, and for treating accuracy and quality as dynamic system-level quantities that can be traded against latency, cost, and energy.