Skip to yearly menu bar Skip to main content


Oral Wed, May 20, 2026 • 3:30 PM – 3:45 PM PDT

Learning from Less: Measuring the Effectiveness of RLVR in Low Data and Compute Regimes

Justin Bauer ⋅ Thomas Walshe ⋅ Derek Pham ⋅ Harit Vishwakarma ⋅ Armin Parchami ⋅ Frederic Sala ⋅ Paroma Varma

Abstract

Log in and register to view live content