Skip to yearly menu bar Skip to main content


Oral Wed, May 20, 2026 • 4:00 PM – 4:15 PM PDT

Learning from Less: Measuring the Effectiveness of RLVR in Low Data and Compute Regimes

Justin Bauer ⋅ Thomas Walshe ⋅ Derek Pham ⋅ Harit Vishwakarma ⋅ Armin Parchami ⋅ Frederic Sala ⋅ Paroma Varma

Abstract

Log in and register to view live content