Skip to yearly menu bar Skip to main content


Oral

HetRL: Efficient Reinforcement Learning for LLMs in Heterogeneous Environments

Yongjun He · Shuai Zhang · · Xiyuan Zhang · Boran Han · Bernie Wang · Huzefa Rangwala · George Karypis

Abstract

Chat is not available.