Skip to yearly menu bar Skip to main content


Poster 36

Toward Principled LLM Safety Testing: Solving the Jailbreak Oracle Problem

Shuyi Lin ⋅ Anshuman Suri ⋅ Alina Oprea ⋅ Cheng Tan

Abstract

Log in and register to view live content