Iclr2025acceptance

🎉 Our paper “A Theoretical Framework for Partially Observed Reward-States in RLHF” has been accepted to ICLR 2025! This is joint work with Mirco Mutti, Aldo Pacchiano, and my advisor Ambuj Tewari.