Iclr2025acceptance
🎉 Our paper “A Theoretical Framework for Partially Observed Reward-States in RLHF” has been accepted to ICLR 2025! This is joint work with Mirco Mutti, Aldo Pacchiano, and my advisor Ambuj Tewari.
🎉 Our paper “A Theoretical Framework for Partially Observed Reward-States in RLHF” has been accepted to ICLR 2025! This is joint work with Mirco Mutti, Aldo Pacchiano, and my advisor Ambuj Tewari.