Poster

Policy Teaching via Data Poisoning in Learning from Human Preferences

Andi Nika ⋅ Jonathan Nöther ⋅ Debmalya Mandal ⋅ Parameswaran Kamalaruban ⋅ Adish Singla ⋅ Goran Radanovic

Abstract
