Skip to yearly menu bar Skip to main content


Poster

Policy Evaluation for Reinforcement Learning from Human Feedback: A Sample Complexity Analysis

Zihao Li ⋅ Xiang Ji ⋅ Minshuo Chen ⋅ Mengdi Wang
2024 Poster

Abstract

Chat is not available.