Poster

Policy Evaluation for Reinforcement Learning from Human Feedback: A Sample Complexity Analysis

Zihao Li · Xiang Ji · Minshuo Chen · Mengdi Wang
2024 Poster

Abstract
