Poster 133

Optimal Posterior Sampling for Policy Identification in Tabular Markov Decision Processes

Cyrille Kone · Kevin Jamieson

Abstract