Skip to yearly menu bar Skip to main content


Poster

Order-Optimal Regret with Novel Policy Gradient Approaches in Infinite-Horizon Average Reward MDPs

Swetha Ganesh ⋅ Washim Uddin Mondal ⋅ Vaneet Aggarwal

Abstract

Chat is not available.