Skip to yearly menu bar Skip to main content


Poster

Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs

Kihyuk (Ki) Hong ⋅ Woojin Chae ⋅ Yufan Zhang ⋅ Dabeen Lee ⋅ Ambuj Tewari

Abstract

Chat is not available.