simmediumatarimetric · varies

Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models

Description

Achieving efficient and scalable exploration in complex domains poses a major challenge in reinforcement learning. While Bayesian and PAC-MDP approaches to the exploration problem offer strong formal guarantees, they are often impractical in higher dimensions due to their reliance on enumerating the state-action space. Hence, exploration in complex domains is often performed with simple epsilon-greedy methods. In this paper, we consider the challenging Atari games domain, which requires processi

Source

http://arxiv.org/abs/1507.00814v3