← Back to Benchmarks
simmediumatarimetric · varies
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Description
Exploration is critical for deep reinforcement learning in complex environments with high-dimensional observations and sparse rewards. To address this problem, recent approaches proposed to leverage intrinsic rewards to improve exploration, such as novelty-based exploration and prediction-based exploration. However, many intrinsic reward modules require sophisticated structures and representation learning, resulting in prohibitive computational complexity and unstable performance. In this paper,