simmediumatarimetric · varies

Learning Intrinsic Symbolic Rewards in Reinforcement Learning

Description

Learning effective policies for sparse objectives is a key challenge in Deep Reinforcement Learning (RL). A common approach is to design task-related dense rewards to improve task learnability. While such rewards are easily interpreted, they rely on heuristics and domain expertise. Alternate approaches that train neural networks to discover dense surrogate rewards avoid heuristics, but are high-dimensional, black-box solutions offering little interpretability. In this paper, we present a method

Source

http://arxiv.org/abs/2010.03694v2