simmediumatarimetric · varies

Nuclear Norm Maximization Based Curiosity-Driven Learning

Description

To handle the sparsity of the extrinsic rewards in reinforcement learning, researchers have proposed intrinsic reward which enables the agent to learn the skills that might come in handy for pursuing the rewards in the future, such as encouraging the agent to visit novel states. However, the intrinsic reward can be noisy due to the undesirable environment's stochasticity and directly applying the noisy value predictions to supervise the policy is detrimental to improve the learning performance a

Source

http://arxiv.org/abs/2205.10484v2