simmediumatarimetric · varies

Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning

Description

Model-free deep reinforcement learning (RL) algorithms have been widely used for a range of complex control tasks. However, slow convergence and sample inefficiency remain challenging problems in RL, especially when handling continuous and high-dimensional state spaces. To tackle this problem, we propose a general acceleration method for model-free, off-policy deep RL algorithms by drawing the idea underlying regularized Anderson acceleration (RAA), which is an effective approach to accelerating

Source

http://arxiv.org/abs/1909.03245v3