← Back to Benchmarks
simmediumatarimetric · varies

Distributional Reinforcement Learning with Dual Expectile-Quantile Regression

Description

Distributional reinforcement learning (RL) has proven useful in multiple benchmarks as it enables approximating the full distribution of returns and extracts rich feedback from environment samples. The commonly used quantile regression approach to distributional RL -- based on asymmetric $L_1$ losses -- provides a flexible and effective way of learning arbitrary return distributions. In practice, it is often improved by using a more efficient, asymmetric hybrid $L_1$-$L_2$ Huber loss for quantil

Source

http://arxiv.org/abs/2305.16877v4