simmediumatarimetric · varies

Interpretable performance analysis towards offline reinforcement learning: A dataset perspective

Description

Offline reinforcement learning (RL) has increasingly become the focus of the artificial intelligent research due to its wide real-world applications where the collection of data may be difficult, time-consuming, or costly. In this paper, we first propose a two-fold taxonomy for existing offline RL algorithms from the perspective of exploration and exploitation tendency. Secondly, we derive the explicit expression of the upper bound of extrapolation error and explore the correlation between the p

Source

http://arxiv.org/abs/2105.05473v1