simmediumatarimetric · varies

EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data

Description

Sample efficiency remains a crucial challenge in applying Reinforcement Learning (RL) to real-world tasks. While recent algorithms have made significant strides in improving sample efficiency, none have achieved consistently superior performance across diverse domains. In this paper, we introduce EfficientZero V2, a general framework designed for sample-efficient RL algorithms. We have expanded the performance of EfficientZero to multiple domains, encompassing both continuous and discrete action

Source

http://arxiv.org/abs/2403.00564v2