simmediumoffline-rlmetric · varies

Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation

Description

Model-based offline Reinforcement Learning (RL) constructs environment models from offline datasets to perform conservative policy optimization. Existing approaches focus on learning state transitions through ensemble models, rollouting conservative estimation to mitigate extrapolation errors. However, the static data makes it challenging to develop a robust policy, and offline agents cannot access the environment to gather new data. To address these challenges, we introduce Model-based Offline

Source

http://arxiv.org/abs/2503.20285v1