simmediumoffline-rlmetric · varies

State-Covering Trajectory Stitching for Diffusion Planners

Description

Diffusion-based generative models are emerging as powerful tools for long-horizon planning in reinforcement learning (RL), particularly with offline datasets. However, their performance is fundamentally limited by the quality and diversity of training data. This often restricts their generalization to tasks outside their training distribution or longer planning horizons. To overcome this challenge, we propose State-Covering Trajectory Stitching (SCoTS), a novel reward-free trajectory augmentatio

Source

http://arxiv.org/abs/2506.00895v3