simmediummanipulation-datametric · varies

Persistent Robot World Models: Stabilizing Multi-Step Rollouts via Reinforcement Learning

Description

Action-conditioned robot world models generate future video frames of the manipulated scene given a robot action sequence, offering a promising alternative for simulating tasks that are difficult to model with traditional physics engines. However, these models are optimized for short-term prediction and break down when deployed autoregressively: each predicted clip feeds back as context for the next, causing errors to compound and visual quality to rapidly degrade. We address this through the fo

Source

http://arxiv.org/abs/2603.25685v1