sim · medium · manipulation-data · metric · varies
Persistent Robot World Models: Stabilizing Multi-Step Rollouts via Reinforcement Learning
Description
Action-conditioned robot world models generate future video frames of the manipulated scene given a robot action sequence, offering a promising alternative for simulating tasks that are difficult to model with traditional physics engines. However, these models are optimized for short-term prediction and break down when deployed autoregressively: each predicted clip feeds back as context for the next, causing errors to compound and visual quality to rapidly degrade. We address this through the fo