← Back to Benchmarks
simmediumroboticsmetric · varies

ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment

Description

Video-based world models offer a powerful paradigm for embodied simulation and planning, yet state-of-the-art models often generate physically implausible manipulations - such as object penetration and anti-gravity motion - due to training on generic visual data and likelihood-based objectives that ignore physical laws. We present ABot-PhysWorld, a 14B Diffusion Transformer model that generates visually realistic, physically plausible, and action-controllable videos. Built on a curated dataset o

Source

http://arxiv.org/abs/2603.23376v2