← Back to Benchmarks
simmediumvision-robotmetric · varies

Rethinking Video Generation Model for the Embodied World

Description

Video generation models have significantly advanced embodied intelligence, unlocking new possibilities for generating diverse robot data that capture perception, reasoning, and action in the physical world. However, synthesizing high-quality videos that accurately reflect real-world robotic interactions remains challenging, and the lack of a standardized benchmark limits fair comparisons and progress. To address this gap, we introduce a comprehensive robotics benchmark, RBench, designed to evalu

Source

http://arxiv.org/abs/2601.15282v1