← Back to Benchmarks
simmediumvision-robotmetric · varies
WorldBench: Disambiguating Physics for Diagnostic Evaluation of World Models
Description
Recent advances in generative foundational models, often termed "world models," have propelled interest in applying them to critical tasks like robotic planning and autonomous system training. For reliable deployment, these models must exhibit high physical fidelity, accurately simulating real-world dynamics. Existing physics-based video benchmarks, however, suffer from entanglement, where a single test simultaneously evaluates multiple physical laws and concepts, fundamentally limiting their di