← Back to Benchmarks
simmediumroboticsmetric · varies

DriveDreamer-Policy: A Geometry-Grounded World-Action Model for Unified Generation and Planning

Description

Recently, world-action models (WAM) have emerged to bridge vision-language-action (VLA) models and world models, unifying their reasoning and instruction-following capabilities and spatio-temporal world modeling. However, existing WAM approaches often focus on modeling 2D appearance or latent representations, with limited geometric grounding-an essential element for embodied systems operating in the physical world. We present DriveDreamer-Policy, a unified driving world-action model that integra

Source

http://arxiv.org/abs/2604.01765v1