← Back to Benchmarks
simmediumroboticsmetric · varies
DriveDreamer-Policy: A Geometry-Grounded World-Action Model for Unified Generation and Planning
Description
Recently, world-action models (WAM) have emerged to bridge vision-language-action (VLA) models and world models, unifying their reasoning and instruction-following capabilities and spatio-temporal world modeling. However, existing WAM approaches often focus on modeling 2D appearance or latent representations, with limited geometric grounding-an essential element for embodied systems operating in the physical world. We present DriveDreamer-Policy, a unified driving world-action model that integra