← Back to Benchmarks
simmediumsim-to-realmetric · varies

CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives

Description

We introduce CRISP, a method that recovers simulatable human motion and scene geometry from monocular video. Prior work on joint human-scene reconstruction relies on data-driven priors and joint optimization with no physics in the loop, or recovers noisy geometry with artifacts that cause motion tracking policies with scene interactions to fail. In contrast, our key insight is to recover convex, clean, and simulation-ready geometry by fitting planar primitives to a point cloud reconstruction of

Source

http://arxiv.org/abs/2512.14696v3