← Back to Benchmarks
simmediummanipulation-datametric · varies

V-CAGE: Context-Aware Generation and Verification for Scalable Long-Horizon Embodied Tasks

Description

Learning long-horizon embodied behaviors from synthetic data remains challenging because generated scenes are often physically implausible, language-driven programs frequently "succeed" without satisfying task semantics, and high-level instructions require grounding into executable action sequences. To address these limitations, we introduce V-CAGE, a closed-loop framework for generating robust, semantically aligned manipulation datasets at scale. First, we propose a context-aware instantiation

Source

http://arxiv.org/abs/2601.15164v1