simmediumnavigationmetric · varies

Embodied4C: Measuring What Matters for Embodied Vision-Language Navigation

Description

Vision-language navigation requires agents to reason and act under constraints of embodiment. While vision-language models (VLMs) demonstrate strong generalization, current benchmarks provide limited understanding of how embodiment -- i.e., the choice of physical platform, sensor configuration, and modality alignment -- influences perception, reasoning, and control. We introduce Embodied4C, a closed-loop benchmark designed as a Turing test for embodied reasoning. The benchmark evaluates the core

Source

http://arxiv.org/abs/2512.18028v1