sim · medium · robotics · metric · varies

NaviTrace: Evaluating Embodied Navigation of Vision-Language Models

Description

Vision-language models demonstrate unprecedented performance and generalization across a wide range of tasks and scenarios. Integrating these foundation models into robotic navigation systems opens pathways toward building general-purpose robots. Yet, evaluating these models' navigation capabilities remains constrained by costly real-world trials, overly simplified simulations, and limited benchmarks. We introduce NaviTrace, a high-quality Visual Question Answering benchmark where a model receiv

Source

http://arxiv.org/abs/2510.26909v3