simmediumnavigationmetric · varies

CompassNav: Steering From Path Imitation To Decision Understanding In Navigation

Description

The dominant paradigm for training Large Vision-Language Models (LVLMs) in navigation relies on imitating expert trajectories. This approach reduces the complex navigation task to a sequence-to-sequence replication of a single correct path, fundamentally limiting the agent's ability to explore and generalize. In this work, we argue for and introduce a new paradigm: a shift from Path Imitation to Decision Understanding. The goal of this paradigm is to build agents that do not just follow, but tru

Source

http://arxiv.org/abs/2510.10154v2