← Back to Benchmarks
simmediumroboticsmetric · varies

CATNAV: Cached Vision-Language Traversability for Efficient Zero-Shot Robot Navigation

Description

Navigating unstructured environments requires assessing traversal risk relative to a robot's physical capabilities, a challenge that varies across embodiments. We present CATNAV, a cost-aware traversability navigation framework that leverages multimodal LLMs for zero-shot, embodiment-aware costmap generation without task-specific training. We introduce a visuosemantic caching mechanism that detects scene novelty and reuses prior risk assessments for semantically similar frames, reducing online V

Source

http://arxiv.org/abs/2603.22800v1