← Back to Benchmarks
simmediumroboticsmetric · varies
CATNAV: Cached Vision-Language Traversability for Efficient Zero-Shot Robot Navigation
Description
Navigating unstructured environments requires assessing traversal risk relative to a robot's physical capabilities, a challenge that varies across embodiments. We present CATNAV, a cost-aware traversability navigation framework that leverages multimodal LLMs for zero-shot, embodiment-aware costmap generation without task-specific training. We introduce a visuosemantic caching mechanism that detects scene novelty and reuses prior risk assessments for semantically similar frames, reducing online V