sim · medium · navigation · metric · varies
RANGER: A Monocular Zero-Shot Semantic Navigation Framework through Visual Contextual Adaptation
Description
Efficient target localization and autonomous navigation in complex environments are fundamental to real-world embodied applications. While recent advances in multimodal foundation models have enabled zero-shot object goal navigation, allowing robots to search for arbitrary objects without fine-tuning, existing methods face two key limitations: (1) heavy reliance on ground-truth depth and pose information, which restricts applicability in real-world scenarios; and (2) lack of visual in-context le