← Back to Benchmarks
simmediumnavigationmetric · varies

Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning

Description

While large vision-language models (VLMs) show promise for object goal navigation, current methods still struggle with low success rates and inefficient localization of unseen objects--failures primarily attributed to weak temporal-spatial reasoning. Meanwhile, recent attempts to inject reasoning into VLM-based agents improve success rates but incur substantial computational overhead. To address both the ineffectiveness and inefficiency of existing approaches, we introduce Hydra-Nav, a unified V

Source

http://arxiv.org/abs/2602.09972v1