sim · medium · navigation · metric · varies
CostNav: A Navigation Benchmark for Real-World Economic-Cost Evaluation of Physical AI Agents
Description
While current navigation benchmarks prioritize task success in simplified settings, they neglect the multidimensional economic constraints essential for the real-world commercialization of autonomous delivery systems. We introduce CostNav, an Economic Navigation Benchmark that evaluates physical AI agents through comprehensive economic cost-revenue analysis aligned with real-world business operations. By integrating industry-standard data, such as Securities and Exchange Commission (SEC) filings