arXiv Open Access 2025

CostNav: A Navigation Benchmark for Real-World Economic-Cost Evaluation of Physical AI Agents

Haebin Seong Sungmin Kim Yongjun Cho Myunchul Joe Geunwoo Kim +18 lainnya
Lihat Sumber

Abstrak

While current navigation benchmarks prioritize task success in simplified settings, they neglect the multidimensional economic constraints essential for the real-world commercialization of autonomous delivery systems. We introduce CostNav, an Economic Navigation Benchmark that evaluates physical AI agents through comprehensive economic cost-revenue analysis aligned with real-world business operations. By integrating industry-standard data--such as Securities and Exchange Commission (SEC) filings and Abbreviated Injury Scale (AIS) injury reports--with Isaac Sim's detailed collision and cargo dynamics, CostNav transcends simple task completion to accurately evaluate business value in complex, real-world scenarios. To our knowledge, CostNav is the first physics-grounded economic benchmark that uses industry-standard regulatory and financial data to quantitatively expose the gap between navigation research metrics and commercial viability, revealing that optimizing for task success on a simplified task fundamentally differs from optimizing for real-world economic deployment. Evaluating seven baselines--two rule-based and five imitation learning--we find that no current method is economically viable, all yielding negative contribution margins. The best-performing method, CANVAS (-27.36\$/run), equipped with only an RGB camera and GPS, outperforms LiDAR-equipped Nav2 w/ GPS (-35.46\$/run). We challenge the community to develop navigation policies that achieve economic viability on CostNav. We remain method-agnostic, evaluating success solely on cost rather than the underlying architecture. All resources are available at https://github.com/worv-ai/CostNav.

Penulis (23)

H

Haebin Seong

S

Sungmin Kim

Y

Yongjun Cho

M

Myunchul Joe

G

Geunwoo Kim

Y

Yubeen Park

S

Sunhoo Kim

Y

Yoonshik Kim

S

Suhwan Choi

J

Jaeyoon Jung

J

Jiyong Youn

J

Jinmyung Kwak

S

Sunghee Ahn

J

Jaemin Lee

Y

Younggil Do

S

Seungyeop Yi

W

Woojin Cheong

M

Minhyeok Oh

M

Minchan Kim

S

Seongjae Kang

S

Samwoo Seong

Y

Youngjae Yu

Y

Yunsung Lee

Format Sitasi

Seong, H., Kim, S., Cho, Y., Joe, M., Kim, G., Park, Y. et al. (2025). CostNav: A Navigation Benchmark for Real-World Economic-Cost Evaluation of Physical AI Agents. https://arxiv.org/abs/2511.20216

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓