arXiv Open Access 2024

Does Spatial Cognition Emerge in Frontier Models?

Santhosh Kumar Ramakrishnan, Erik Wijmans, Philipp Kraehenbuehl, Vladlen Koltun

Abstract

Not yet. We present SPACE, a benchmark that systematically evaluates spatial cognition in frontier models. Our benchmark builds on decades of research in cognitive science. It evaluates large-scale mapping abilities that are brought to bear when an organism traverses physical environments, smaller-scale reasoning about object shapes and layouts, and cognitive infrastructure such as spatial attention and memory. For many tasks, we instantiate parallel presentations via text and images, allowing us to benchmark both large language models and large multimodal models. Results suggest that contemporary frontier models fall short of the spatial intelligence of animals, performing near chance level on a number of classic tests of animal cognition. Code and data are available: https://github.com/apple/ml-space-benchmark

Topics & Keywords

cs.AI cs.CV cs.LG

Authors (4)

Santhosh Kumar Ramakrishnan

Erik Wijmans

Philipp Kraehenbuehl

Vladlen Koltun

Citation Format

Ramakrishnan, S. K., Wijmans, E., Kraehenbuehl, P., & Koltun, V. (2024). Does Spatial Cognition Emerge in Frontier Models? https://arxiv.org/abs/2410.06468

Journal Information

Year Published: 2024
Language: en
Database Source: arXiv
Access: Open Access ✓