arXiv Open Access 2025

DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes

Junlin Guo James R. Zimmer-Dauphinee Jordan M. Nieusma Siqi Lu Quan Liu +14 lainnya
Lihat Sumber

Abstrak

By mapping sites at large scales using remotely sensed data, archaeologists can generate unique insights into long-term demographic trends, inter-regional social networks, and past adaptations to climate change. Remote sensing surveys complement field-based approaches, and their reach can be especially great when combined with deep learning and computer vision techniques. However, conventional supervised deep learning methods face challenges in annotating fine-grained archaeological features at scale. While recent vision foundation models have shown remarkable success in learning large-scale remote sensing data with minimal annotations, most off-the-shelf solutions are designed for RGB images rather than multi-spectral satellite imagery, such as the 8-band data used in our study. In this paper, we introduce DeepAndes, a transformer-based vision foundation model trained on three million multi-spectral satellite images, specifically tailored for Andean archaeology. DeepAndes incorporates a customized DINOv2 self-supervised learning algorithm optimized for 8-band multi-spectral imagery, marking the first foundation model designed explicitly for the Andes region. We evaluate its image understanding performance through imbalanced image classification, image instance retrieval, and pixel-level semantic segmentation tasks. Our experiments show that DeepAndes achieves superior F1 scores, mean average precision, and Dice scores in few-shot learning scenarios, significantly outperforming models trained from scratch or pre-trained on smaller datasets. This underscores the effectiveness of large-scale self-supervised pre-training in archaeological remote sensing. Codes will be available on https://github.com/geopacha/DeepAndes.

Topik & Kata Kunci

Penulis (19)

J

Junlin Guo

J

James R. Zimmer-Dauphinee

J

Jordan M. Nieusma

S

Siqi Lu

Q

Quan Liu

R

Ruining Deng

C

Can Cui

J

Jialin Yue

Y

Yizhe Lin

T

Tianyuan Yao

J

Juming Xiong

J

Junchao Zhu

C

Chongyu Qu

Y

Yuechen Yang

M

Mitchell Wilkes

X

Xiao Wang

P

Parker VanValkenburgh

S

Steven A. Wernke

Y

Yuankai Huo

Format Sitasi

Guo, J., Zimmer-Dauphinee, J.R., Nieusma, J.M., Lu, S., Liu, Q., Deng, R. et al. (2025). DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes. https://arxiv.org/abs/2504.20303

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓