Semantic Scholar Open Access 2023 205 sitasi

BioCLIP: A Vision Foundation Model for the Tree of Life

Samuel Stevens Jiaman Wu Matthew J. Thompson Elizabeth G. Campolongo Chan Hee Song +7 lainnya

Abstrak

Images of the natural world, collected by a variety of cameras, from drones to individual phones, are increasingly abundant sources of biological information. There is an ex-plosion of computational methods and tools, particularly computer vision, for extracting biologically relevant information from images for science and conservation. Yet most of these are bespoke approaches designed for a specific task and are not easily adaptable or extendable to new questions, contexts, and datasets. A vision model for general or-ganismal biology questions on images is of timely need. To approach this, we curate and release Tree Of Life-10m, the largest and most diverse ML-ready dataset of biology images. We then develop Bioclip, a foundation model for the tree of life, leveraging the unique properties of bi-ology captured by Treeoflife-10m, namely the abun-dance and variety of images of plants, animals, and fungi, together with the availability of rich structured biological knowledge. We rigorously benchmark our approach on di-verse fine-grained biology classification tasks and find that BloCLIP consistently and substantially outperforms existing baselines (by 16% to 17% absolute). Intrinsic evaluation reveals that BloCLIP has learned a hierarchical representation conforming to the tree of life, shedding light on its strong generalizability.11imageomics.github.io/bioclip has models, data and code.

Topik & Kata Kunci

Penulis (12)

S

Samuel Stevens

J

Jiaman Wu

M

Matthew J. Thompson

E

Elizabeth G. Campolongo

C

Chan Hee Song

D

David E. Carlyn

L

Li Dong

W

W. Dahdul

C

Charles V. Stewart

T

Tanya Y. Berger-Wolf

W

Wei-Lun Chao

Y

Yu Su

Format Sitasi

Stevens, S., Wu, J., Thompson, M.J., Campolongo, E.G., Song, C.H., Carlyn, D.E. et al. (2023). BioCLIP: A Vision Foundation Model for the Tree of Life. https://doi.org/10.1109/CVPR52733.2024.01836

Akses Cepat

Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Total Sitasi
205×
Sumber Database
Semantic Scholar
DOI
10.1109/CVPR52733.2024.01836
Akses
Open Access ✓