arXiv Open Access 2022

StyleBabel: Artistic Style Tagging and Captioning

Dan Ruta Andrew Gilbert Pranav Aggarwal Naveen Marri Ajinkya Kale +7 lainnya
Lihat Sumber

Abstrak

We present StyleBabel, a unique open access dataset of natural language captions and free-form tags describing the artistic style of over 135K digital artworks, collected via a novel participatory method from experts studying at specialist art and design schools. StyleBabel was collected via an iterative method, inspired by `Grounded Theory': a qualitative approach that enables annotation while co-evolving a shared language for fine-grained artistic style attribute description. We demonstrate several downstream tasks for StyleBabel, adapting the recent ALADIN architecture for fine-grained style similarity, to train cross-modal embeddings for: 1) free-form tag generation; 2) natural language description of artistic style; 3) fine-grained text search of style. To do so, we extend ALADIN with recent advances in Visual Transformer (ViT) and cross-modal representation learning, achieving a state of the art accuracy in fine-grained style retrieval.

Topik & Kata Kunci

Penulis (12)

D

Dan Ruta

A

Andrew Gilbert

P

Pranav Aggarwal

N

Naveen Marri

A

Ajinkya Kale

J

Jo Briggs

C

Chris Speed

H

Hailin Jin

B

Baldo Faieta

A

Alex Filipkowski

Z

Zhe Lin

J

John Collomosse

Format Sitasi

Ruta, D., Gilbert, A., Aggarwal, P., Marri, N., Kale, A., Briggs, J. et al. (2022). StyleBabel: Artistic Style Tagging and Captioning. https://arxiv.org/abs/2203.05321

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2022
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓