arXiv Open Access 2025

Shape2Animal: Creative Animal Generation from Natural Silhouettes

Quoc-Duy Tran Anh-Tuan Vo Dinh-Khoi Vo Tam V. Nguyen Minh-Triet Tran +1 lainnya

Lihat Sumber

Abstrak

Humans possess a unique ability to perceive meaningful patterns in ambiguous stimuli, a cognitive phenomenon known as pareidolia. This paper introduces Shape2Animal framework to mimics this imaginative capacity by reinterpreting natural object silhouettes, such as clouds, stones, or flames, as plausible animal forms. Our automated framework first performs open-vocabulary segmentation to extract object silhouette and interprets semantically appropriate animal concepts using vision-language models. It then synthesizes an animal image that conforms to the input shape, leveraging text-to-image diffusion model and seamlessly blends it into the original scene to generate visually coherent and spatially consistent compositions. We evaluated Shape2Animal on a diverse set of real-world inputs, demonstrating its robustness and creative potential. Our Shape2Animal can offer new opportunities for visual storytelling, educational content, digital art, and interactive media design. Our project page is here: https://shape2image.github.io

Topik & Kata Kunci

cs.CV

Penulis (6)

Quoc-Duy Tran

Anh-Tuan Vo

Dinh-Khoi Vo

Tam V. Nguyen

Minh-Triet Tran

Trung-Nghia Le

Format Sitasi

APA MLA BibTeX

Tran, Q., Vo, A., Vo, D., Nguyen, T.V., Tran, M., Le, T. (2025). Shape2Animal: Creative Animal Generation from Natural Silhouettes. https://arxiv.org/abs/2506.20616

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓