arXiv Open Access 2024

Artistic Intelligence: A Diffusion-Based Framework for High-Fidelity Landscape Painting Synthesis

Wanggong Yang Yifei Zhao
Lihat Sumber

Abstrak

Generating high-fidelity landscape paintings remains a challenging task that requires precise control over both structure and style. In this paper, we present LPGen, a novel diffusion-based model specifically designed for landscape painting generation. LPGen introduces a decoupled cross-attention mechanism that independently processes structural and stylistic features, effectively mimicking the layered approach of traditional painting techniques. Additionally, LPGen proposes a structural controller, a multi-scale encoder designed to control the layout of landscape paintings, striking a balance between aesthetics and composition. Besides, the model is pre-trained on a curated dataset of high-resolution landscape images, categorized by distinct artistic styles, and then fine-tuned to ensure detailed and consistent output. Through extensive evaluations, LPGen demonstrates superior performance in producing paintings that are not only structurally accurate but also stylistically coherent, surpassing current state-of-the-art models. This work advances AI-generated art and offers new avenues for exploring the intersection of technology and traditional artistic practices. Our code, dataset, and model weights will be publicly available.

Topik & Kata Kunci

Penulis (2)

W

Wanggong Yang

Y

Yifei Zhao

Format Sitasi

Yang, W., Zhao, Y. (2024). Artistic Intelligence: A Diffusion-Based Framework for High-Fidelity Landscape Painting Synthesis. https://arxiv.org/abs/2407.17229

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓