arXiv Open Access 2025

IC-Custom: Diverse Image Customization via In-Context Learning

Yaowei Li Xiaoyu Li Zhaoyang Zhang Yuxuan Bian Gan Liu +9 lainnya
Lihat Sumber

Abstrak

Image customization, a crucial technique for industrial media production, aims to generate content that is consistent with reference images. However, current approaches conventionally separate image customization into position-aware and position-free customization paradigms and lack a universal framework for diverse customization, limiting their applications across various scenarios. To overcome these limitations, we propose IC-Custom, a unified framework that seamlessly integrates position-aware and position-free image customization through in-context learning. IC-Custom concatenates reference images with target images to a polyptych, leveraging DiT's multi-modal attention mechanism for fine-grained token-level interactions. We propose the In-context Multi-Modal Attention (ICMA) mechanism, which employs learnable task-oriented register tokens and boundary-aware positional embeddings to enable the model to effectively handle diverse tasks and distinguish between inputs in polyptych configurations. To address the data gap, we curated a 12K identity-consistent dataset with 8K real-world and 4K high-quality synthetic samples, avoiding the overly glossy, oversaturated look typical of synthetic data. IC-Custom supports various industrial applications, including try-on, image insertion, and creative IP customization. Extensive evaluations on our proposed ProductBench and the publicly available DreamBench demonstrate that IC-Custom significantly outperforms community workflows, closed-source models, and state-of-the-art open-source approaches. IC-Custom achieves about 73\% higher human preference across identity consistency, harmony, and text alignment metrics, while training only 0.4\% of the original model parameters. Project page: https://liyaowei-stu.github.io/project/IC_Custom

Topik & Kata Kunci

Penulis (14)

Y

Yaowei Li

X

Xiaoyu Li

Z

Zhaoyang Zhang

Y

Yuxuan Bian

G

Gan Liu

X

Xinyuan Li

J

Jiale Xu

W

Wenbo Hu

Y

Yating Liu

L

Lingen Li

J

Jing Cai

Y

Yuexian Zou

Y

Yancheng He

Y

Ying Shan

Format Sitasi

Li, Y., Li, X., Zhang, Z., Bian, Y., Liu, G., Li, X. et al. (2025). IC-Custom: Diverse Image Customization via In-Context Learning. https://arxiv.org/abs/2507.01926

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓