arXiv Open Access 2025

Zero-Shot Dynamic Concept Personalization with Grid-Based LoRA

Rameen Abdal, Or Patashnik, Ekaterina Deyneka, Hao Chen, Aliaksandr Siarohin, +3 others

Abstract

Recent advances in text-to-video generation have enabled high-quality synthesis from text and image prompts. While the personalization of dynamic concepts, which capture subject-specific appearance and motion from a single video, is now feasible, most existing methods require per-instance fine-tuning, limiting scalability. We introduce a fully zero-shot framework for dynamic concept personalization in text-to-video models. Our method leverages structured 2x2 video grids that spatially organize input and output pairs, enabling the training of lightweight Grid-LoRA adapters for editing and composition within these grids. At inference, a dedicated Grid Fill module completes partially observed layouts, producing temporally coherent and identity-preserving outputs. Once trained, the entire system operates in a single forward pass, generalizing to previously unseen dynamic concepts without any test-time optimization. Extensive experiments demonstrate high-quality and consistent results across a wide range of subjects beyond trained concepts and editing scenarios.
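
To make the 2x2 grid idea concrete, the following is a minimal sketch of how four clips might be tiled into one grid video and how a partially observed layout could be marked for completion. It is an illustration only: the abstract does not specify the cell ordering, the tensor layout, or how missing cells are represented, so the `(T, H, W, C)` shape, the zero-filling of missing cells, and the helper names `make_grid` and `fill_mask` are all assumptions, not the authors' implementation.

```python
import numpy as np

def make_grid(cells):
    """Tile four clips of shape (T, H, W, C) into one (T, 2H, 2W, C) clip.

    Assumed cell order: [top-left, top-right, bottom-left, bottom-right].
    """
    if len(cells) != 4:
        raise ValueError("expected exactly four clips")
    if len({c.shape for c in cells}) != 1:
        raise ValueError("all clips must share the same shape")
    top = np.concatenate([cells[0], cells[1]], axis=2)     # join along width
    bottom = np.concatenate([cells[2], cells[3]], axis=2)  # join along width
    return np.concatenate([top, bottom], axis=1)           # join along height

def fill_mask(cell_shape, missing):
    """Boolean mask over the grid, True where a cell still has to be generated.

    `missing` holds cell indices 0..3 in the same order as `make_grid`.
    """
    t, h, w, _ = cell_shape
    mask = np.zeros((t, 2 * h, 2 * w), dtype=bool)
    for idx in missing:
        row, col = divmod(idx, 2)
        mask[:, row * h:(row + 1) * h, col * w:(col + 1) * w] = True
    return mask

# Toy data: four constant-valued clips so each cell is easy to identify.
clips = [np.full((4, 8, 8, 3), i, dtype=np.float32) for i in range(4)]
grid = make_grid(clips)

# Pretend the bottom row is unknown and must be completed by a generator.
mask = fill_mask(clips[0].shape, missing=[2, 3])
observed = np.where(mask[..., None], 0.0, grid)  # zero out the unknown cells
```

In a setup like this, `observed` and `mask` would be the conditioning inputs to whatever model fills in the masked cells; the generative model itself is outside the scope of the sketch.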

Topics & Keywords

cs.GR cs.CV cs.LG

Authors (8)

Rameen Abdal

Or Patashnik

Ekaterina Deyneka

Hao Chen

Aliaksandr Siarohin

Sergey Tulyakov

Daniel Cohen-Or

Kfir Aberman

Citation Format

Abdal, R., Patashnik, O., Deyneka, E., Chen, H., Siarohin, A., Tulyakov, S. et al. (2025). Zero-Shot Dynamic Concept Personalization with Grid-Based LoRA. https://arxiv.org/abs/2507.17963

Journal Information

Year Published: 2025
Language: en
Source Database: arXiv
Access: Open Access ✓