arXiv Open Access 2025

PersonaBooth: Personalized Text-to-Motion Generation

Boeun Kim Hea In Jeong JungHoon Sung Yihua Cheng Jeongmin Lee +6 lainnya

Lihat Sumber

Abstrak

This paper introduces Motion Personalization, a new task that generates personalized motions aligned with text descriptions using several basic motions containing Persona. To support this novel task, we introduce a new large-scale motion dataset called PerMo (PersonaMotion), which captures the unique personas of multiple actors. We also propose a multi-modal finetuning method of a pretrained motion diffusion model called PersonaBooth. PersonaBooth addresses two main challenges: i) A significant distribution gap between the persona-focused PerMo dataset and the pretraining datasets, which lack persona-specific data, and ii) the difficulty of capturing a consistent persona from the motions vary in content (action type). To tackle the dataset distribution gap, we introduce a persona token to accept new persona features and perform multi-modal adaptation for both text and visuals during finetuning. To capture a consistent persona, we incorporate a contrastive learning technique to enhance intra-cohesion among samples with the same persona. Furthermore, we introduce a context-aware fusion mechanism to maximize the integration of persona cues from multiple input motions. PersonaBooth outperforms state-of-the-art motion style transfer methods, establishing a new benchmark for motion personalization.

Topik & Kata Kunci

cs.CV

Penulis (11)

Boeun Kim

Hea In Jeong

JungHoon Sung

Yihua Cheng

Jeongmin Lee

Ju Yong Chang

Sang-Il Choi

Younggeun Choi

Saim Shin

Jungho Kim

Hyung Jin Chang

Format Sitasi

APA MLA BibTeX

Kim, B., Jeong, H.I., Sung, J., Cheng, Y., Lee, J., Chang, J.Y. et al. (2025). PersonaBooth: Personalized Text-to-Motion Generation. https://arxiv.org/abs/2503.07390

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓