arXiv Open Access 2025

PersonaBooth: Personalized Text-to-Motion Generation

Boeun Kim Hea In Jeong JungHoon Sung Yihua Cheng Jeongmin Lee +6 lainnya
Lihat Sumber

Abstrak

This paper introduces Motion Personalization, a new task that generates personalized motions aligned with text descriptions using several basic motions containing Persona. To support this novel task, we introduce a new large-scale motion dataset called PerMo (PersonaMotion), which captures the unique personas of multiple actors. We also propose a multi-modal finetuning method of a pretrained motion diffusion model called PersonaBooth. PersonaBooth addresses two main challenges: i) A significant distribution gap between the persona-focused PerMo dataset and the pretraining datasets, which lack persona-specific data, and ii) the difficulty of capturing a consistent persona from the motions vary in content (action type). To tackle the dataset distribution gap, we introduce a persona token to accept new persona features and perform multi-modal adaptation for both text and visuals during finetuning. To capture a consistent persona, we incorporate a contrastive learning technique to enhance intra-cohesion among samples with the same persona. Furthermore, we introduce a context-aware fusion mechanism to maximize the integration of persona cues from multiple input motions. PersonaBooth outperforms state-of-the-art motion style transfer methods, establishing a new benchmark for motion personalization.

Topik & Kata Kunci

Penulis (11)

B

Boeun Kim

H

Hea In Jeong

J

JungHoon Sung

Y

Yihua Cheng

J

Jeongmin Lee

J

Ju Yong Chang

S

Sang-Il Choi

Y

Younggeun Choi

S

Saim Shin

J

Jungho Kim

H

Hyung Jin Chang

Format Sitasi

Kim, B., Jeong, H.I., Sung, J., Cheng, Y., Lee, J., Chang, J.Y. et al. (2025). PersonaBooth: Personalized Text-to-Motion Generation. https://arxiv.org/abs/2503.07390

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓