arXiv Open Access 2023

Semantics-aware Motion Retargeting with Vision-Language Models

Haodong Zhang ZhiKe Chen Haocheng Xu Lei Hao Xiaofei Wu +4 lainnya
Lihat Sumber

Abstrak

Capturing and preserving motion semantics is essential to motion retargeting between animation characters. However, most of the previous works neglect the semantic information or rely on human-designed joint-level representations. Here, we present a novel Semantics-aware Motion reTargeting (SMT) method with the advantage of vision-language models to extract and maintain meaningful motion semantics. We utilize a differentiable module to render 3D motions. Then the high-level motion semantics are incorporated into the motion retargeting process by feeding the vision-language model with the rendered images and aligning the extracted semantic embeddings. To ensure the preservation of fine-grained motion details and high-level semantics, we adopt a two-stage pipeline consisting of skeleton-aware pre-training and fine-tuning with semantics and geometry constraints. Experimental results show the effectiveness of the proposed method in producing high-quality motion retargeting results while accurately preserving motion semantics.

Topik & Kata Kunci

Penulis (9)

H

Haodong Zhang

Z

ZhiKe Chen

H

Haocheng Xu

L

Lei Hao

X

Xiaofei Wu

S

Songcen Xu

Z

Zhensong Zhang

Y

Yue Wang

R

Rong Xiong

Format Sitasi

Zhang, H., Chen, Z., Xu, H., Hao, L., Wu, X., Xu, S. et al. (2023). Semantics-aware Motion Retargeting with Vision-Language Models. https://arxiv.org/abs/2312.01964

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓