arXiv Open Access 2024

Harmonizing Pixels and Melodies: Maestro-Guided Film Score Generation and Composition Style Transfer

F. Qi L. Ni C. Xu

Abstract

We introduce a film score generation framework that harmonizes visual pixels and music melodies using a latent diffusion model. The framework takes film clips as input and generates music that aligns with a general theme, while also allowing outputs to be tailored to a specific composition style. The model produces music directly from video through a streamlined, efficient tuning mechanism on ControlNet, and integrates a film encoder designed to capture the film's semantic depth, emotional impact, and aesthetic appeal. We also introduce a novel, simple yet effective metric for evaluating the originality and recognizability of music within film scores. To fill the data gap for film scores, we curate a comprehensive dataset of film videos and legendary original scores, injecting domain-specific knowledge into our data-driven generation model. Our model outperforms existing methods in creating film scores and can generate music that reflects the guidance of a maestro's style, setting a new benchmark for automated film scoring and laying groundwork for future research in this domain. The code and generated samples are available at https://anonymous.4open.science/r/HPM.

Citation Format

Qi, F., Ni, L., Xu, C. (2024). Harmonizing Pixels and Melodies: Maestro-Guided Film Score Generation and Composition Style Transfer. https://arxiv.org/abs/2411.07539

Journal Information

Publication year: 2024
Language: en
Source database: arXiv
Access: Open Access