arXiv Open Access 2024

Language Models for Music Medicine Generation

Emmanouil Nikolakakis Joann Ching Emmanouil Karystinaios Gabrielle Sipin Gerhard Widmer +1 lainnya

Lihat Sumber

Abstrak

Music therapy has been shown in recent years to provide multiple health benefits related to emotional wellness. In turn, maintaining a healthy emotional state has proven to be effective for patients undergoing treatment, such as Parkinson's patients or patients suffering from stress and anxiety. We propose fine-tuning MusicGen, a music-generating transformer model, to create short musical clips that assist patients in transitioning from negative to desired emotional states. Using low-rank decomposition fine-tuning on the MTG-Jamendo Dataset with emotion tags, we generate 30-second clips that adhere to the iso principle, guiding patients through intermediate states in the valence-arousal circumplex. The generated music is evaluated using a music emotion recognition model to ensure alignment with intended emotions. By concatenating these clips, we produce a 15-minute "music medicine" resembling a music therapy session. Our approach is the first model to leverage Language Models to generate music medicine. Ultimately, the output is intended to be used as a temporary relief between music therapy sessions with a board-certified therapist.

Topik & Kata Kunci

cs.SD eess.AS

Penulis (6)

Emmanouil Nikolakakis

Joann Ching

Emmanouil Karystinaios

Gabrielle Sipin

Gerhard Widmer

Razvan Marinescu

Format Sitasi

APA MLA BibTeX

Nikolakakis, E., Ching, J., Karystinaios, E., Sipin, G., Widmer, G., Marinescu, R. (2024). Language Models for Music Medicine Generation. https://arxiv.org/abs/2411.09080

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2024
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓