arXiv Open Access 2024

Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces

Lilac Atassi

Abstract

Recent music generation methods based on transformers have a context window of up to a minute. The music generated by these methods is largely unstructured beyond the context window. With a longer context window, learning long-scale structures from musical data is a prohibitively challenging problem. This paper proposes integrating a text-to-music model with a large language model to generate music with form. The paper discusses solutions to the challenges of such an integration. The experimental results show that the proposed method can generate 2.5-minute-long music that is highly structured, strongly organized, and cohesive.

Topics & Keywords

cs.SD cs.LG eess.AS

Authors (1)

Lilac Atassi

Citation Format

Atassi, L. (2024). Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces. https://arxiv.org/abs/2410.00344

Journal Information

Publication Year: 2024
Language: en
Source Database: arXiv
Access: Open Access ✓