arXiv Open Access 2025

Evaluating Interval-based Tokenization for Pitch Representation in Symbolic Music Analysis

Dinh-Viet-Toan Le Louis Bigo Mikaela Keller
Lihat Sumber

Abstrak

Symbolic music analysis tasks are often performed by models originally developed for Natural Language Processing, such as Transformers. Such models require the input data to be represented as sequences, which is achieved through a process of tokenization. Tokenization strategies for symbolic music often rely on absolute MIDI values to represent pitch information. However, music research largely promotes the benefit of higher-level representations such as melodic contour and harmonic relations for which pitch intervals turn out to be more expressive than absolute pitches. In this work, we introduce a general framework for building interval-based tokenizations. By evaluating these tokenizations on three music analysis tasks, we show that such interval-based tokenizations improve model performances and facilitate their explainability.

Topik & Kata Kunci

Penulis (3)

D

Dinh-Viet-Toan Le

L

Louis Bigo

M

Mikaela Keller

Format Sitasi

Le, D., Bigo, L., Keller, M. (2025). Evaluating Interval-based Tokenization for Pitch Representation in Symbolic Music Analysis. https://arxiv.org/abs/2501.04630

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓