Semantic Scholar Open Access 2016 226 sitasi

Learning Features of Music from Scratch

John Thickstun Z. Harchaoui S. Kakade

Abstrak

This paper introduces a new large-scale music dataset, MusicNet, to serve as a source of supervision and evaluation of machine learning methods for music research. MusicNet consists of hundreds of freely-licensed classical music recordings by 10 composers, written for 11 instruments, together with instrument/note annotations resulting in over 1 million temporal labels on 34 hours of chamber music performances under various studio and microphone conditions. The paper defines a multi-label classification task to predict notes in musical recordings, along with an evaluation protocol, and benchmarks several machine learning architectures for this task: i) learning from spectrogram features; ii) end-to-end learning with a neural net; iii) end-to-end learning with a convolutional neural net. These experiments show that end-to-end models trained for note prediction learn frequency selective filters as a low-level representation of audio.

Penulis (3)

J

John Thickstun

Z

Z. Harchaoui

S

S. Kakade

Format Sitasi

Thickstun, J., Harchaoui, Z., Kakade, S. (2016). Learning Features of Music from Scratch. https://www.semanticscholar.org/paper/f6154535699c65633243c482d2b97d4b66036633

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2016
Bahasa
en
Total Sitasi
226×
Sumber Database
Semantic Scholar
Akses
Open Access ✓