Semantic Scholar Open Access 2015 1599 sitasi

MUSAN: A Music, Speech, and Noise Corpus

David Snyder Guoguo Chen Daniel Povey

Abstrak

This report introduces a new corpus of music, speech, and noise. This dataset is suitable for training models for voice activity detection (VAD) and music/speech discrimination. Our corpus is released under a flexible Creative Commons license. The dataset consists of music from several genres, speech from twelve languages, and a wide assortment of technical and non-technical noises. We demonstrate use of this corpus for music/speech discrimination on Broadcast news and VAD for speaker identification.

Topik & Kata Kunci

Penulis (3)

D

David Snyder

G

Guoguo Chen

D

Daniel Povey

Format Sitasi

Snyder, D., Chen, G., Povey, D. (2015). MUSAN: A Music, Speech, and Noise Corpus. https://www.semanticscholar.org/paper/32d21dc13f8770958b196a96f99a6f3959c7dc0f

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2015
Bahasa
en
Total Sitasi
1599×
Sumber Database
Semantic Scholar
Akses
Open Access ✓