Semantic Scholar
Open Access
2015
1599 sitasi
MUSAN: A Music, Speech, and Noise Corpus
David Snyder
Guoguo Chen
Daniel Povey
Abstrak
This report introduces a new corpus of music, speech, and noise. This dataset is suitable for training models for voice activity detection (VAD) and music/speech discrimination. Our corpus is released under a flexible Creative Commons license. The dataset consists of music from several genres, speech from twelve languages, and a wide assortment of technical and non-technical noises. We demonstrate use of this corpus for music/speech discrimination on Broadcast news and VAD for speaker identification.
Topik & Kata Kunci
Penulis (3)
D
David Snyder
G
Guoguo Chen
D
Daniel Povey
Akses Cepat
PDF tidak tersedia langsung
Cek di sumber asli →Informasi Jurnal
- Tahun Terbit
- 2015
- Bahasa
- en
- Total Sitasi
- 1599×
- Sumber Database
- Semantic Scholar
- Akses
- Open Access ✓