Semantic Scholar Open Access 2015 1599 citations

MUSAN: A Music, Speech, and Noise Corpus

David Snyder, Guoguo Chen, Daniel Povey

Abstract

This report introduces a new corpus of music, speech, and noise. This dataset is suitable for training models for voice activity detection (VAD) and music/speech discrimination. Our corpus is released under a flexible Creative Commons license. The dataset consists of music from several genres, speech from twelve languages, and a wide assortment of technical and non-technical noises. We demonstrate use of this corpus for music/speech discrimination on broadcast news and VAD for speaker identification.

Topics & Keywords

Computer Science

Authors (3)

David Snyder

Guoguo Chen

Daniel Povey

Citation Format

Snyder, D., Chen, G., & Povey, D. (2015). MUSAN: A Music, Speech, and Noise Corpus. https://www.semanticscholar.org/paper/32d21dc13f8770958b196a96f99a6f3959c7dc0f

Journal Information

Publication Year: 2015
Language: en
Total Citations: 1599
Source Database: Semantic Scholar
Access: Open Access