MUSAN: A Music, Speech, and Noise Corpus

28 Oct 2015 · David Snyder, Guoguo Chen, Daniel Povey

This report introduces a new corpus of music, speech, and noise. The dataset is suitable for training models for voice activity detection (VAD) and music/speech discrimination. The corpus is released under a flexible Creative Commons license. It consists of music from several genres, speech from twelve languages, and a wide assortment of technical and non-technical noises. We demonstrate the use of this corpus for music/speech discrimination on broadcast news and for VAD in speaker identification.

Code

Jasson-Chen/Add_noise_and_rir_to_sp…

JerryPeng21cuhk/prj_broadcast

Datasets

Introduced in the Paper:

MUSAN
