no code implementations • 14 May 2022 • Tanel Alumäe, Kunnar Kukk
For the unconstrained task, we relied on both externally available pretrained models as well as external data: the multilingual XLSR-53 wav2vec2. 0 model was finetuned on the VoxLingua107 corpus for the language recognition task, and finally finetuned on the provided target language training data, augmented with CommonVoice data.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • 31 Mar 2022 • Kunnar Kukk, Tanel Alumäe
Language identification from speech is a common preprocessing step in many spoken language processing systems.