no code implementations • 3 Apr 2021 • Lujun Li, Yikai Kang, Yuchen Shi, Ludwig Kürzinger, Tobias Watzel, Gerhard Rigoll
Inspired by the extensive applications of generative adversarial networks (GANs) in speech enhancement and ASR tasks, we propose an adversarial joint training framework with a self-attention mechanism to boost the noise robustness of the ASR system.
Automatic Speech Recognition • Automatic Speech Recognition (ASR)
11 code implementations • 17 Jul 2020 • Ludwig Kürzinger, Dominik Winkelbauer, Lujun Li, Tobias Watzel, Gerhard Rigoll
In this work, we combine freely available corpora for German speech recognition, including as yet unlabeled speech data, into a large dataset of over 1700 h of speech data.
Ranked #5 on Speech Recognition on TUDA (using extra training data)
Speech Recognition • Audio and Speech Processing
no code implementations • 15 Jun 2020 • Tobias Watzel, Ludwig Kürzinger, Lujun Li, Gerhard Rigoll
Attention models are currently among the popular candidates for speech recognition.