no code implementations • 23 Jan 2024 • Md Asif Jalal, Pablo Peso Parada, George Pavlidis, Vasileios Moschopoulos, Karthikeyan Saravanan, Chrysovalantis-Giorgos Kontoulis, Jisi Zhang, Anastasios Drosou, Gil Ho Lee, Jungin Lee, Seokyeong Jung
During training, a list of biasing phrases are selected from a large pool of phrases following a sampling strategy.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 22 Jan 2024 • Jisi Zhang, Vandana Rajan, Haaris Mehmood, David Tuckey, Pablo Peso Parada, Md Asif Jalal, Karthikeyan Saravanan, Gil Ho Lee, Jungin Lee, Seokyeong Jung
On-device Automatic Speech Recognition (ASR) models trained on speech data of a large population might underperform for individuals unseen during training.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 25 Jul 2023 • Md Asif Jalal, Pablo Peso Parada, Jisi Zhang, Karthikeyan Saravanan, Mete Ozay, Myoungji Han, Jung In Lee, Seokyeong Jung
Our paper proposes a privacy-enhancing framework that targets speaker identity anonymization while preserving speech recognition accuracy for our downstream task~-~Automatic Speech Recognition (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 3 May 2022 • Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker
In this paper, we explore an improved framework to train a monoaural neural enhancement model for robust speech recognition.
no code implementations • 15 Jun 2021 • Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker
The proposed method first uses mixtures of unseparated sources and the mixture invariant training (MixIT) criterion to train a teacher model.
no code implementations • 7 Feb 2021 • Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker
In this paper, we present a novel multi-channel speech extraction system to simultaneously extract multiple clean individual sources from a mixture in noisy and reverberant environments.
no code implementations • 11 Nov 2020 • Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker
To reduce the influence of reverberation on spatial feature extraction, a dereverberation pre-processing method has been applied to further improve the separation performance.