no code implementations • 28 Jun 2022 • Amber Afshan, Abeer Alwan
However, self-attentive embeddings perform weighted pooling such that the weights correspond to the importance of the frames in a speaker classification task.
no code implementations • 28 Jun 2022 • Amber Afshan, Abeer Alwan
On the SITW evaluation tasks, which involve different conversational speech tasks, the proposed loss combined with self-attention conditioning yields significant relative improvements of 2-5% in EER and 6-12% in minDCF over the baseline.
no code implementations • 30 Jun 2021 • Amber Afshan, Kshitiz Kumar, Jian Wu
We propose a cost-effective method of using CC scores to select an optimal adaptation data set, where we maximize ASR gains from minimal data.
Automatic Speech Recognition (ASR)
no code implementations • 12 Feb 2021 • Ruchao Fan, Amber Afshan, Abeer Alwan
We present a bidirectional unsupervised model pre-training (UPT) method and apply it to children's automatic speech recognition (ASR).
Automatic Speech Recognition (ASR)
no code implementations • 8 Aug 2020 • Amber Afshan, Jinxi Guo, Soo Jin Park, Vijay Ravi, Alan McCree, Abeer Alwan
For instance, when enrolled with conversation utterances, the EER increased to 3.03%, 2.96% and 22.12% when tested on read, narrative, and pet-directed speech, respectively.
no code implementations • 8 Aug 2020 • Amber Afshan, Jody Kreiman, Abeer Alwan
Native listeners performed better than machines in the style-matched conditions (EERs of 6.96% versus 14.35% for read speech, and 15.12% versus 19.87% for conversations), but for style-mismatched conditions, there was no significant difference between native listeners and machines.
no code implementations • 8 Aug 2020 • Vijay Ravi, Ruchao Fan, Amber Afshan, Huanhua Lu, Abeer Alwan
A fusion of the x-vector/PLDA baseline and the SID/PLDA scores prior to PID fusion further improved performance by 15%, indicating that the proposed approach is complementary to the x-vector system.