no code implementations • 24 Sep 2022 • Jie Wang, Yuji Liu, Binling Wang, Yiming Zhi, Song Li, Shipeng Xia, Jiayang Zhang, Feng Tong, Lin Li, Qingyang Hong
This paper describes a spatial-aware speaker diarization system for the multi-channel multi-party meeting.
no code implementations • 11 Feb 2022 • Jie Wang, Yuji Liu, Binling Wang, Yiming Zhi, Song Li1, Shipeng Xia, Jiayang Zhang, Lin Li1, Qingyang Hong, Feng Tong
By performing DMSNet based OSD module, the DER of cluster-based diarization system decrease significantly form 13. 44% to 7. 63%.
no code implementations • 23 Jul 2021 • Binling Wang, Wenxuan Hu, Jing Li, Yiming Zhi, Zheng Li, Qingyang Hong, Lin Li, Dong Wang, Liming Song, Cheng Yang
In addition to the Language Identification (LID) tasks, multilingual Automatic Speech Recognition (ASR) tasks are introduced to OLR 2021 Challenge for the first time.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 5 Jul 2021 • Jing Li, Binling Wang, Yiming Zhi, Zheng Li, Lin Li, Qingyang Hong, Dong Wang
The fifth Oriental Language Recognition (OLR) Challenge focuses on language recognition in a variety of complex environments to promote its development.
no code implementations • 30 Jun 2021 • Dexin Liao, Jing Li, Yiming Zhi, Song Li, Qingyang Hong, Lin Li
For the SV system, we proposed a multi-task learning network, where phonetic branch is trained with the character label of the utterance, and speaker branch is trained with the label of the speaker.