no code implementations • 9 Oct 2023 • Ying Shi, Dong Wang, Lantian Li, Jiqing Han
This paper investigates the possibility of extracting a target sentence from multi-talker speech using only a keyword as input.
no code implementations • 28 May 2023 • Ying Shi, Dong Wang, Lantian Li, Jiqing Han, Shi Yin
We propose a novel Mix Training (MT) strategy that encourages the model to discover low-energy keywords from noisy and mixed speech.
1 code implementation • 5 May 2023 • Jian Guan, Youde Liu, Qiaoxi Zhu, Tieran Zheng, Jiqing Han, Wenwu Wang
This paper presents Time-Weighted Frequency Domain Representation (TWFR) with the GMM method (TWFR-GMM) for anomalous sound detection.
no code implementations • 27 Feb 2023 • Dekai Sun, Yancheng He, Jiqing Han
For the difficulty of multimodal fusion, we use a K-layer multi-head attention mechanism as a downstream fusion module.
no code implementations • 8 Apr 2022 • Longshen Ou, Ziyi Guo, Emmanouil Benetos, Jiqing Han, Ye Wang
Most recent research about automatic music transcription (AMT) uses convolutional neural networks and recurrent neural networks to model the mapping from music signals to symbolic notation.
no code implementations • 4 Nov 2020 • Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han
Recently, speech enhancement (SE) based on deep speech prior has attracted much attention, such as the variational auto-encoder with non-negative matrix factorization (VAE-NMF) architecture.
1 code implementation • 6 Aug 2020 • Ziqiang Shi, Rujie Liu, Jiqing Han
We have open sourced our re-implementation of the DPRNN-TasNet here (https://github. com/ShiZiqiang/dual-path-RNNs-DPRNNs-based-speech-separation), and our TasTas is realized based on this implementation of DPRNN-TasNet, it is believed that the results in this paper can be reproduced with ease.
1 code implementation • 23 Jan 2020 • Ziqiang Shi, Rujie Liu, Jiqing Han
We have open-sourced our re-implementation of the DPRNN-TasNet in https://github. com/ShiZiqiang/dual-path-RNNs-DPRNNs-based-speech-separation, and our `La Furca' is realized based on this implementation of DPRNN-TasNet, it is believed that the results in this paper can be smoothly reproduced.
Sound Audio and Speech Processing
no code implementations • 17 Apr 2019 • Jiabin Xue, Jiqing Han, Tieran Zheng, Xiang Gao, Jiaxing Guo
On the one hand, we constrain the new parameters not to deviate too far from the original parameters and punish the new system when forgetting original knowledge.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 17 Apr 2019 • Jiabin Xue, Jiqing Han, Tieran Zheng, Jiaxing Guo, Boyong Wu
Thus, the parameters are more influenced by the training samples with a big propagation error than the samples with a small one.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 10 Apr 2019 • Hongwei Song, Jiqing Han, Shiwen Deng, Zhihao Du
In this paper, we propose a new strategy for acoustic scene classification (ASC) , namely recognizing acoustic scenes through identifying distinct sound events.
no code implementations • 12 Feb 2019 • Ziqiang Shi, Huibin Lin, Liu Liu, Rujie Liu, Jiqing Han, Anyan Shi
Deep dilated temporal convolutional networks (TCN) have been proved to be very effective in sequence modeling.
Sound Audio and Speech Processing