no code implementations • 27 Apr 2024 • Tiantian Feng, Xuan Shi, Rahul Gupta, Shrikanth S. Narayanan
Automatic Speech Understanding (ASU) aims at human-like speech interpretation, providing a nuanced understanding of intent, emotion, sentiment, and content from both the speech signal and the language (text) content it conveys.
no code implementations • 18 Dec 2022 • Tiantian Feng, Rajat Hebbar, Nicholas Mehlman, Xuan Shi, Aditya Kommineni, Shrikanth Narayanan
Speech-centric machine learning systems have revolutionized many leading domains ranging from transportation and healthcare to education and defense, profoundly changing how people live, work, and interact with each other.
1 code implementation • 24 Jul 2021 • Xuan Shi, Erica Cooper, Junichi Yamagishi
Constructing an embedding space for musical instrument sounds that can meaningfully represent new and unseen instruments is important for downstream music generation tasks such as multi-instrument synthesis and timbre transfer.
no code implementations • 18 Apr 2019 • Xingjian Du, Xuan Shi, Risheng Huang
Region-based object detectors achieve state-of-the-art performance, but few of them model the relations among proposals.
no code implementations • 2 Jan 2019 • Xingjian Du, Mengyao Zhu, Xuan Shi, Xinpeng Zhang, Wen Zhang, Jingdong Chen
Experiments comparing our CSM-based end-to-end model with other methods confirm that the CSM accelerates model training and significantly improves speech quality.