no code implementations • 22 May 2024 • Senmao Tian, Haoyu Gao, Gangyi Hong, Shuyun Wang, JingJie Wang, Xin Yu, Shunli Zhang
Minor variations in silhouette sequences can be diminished in the network's intermediate layers due to the accumulation of quantization errors.
no code implementations • 22 Sep 2023 • Haoyu Gao, Ting-En Lin, Hangyu Li, Min Yang, Yuchuan Wu, Wentao Ma, Yongbin Li
Task-oriented dialogue (TOD) systems help users carry out various activities via multi-turn dialogues, but Large Language Models (LLMs) often struggle to comprehend these intricate contexts.
no code implementations • NeurIPS 2023 • Shuzheng Si, Wentao Ma, Haoyu Gao, Yuchuan Wu, Ting-En Lin, Yinpei Dai, Hangyu Li, Rui Yan, Fei Huang, Yongbin Li
SpokenWOZ further incorporates common spoken characteristics such as word-by-word processing and reasoning in spoken language.
1 code implementation • 19 May 2023 • Tianshu Yu, Haoyu Gao, Ting-En Lin, Min Yang, Yuchuan Wu, Wentao Ma, Chao Wang, Fei Huang, Yongbin Li
In this paper, we propose Speech-text dialog Pre-training for spoken dialog understanding with ExpliCiT cRoss-Modal Alignment (SPECTRA), which is the first-ever speech-text dialog pre-training model.
Ranked #2 on Multimodal Sentiment Analysis on MOSI
Emotion Recognition in Conversation, Multimodal Intent Recognition, +1
1 code implementation • 4 May 2023 • Haoyu Gao, Rui Wang, Ting-En Lin, Yuchuan Wu, Min Yang, Fei Huang, Yongbin Li
Dialogue Topic Segmentation (DTS) plays an essential role in a variety of dialogue modeling tasks.