Search Results for author: Xinyuan Zhou

Found 5 papers, 1 papers with code

The USTC-NELSLIP Offline Speech Translation Systems for IWSLT 2022

no code implementations • IWSLT (ACL) 2022 • Weitai Zhang, Zhongyi Ye, Haitao Tang, Xiaoxi Li, Xinyuan Zhou, Jing Yang, Jianwei Cui, Dan Liu, Junhua Liu, LiRong Dai

This paper describes USTC-NELSLIP’s submissions to the IWSLT 2022 Offline Speech Translation task, including speech translation of talks from English to German, English to Chinese and English to Japanese.

Translation

Paper
Add Code

DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation

no code implementations • 26 Oct 2023 • Yongxin Zhu, Zhujin Gao, Xinyuan Zhou, Zhongyi Ye, Linli Xu

While Diffusion Generative Models have achieved great success on image generation tasks, how to efficiently and effectively incorporate them into speech generation especially translation tasks remains a non-trivial problem.

Image Generation Speech-to-Speech Translation +1

Paper
Add Code

Data-Centric Financial Large Language Models

no code implementations • 7 Oct 2023 • Zhixuan Chu, Huaiyu Guo, Xinyuan Zhou, Yijia Wang, Fei Yu, Hong Chen, Wanqing Xu, Xin Lu, Qing Cui, Longfei Li, Jun Zhou, Sheng Li

Large language models (LLMs) show promise for natural language tasks but struggle when applied directly to complex domains like finance.

Paper
Add Code

CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier

no code implementations • 26 Mar 2021 • Tiantian Tang, Xinyuan Zhou, Yanhua Long, Yijie Li, Jiaen Liang

Domain mismatch is a noteworthy issue in acoustic event detection tasks, as the target domain data is difficult to access in most real applications.

Event Detection

Paper
Add Code

Multi-channel target speech extraction with channel decorrelation and target speaker adaptation

1 code implementation • 19 Oct 2020 • Jiangyu Han, Xinyuan Zhou, Yanhua Long, Yijie Li

In this work, we propose two methods for exploiting the multi-channel spatial information to extract the target speech.

Speech Extraction Audio and Speech Processing

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.