no code implementations • 21 Dec 2023 • Zhichao Huang, Rong Ye, Tom Ko, Qianqian Dong, Shanbo Cheng, Mingxuan Wang, Hang Li
Given the great success of large language models (LLMs) across various tasks, in this paper, we introduce LLM-ST, a novel and effective speech translation model constructed upon a pre-trained LLM.
1 code implementation • 21 Sep 2023 • Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang
In this study, we present synchronous bilingual Connectionist Temporal Classification (CTC), an innovative framework that leverages dual CTC to bridge the gaps of both modality and language in the speech translation (ST) task.
no code implementations • 20 Jun 2023 • Chen Xu, Rong Ye, Qianqian Dong, Chengqi Zhao, Tom Ko, Mingxuan Wang, Tong Xiao, Jingbo Zhu
Recently, speech-to-text translation has attracted increasing attention, and many studies have emerged rapidly.
no code implementations • 18 Jun 2023 • Kexin Wang, Yunlong Zhao, Qianqian Dong, Tom Ko, Mingxuan Wang
Our framework also surpasses the strong baseline in ranking accuracy on each fine-grained segment.
no code implementations • 5 Jun 2023 • Qianqian Dong, Zhiying Huang, Qiao Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang
For the speech synthesis part, we adopt the existing VALL-E X approach and build a unit-based audio language model.
1 code implementation • 27 May 2023 • Chen Xu, Xiaoqian Liu, Xiaowen Liu, Qingxuan Sun, Yuhao Zhang, Murun Yang, Qianqian Dong, Tom Ko, Mingxuan Wang, Tong Xiao, Anxiang Ma, Jingbo Zhu
Combining end-to-end speech translation (ST) and non-autoregressive (NAR) generation is promising in language and speech processing for its advantages of reduced error propagation and low latency.
no code implementations • 7 Dec 2022 • Xuxin Cheng, Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Yuexian Zou
How to solve the data scarcity problem for end-to-end speech-to-text translation (ST)?
1 code implementation • 18 May 2022 • Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Qibing Bai, Yu Zhang
Direct speech-to-speech translation (S2ST) has recently drawn increasing attention.
1 code implementation • ACL 2022 • Qianqian Dong, Yaoming Zhu, Mingxuan Wang, Lei Li
Given a typically long speech sequence, we develop an efficient monotonic segmentation module inside an encoder-decoder model that accumulates acoustic information incrementally and detects proper speech unit boundaries in the input for the speech translation task.
1 code implementation • ACL (IWSLT) 2021 • Chengqi Zhao, Zhicheng Liu, Jian Tong, Tao Wang, Mingxuan Wang, Rong Ye, Qianqian Dong, Jun Cao, Lei Li
For offline speech translation, our best end-to-end model achieves an 8.1 BLEU improvement over the benchmark on the MuST-C test set and even approaches the results of a strong cascaded solution.
1 code implementation • ACL 2021 • Chengqi Zhao, Mingxuan Wang, Qianqian Dong, Rong Ye, Lei Li
NeurST is an open-source toolkit for neural speech translation.
Ranked #1 on Speech-to-Text Translation on libri-trans
1 code implementation • 21 Sep 2020 • Qianqian Dong, Mingxuan Wang, Hao Zhou, Shuang Xu, Bo Xu, Lei Li
The key idea is to generate source transcript and target translation text with a single decoder.
1 code implementation • 21 Sep 2020 • Qianqian Dong, Rong Ye, Mingxuan Wang, Hao Zhou, Shuang Xu, Bo Xu, Lei Li
Can we build a system to fully utilize signals in a parallel ST corpus?
3 code implementations • COLING 2020 • Liang Xu, Hai Hu, Xuanwei Zhang, Lu Li, Chenjie Cao, Yudong Li, Yechen Xu, Kai Sun, Dian Yu, Cong Yu, Yin Tian, Qianqian Dong, Weitang Liu, Bo Shi, Yiming Cui, Junyi Li, Jun Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu, Zhe Zhao, Qipeng Zhao, Cong Yue, Xinrui Zhang, Zhengliang Yang, Kyle Richardson, Zhenzhong Lan
The advent of natural language understanding (NLU) benchmarks for English, such as GLUE and SuperGLUE, allows new NLU models to be evaluated across a diverse set of tasks.
2 code implementations • 3 Mar 2020 • Liang Xu, Xuanwei Zhang, Qianqian Dong
In this paper, we introduce CLUECorpus2020, a large-scale Chinese corpus from the CLUE organization that can be used directly for self-supervised learning, such as the pre-training of a language model, or for language generation.
3 code implementations • 13 Jan 2020 • Liang Xu, Yu Tong, Qianqian Dong, Yixuan Liao, Cong Yu, Yin Tian, Weitang Liu, Lu Li, Caiquan Liu, Xuanwei Zhang
In this paper, we introduce CLUENER2020, a well-defined, fine-grained dataset from the CLUE organization for named entity recognition in Chinese.
no code implementations • COLING 2018 • Feng Wang, Wei Chen, Zhen Yang, Qianqian Dong, Shuang Xu, Bo Xu
While disfluency detection has achieved notable success in recent years, it still suffers severely from data scarcity.