no code implementations • ICCV 2023 • Tao Tu, Shun-Po Chuang, Yu-Lun Liu, Cheng Sun, Ke Zhang, Donna Roy, Cheng-Hao Kuo, Min Sun
The results demonstrate that ImGeoNet outperforms the current state-of-the-art multi-view image-based method, ImVoxelNet, on all three datasets in terms of detection accuracy.
Ranked #24 on 3D Object Detection on ScanNetV2
1 code implementation • 30 Nov 2022 • Dongji Gao, Jiatong Shi, Shun-Po Chuang, Leibny Paola Garcia, Hung-Yi Lee, Shinji Watanabe, Sanjeev Khudanpur
This paper describes the ESPnet Unsupervised ASR Open-source Toolkit (EURO), an end-to-end open-source toolkit for unsupervised automatic speech recognition (UASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 29 Jul 2022 • Da-Rong Liu, Po-chun Hsu, Yi-Chen Chen, Sung-Feng Huang, Shun-Po Chuang, Da-Yi Wu, Hung-Yi Lee
GAN training is adopted in the first stage to find the mapping relationship between unpaired speech and phone sequence.
1 code implementation • IWSLT (ACL) 2022 • Chih-Chiang Chang, Shun-Po Chuang, Hung-Yi Lee
Existing methods increase latency or introduce adaptive read-write policies for SimulMT models to handle local reordering and improve translation quality.
1 code implementation • Findings (ACL) 2021 • Shun-Po Chuang, Yung-Sung Chuang, Chih-Chiang Chang, Hung-Yi Lee
We study the possibilities of building a non-autoregressive speech-to-text translation model using connectionist temporal classification (CTC), and use CTC-based automatic speech recognition as an auxiliary task to improve the performance.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 6 Apr 2021 • Shun-Po Chuang, Heng-Jui Chang, Sung-Feng Huang, Hung-Yi Lee
Mandarin-English code-switching (CS) is frequently used among East and Southeast Asian people.
1 code implementation • 29 Oct 2020 • Sung-Feng Huang, Shun-Po Chuang, Da-Rong Liu, Yi-Chen Chen, Gene-Ping Yang, Hung-Yi Lee
Speech separation has been well developed, with the very successful permutation invariant training (PIT) approach, although the frequent label assignment switching happening during PIT training remains to be a problem when better convergence speed and achievable performance are desired.
Ranked #6 on Speech Separation on Libri2Mix (using extra training data)
no code implementations • ACL 2020 • Shun-Po Chuang, Tzu-Wei Sung, Alexander H. Liu, Hung-Yi Lee
Speech translation (ST) aims to learn transformations from speech in the source language to the text in the target language.
no code implementations • 14 Nov 2019 • Shun-Po Chuang, Tzu-Wei Sung, Hung-Yi Lee
A lack of code-switching data complicates the training of code-switching (CS) language models.
1 code implementation • 28 Oct 2019 • Alexander H. Liu, Tzu-Wei Sung, Shun-Po Chuang, Hung-Yi Lee, Lin-shan Lee
This allows the decoder to consider the semantic consistency during decoding by absorbing the information carried by the transformed decoder feature, which is learned to be close to the target word embedding.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
2 code implementations • 6 Nov 2018 • Ching-Ting Chang, Shun-Po Chuang, Hung-Yi Lee
To mitigate the issue without expensive human annotation, we proposed an unsupervised method for code-switching data augmentation.
no code implementations • 13 Aug 2018 • Chia-Hung Wan, Shun-Po Chuang, Hung-Yi Lee
Humans can imagine a scene from a sound.
no code implementations • 26 Nov 2016 • Da-Rong Liu, Shun-Po Chuang, Hung-Yi Lee
Recurrent neural networks (RNNs) have achieved great success in language modeling.