Search Results for author: Hsin-Wei Wang

Found 9 papers, 0 papers with code

Exploring the Integration of E2E ASR and Pronunciation Modeling for English Mispronunciation Detection

no code implementations • ROCLING 2021 • Hsin-Wei Wang, Bi-Cheng Yan, Yung-Chang Hsu, Berlin Chen

In the first stage, the speech uttered by an L2 learner is processed by an end-to-end ASR module to produce N-best phone sequence hypotheses.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Building an Enhanced Autoregressive Document Retriever Leveraging Supervised Contrastive Learning

no code implementations • ROCLING 2022 • Yi-Cheng Wang, Tzu-Ting Yang, Hsin-Wei Wang, Yung-Chang Hsu, Berlin Chen

DSI dramatically simplifies the whole retrieval process by encoding all information about the document collection into the parameter space of a single Transformer model, on top of which DSI can in turn generate the relevant document identities (IDs) in an autoregressive manner in response to a user query.

Contrastive Learning Information Retrieval +1

Paper
Add Code

DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition

no code implementations • 26 Mar 2024 • Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan, Chi-Han Lin, Berlin Chen

End-to-end automatic speech recognition (E2E ASR) systems often suffer from mistranscription of domain-specific phrases, such as named entities, sometimes leading to catastrophic failures in downstream tasks.

Automatic Speech Recognition Language Modelling +2

Paper
Add Code

An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement

no code implementations • 27 Feb 2024 • Tzu-Ting Yang, Hsin-Wei Wang, Yi-Cheng Wang, Chi-Han Lin, Berlin Chen

With the massive developments of end-to-end (E2E) neural networks, recent years have witnessed unprecedented breakthroughs in automatic speech recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition

no code implementations • 15 Dec 2023 • Tzu-Ting Yang, Hsin-Wei Wang, Berlin Chen

In recent years, end-to-end speech recognition has emerged as a technology that integrates the acoustic, pronunciation dictionary, and language model components of the traditional Automatic Speech Recognition model.

Automatic Speech Recognition Language Identification +3

Paper
Add Code

Preserving Phonemic Distinctions for Ordinal Regression: A Novel Loss Function for Automatic Pronunciation Assessment

no code implementations • 3 Oct 2023 • Bi-Cheng Yan, Hsin-Wei Wang, Yi-Cheng Wang, Jiun-Ting Li, Chi-Han Lin, Berlin Chen

Automatic pronunciation assessment (APA) manages to quantify the pronunciation proficiency of a second language (L2) learner in a language.

regression

Paper
Add Code

AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning

no code implementations • 4 Sep 2023 • Yi-Cheng Wang, Tzu-Ting Yang, Hsin-Wei Wang, Bi-Cheng Yan, Berlin Chen

Voice, as input, has progressively become popular on mobiles and seems to transcend almost entirely text input.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Effective Cross-Utterance Language Modeling for Conversational Speech Recognition

no code implementations • 5 Nov 2021 • Bi-Cheng Yan, Hsin-Wei Wang, Shih-Hsuan Chiu, Hsuan-Sheng Chiu, Berlin Chen

Conversational speech normally is embodied with loose syntactic structures at the utterance level but simultaneously exhibits topical coherence relations across consecutive utterances.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Exploring Non-Autoregressive End-To-End Neural Modeling For English Mispronunciation Detection And Diagnosis

no code implementations • 1 Nov 2021 • Hsin-Wei Wang, Bi-Cheng Yan, Hsuan-Sheng Chiu, Yung-Chang Hsu, Berlin Chen

In addition, we design and develop a pronunciation modeling network stacked on top of the NAR E2E models of our method to further boost the effectiveness of MD&D.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.