Search Results for author: Wilfried Michel

Found 13 papers, 3 papers with code

The RWTH Aachen LVCSR system for IWSLT-2016 German Skype conversation recognition task

no code implementations • IWSLT 2016 • Wilfried Michel, Zoltán Tüske, M. Ali Basha Shaik, Ralf Schlüter, Hermann Ney

In this paper the RWTH large vocabulary continuous speech recognition (LVCSR) systems developed for the IWSLT-2016 evaluation campaign are described.

Language Modelling speech-recognition +1

Paper
Add Code

End-to-End Training of a Neural HMM with Label and Transition Probabilities

1 code implementation • 4 Oct 2023 • Daniel Mann, Tina Raissi, Wilfried Michel, Ralf Schlüter, Hermann Ney

We investigate recognition results and additionally Viterbi alignments of our models.

Paper
Code

Efficient Training of Neural Transducer for Speech Recognition

no code implementations • 22 Apr 2022 • Wei Zhou, Wilfried Michel, Ralf Schlüter, Hermann Ney

In this work, we propose an efficient 3-stage progressive training pipeline to build highly-performing neural transducer models from scratch with very limited computation resources in a reasonable short time period.

speech-recognition Speech Recognition

Paper
Add Code

Conformer-based Hybrid ASR System for Switchboard Dataset

no code implementations • 5 Nov 2021 • Mohammad Zeineldeen, Jingjing Xu, Christoph Lüscher, Wilfried Michel, Alexander Gerstenberger, Ralf Schlüter, Hermann Ney

The recently proposed conformer architecture has been successfully used for end-to-end automatic speech recognition (ASR) architectures achieving state-of-the-art performance on different datasets.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Automatic Learning of Subword Dependent Model Scales

no code implementations • 18 Oct 2021 • Felix Meyer, Wilfried Michel, Mohammad Zeineldeen, Ralf Schlüter, Hermann Ney

We show on the LibriSpeech (LBS) and Switchboard (SWB) corpora that the model scales for a combination of attentionbased encoder-decoder acoustic model and language model can be learned as effectively as with manual tuning.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Efficient Sequence Training of Attention Models using Approximative Recombination

no code implementations • 18 Oct 2021 • Nils-Philipp Wynands, Wilfried Michel, Jan Rosendahl, Ralf Schlüter, Hermann Ney

Lastly, it is shown that this technique can be used to effectively perform sequence discriminative training for attention-based encoder-decoder acoustic models on the LibriSpeech task.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Investigating Methods to Improve Language Model Integration for Attention-based Encoder-Decoder ASR Models

no code implementations • 12 Apr 2021 • Mohammad Zeineldeen, Aleksandr Glushko, Wilfried Michel, Albert Zeyer, Ralf Schlüter, Hermann Ney

Attention-based encoder-decoder (AED) models learn an implicit internal language model (ILM) from the training transcriptions.

Decoder Language Modelling

Paper
Add Code

On Architectures and Training for Raw Waveform Feature Extraction in ASR

no code implementations • 9 Apr 2021 • Peter Vieting, Christoph Lüscher, Wilfried Michel, Ralf Schlüter, Hermann Ney

With the success of neural network based modeling in automatic speech recognition (ASR), many studies investigated acoustic modeling and learning of feature extractors directly based on the raw waveform.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Librispeech Transducer Model with Internal Language Model Prior Correction

2 code implementations • 7 Apr 2021 • Albert Zeyer, André Merboldt, Wilfried Michel, Ralf Schlüter, Hermann Ney

We present our transducer model on Librispeech.

Ranked #25 on Speech Recognition on LibriSpeech test-clean (using extra training data)

Language Modelling Sentence +1

349

Paper
Code

Early Stage LM Integration Using Local and Global Log-Linear Combination

no code implementations • 20 May 2020 • Wilfried Michel, Ralf Schlüter, Hermann Ney

This is compared to a global renormalization scheme which is equivalent to applying shallow fusion in training.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

The RWTH ASR System for TED-LIUM Release 2: Improving Hybrid HMM with SpecAugment

no code implementations • 2 Apr 2020 • Wei Zhou, Wilfried Michel, Kazuki Irie, Markus Kitza, Ralf Schlüter, Hermann Ney

We present a complete training pipeline to build a state-of-the-art hybrid HMM-based ASR system on the 2nd release of the TED-LIUM corpus.

Data Augmentation

Paper
Add Code

Comparison of Lattice-Free and Lattice-Based Sequence Discriminative Training Criteria for LVCSR

no code implementations • 1 Jul 2019 • Wilfried Michel, Ralf Schlüter, Hermann Ney

This allows for a direct comparison of lattice-based and lattice-free sequence discriminative training criteria such as MMI and sMBR, both using the same language model during training.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation

2 code implementations • 8 May 2019 • Christoph Lüscher, Eugen Beck, Kazuki Irie, Markus Kitza, Wilfried Michel, Albert Zeyer, Ralf Schlüter, Hermann Ney

To the best knowledge of the authors, the results obtained when training on the full LibriSpeech training set, are the best published currently, both for the hybrid DNN/HMM and the attention-based systems.

Ranked #24 on Speech Recognition on LibriSpeech test-other

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

349

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.