no code implementations • LREC 2022 • Per Kummervold, Freddy Wetjen, Javier de la Rosa
Norwegian has been one of many languages lacking sufficient available text to train quality language models.
no code implementations • 2 Feb 2024 • Per E Kummervold, Javier de la Rosa, Freddy Wetjen, Rolv-Arild Braaten, Per Erik Solberg
This article introduces NB-Whisper, an adaptation of OpenAI's Whisper, specifically fine-tuned for Norwegian language Automatic Speech Recognition (ASR).
Automatic Speech Recognition (ASR)
no code implementations • 4 Jul 2023 • Javier de la Rosa, Rolv-Arild Braaten, Per Egil Kummervold, Freddy Wetjen, Svein Arne Brygfjeld
In this paper, we present several baselines for automatic speech recognition (ASR) models for the two official written languages in Norway: Bokmål and Nynorsk.
Automatic Speech Recognition (ASR)
2 code implementations • NoDaLiDa 2021 • Per E Kummervold, Javier de la Rosa, Freddy Wetjen, Svein Arne Brygfjeld
In this work, we show the process of building a large-scale training set from digital and digitized collections at a national library.