no code implementations • LREC 2022 • Sebastian Bayerl, Alexander Wolff von Gudenberg, Florian Hönig, Elmar Noeth, Korbinian Riedhammer
To be able to monitor speech behavior over a long time, the ability to detect stuttering events and modifications in speech could help PWSs and speech pathologists to track the level of fluency.
no code implementations • LREC 2022 • Aniruddha Tammewar, Franziska Braun, Gabriel Roccabruna, Sebastian Bayerl, Korbinian Riedhammer, Giuseppe Riccardi
In this work, we annotate a corpus of spoken personal narratives with emotion valence using discrete values.
no code implementations • 23 Feb 2024 • Ismael Agchar, Ilja Baumann, Franziska Braun, Paula Andrea Perez-Toro, Korbinian Riedhammer, Sebastian Trump, Martin Ullrich
In recent years, machine learning, and in particular generative adversarial neural networks (GANs) and attention-based neural networks (transformers), have been successfully used to compose and generate music, both melodies and polyphonic pieces.
no code implementations • 14 Feb 2024 • Philipp Seeberger, Korbinian Riedhammer
Automatic summarization of mass-emergency events plays a critical role in disaster management.
no code implementations • 18 Dec 2023 • Philipp Seeberger, Tobias Bocklet, Korbinian Riedhammer
User-generated information content has become an important information source in crisis situations.
no code implementations • 16 Aug 2023 • Franziska Braun, Sebastian P. Bayerl, Paula A. Pérez-Toro, Florian Hönig, Hartmut Lehfeld, Thomas Hillemacher, Elmar Nöth, Tobias Bocklet, Korbinian Riedhammer
Automated dementia screening enables early detection and intervention, reducing costs to healthcare systems and increasing quality of life for those affected.
no code implementations • 30 May 2023 • Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann, Florian Hönig, Tobias Bocklet, Elmar Nöth, Korbinian Riedhammer
Most stuttering detection and classification research has viewed stuttering as a multi-class classification problem or a binary detection task for each dysfluency type; however, this does not match the nature of stuttering, in which one dysfluency seldom comes alone but rather co-occurs with others.
no code implementations • 2 Feb 2023 • Philipp Seeberger, Korbinian Riedhammer
The CrisisFACTS Track aims to tackle challenges such as multi-stream fact-finding in the domain of event tracking; participants' systems extract important facts from several disaster-related events while incorporating the temporal order.
1 code implementation • 21 Nov 2022 • Philipp Seeberger, Korbinian Riedhammer
Social media has become an important information source for crisis management and provides quick access to ongoing developments and critical information.
no code implementations • 28 Oct 2022 • Sebastian P. Bayerl, Dominik Wagner, Florian Hönig, Tobias Bocklet, Elmar Nöth, Korbinian Riedhammer
This work explores an approach based on a modified wav2vec 2.0 system for end-to-end stuttering detection and classification as a multi-label problem.
no code implementations • 28 Oct 2022 • Ilja Baumann, Dominik Wagner, Franziska Braun, Sebastian P. Bayerl, Elmar Nöth, Korbinian Riedhammer, Tobias Bocklet
Recent findings show that pre-trained wav2vec 2.0 models are reliable feature extractors for various speaker characteristics classification tasks.
no code implementations • 27 Oct 2022 • Dominik Wagner, Ilja Baumann, Franziska Braun, Sebastian P. Bayerl, Elmar Nöth, Korbinian Riedhammer, Tobias Bocklet
The detection of pathologies from speech features is usually defined as a binary classification task with one class representing a specific pathology and the other class representing healthy speech.
no code implementations • 17 Jun 2022 • Sebastian P. Bayerl, Gabriel Roccabruna, Shammur Absar Chowdhury, Tommaso Ciulli, Morena Danieli, Korbinian Riedhammer, Giuseppe Riccardi
To the best of our knowledge, this is the first study to exploit speech and language for characterising working alliance.
no code implementations • 13 Jun 2022 • Arlo Faria, Adam Janin, Korbinian Riedhammer, Sidhi Adkoli
While commercial ASR systems are still below this threshold, a research system is shown to clearly surpass the accuracy of commercial human speech recognition.
no code implementations • 13 Jun 2022 • Franziska Braun, Markus Förstel, Bastian Oppermann, Andreas Erzigkeit, Thomas Hillemacher, Hartmut Lehfeld, Korbinian Riedhammer
For both SKT and CERAD-NB, we observe high to perfect correlations using manual transcripts; for certain tasks with lower correlation, the automatic scoring is stricter than the human reference since it is limited to the audio.
no code implementations • 10 Jun 2022 • Franziska Braun, Andreas Erzigkeit, Hartmut Lehfeld, Thomas Hillemacher, Korbinian Riedhammer, Sebastian P. Bayerl
Standardized tests play a crucial role in the detection of cognitive impairment.
1 code implementation • 7 Jun 2022 • Sebastian P. Bayerl, Dominik Wagner, Elmar Nöth, Tobias Bocklet, Korbinian Riedhammer
This paper empirically investigates the influence of different data splits and splitting strategies on the performance of dysfluency detection systems.
no code implementations • 13 May 2022 • Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Christian Bergler, Maurice Gerczuk, Natalie Holz, Pauline Larrouy-Maestri, Sebastian P. Bayerl, Korbinian Riedhammer, Adria Mallol-Ragolta, Maria Pateraki, Harry Coppock, Ivan Kiskin, Marianne Sinka, Stephen Roberts
The ACM Multimedia 2022 Computational Paralinguistics Challenge addresses four different problems for the first time in a research competition under well-defined conditions: In the Vocalisations and Stuttering Sub-Challenges, a classification on human non-verbal vocalisations and speech has to be made; the Activity Sub-Challenge aims at beyond-audio human activity recognition from smartwatch sensor data; and in the Mosquitoes Sub-Challenge, mosquitoes need to be detected.
no code implementations • 7 Apr 2022 • Sebastian P. Bayerl, Dominik Wagner, Elmar Nöth, Korbinian Riedhammer
This paper shows that fine-tuning wav2vec 2.0 [1] for the classification of stuttering on a sizeable English corpus containing stuttered speech, in conjunction with multi-task learning, boosts the effectiveness of the general-purpose wav2vec 2.0 features for detecting stuttering in speech, both within and across languages.
no code implementations • 7 Apr 2022 • Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann, Korbinian Riedhammer, Tobias Bocklet
Vocal fatigue refers to the feeling of tiredness and weakness of voice due to extended utilization.
no code implementations • 10 Mar 2022 • Sebastian P. Bayerl, Alexander Wolff von Gudenberg, Florian Hönig, Elmar Nöth, Korbinian Riedhammer
To be able to monitor speech behavior over a long time, the ability to detect stuttering events and modifications in speech could help PWSs and speech pathologists to track the level of fluency.
no code implementations • 13 Dec 2021 • Sebastian P. Bayerl, Aniruddha Tammewar, Korbinian Riedhammer, Giuseppe Riccardi
However, in this work, we focus on Emotion Carriers (EC) defined as the segments (speech or text) that best explain the emotional state of the narrator ("loss of father", "made me choose").
no code implementations • 15 Jun 2021 • Sebastian P. Bayerl, Marc Wenninger, Jochen Schmidt, Alexander Wolff von Gudenberg, Korbinian Riedhammer
Stuttering is a complex speech disorder identified by repetitions, prolongations of sounds, syllables or words and blocks while speaking.
no code implementations • 16 Jun 2020 • Sebastian P. Bayerl, Florian Hönig, Joelle Reister, Korbinian Riedhammer
This paper introduces the Speech Control Index (SCI), a new method to evaluate the severity of stuttering.
no code implementations • 19 Sep 2019 • Sebastian P. Bayerl, Korbinian Riedhammer
The best word error rate (WER) regarding syllables was achieved using Kaldi with a 4-gram LM, modeling all syllables observed in the training set.
1 code implementation • 19 Sep 2019 • Marc Wenninger, Sebastian P. Bayerl, Jochen Schmidt, Korbinian Riedhammer
Time series are series of values ordered by time.
no code implementations • LREC 2014 • Tobias Bocklet, Andreas Maier, Korbinian Riedhammer, Ulrich Eysholdt, Elmar Nöth
In this paper we describe Erlangen-CLP, a large speech database of children with Cleft Lip and Palate.