Search Results for author: Dominique Fohr

Found 25 papers, 3 papers with code

Label Propagation-Based Semi-Supervised Learning for Hate Speech Classification

no code implementations • EMNLP (insights) 2020 • Ashwin Geet D’Sa, Irina Illina, Dominique Fohr, Dietrich Klakow, Dana Ruiter

In this paper, label propagation-based semi-supervised learning is explored for the task of hate speech classification.

Classification

Identification des Expressions Polylexicales dans les Tweets (Identification of Multiword Expressions in Tweets)

no code implementations • JEP/TALN/RECITAL 2022 • Nicolas Zampieri, Carlos Ramisch, Irina Illina, Dominique Fohr

Identifying multiword expressions (MWEs) in tweets is a difficult task because of the complex linguistic nature of MWEs combined with the use of non-standard language.

Unsupervised Domain Adaptation in Cross-corpora Abusive Language Detection

no code implementations • NAACL (SocialNLP) 2021 • Tulika Bose, Irina Illina, Dominique Fohr

The state-of-the-art abusive language detection models report great in-corpus performance, but underperform when evaluated on abusive comments that differ from the training scenario.

Abusive Language • Language Modelling • +1

Generalisability of Topic Models in Cross-corpora Abusive Language Detection

no code implementations • NAACL (NLP4IF) 2021 • Tulika Bose, Irina Illina, Dominique Fohr

Rapidly changing social media content calls for robust and generalisable abuse detection models.

Abuse Detection • Abusive Language • +1

Identification of Multiword Expressions in Tweets for Hate Speech Detection

no code implementations • LREC 2022 • Nicolas Zampieri, Carlos Ramisch, Irina Illina, Dominique Fohr

In this article, we present joint experiments on these two related tasks on English Twitter data: first we focus on the MWE identification task, and then we observe the influence of MWE-based features on the HSD task.

Hate Speech Detection

Transferring Knowledge via Neighborhood-Aware Optimal Transport for Low-Resource Hate Speech Detection

no code implementations • 17 Oct 2022 • Tulika Bose, Irina Illina, Dominique Fohr

The concerning rise of hateful content on online platforms has increased the attention towards automatic hate speech detection, commonly formulated as a supervised classification task.

Hate Speech Detection

Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection

no code implementations • COLING 2022 • Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr

State-of-the-art approaches for hate-speech detection usually exhibit poor performance in out-of-domain settings.

Domain Adaptation • domain classification • +1

Placing M-Phasis on the Plurality of Hate: A Feature-Based Corpus of Hate Online

1 code implementation • LREC 2022 • Dana Ruiter, Liane Reiners, Ashwin Geet D'Sa, Thomas Kleinbauer, Dominique Fohr, Irina Illina, Dietrich Klakow, Christian Schemer, Angeliki Monnier

Even though hate speech (HS) online has been an important object of research in the last decade, most HS-related corpora over-simplify the phenomenon of hate by attempting to label user comments as "hate" or "neutral".

Hate Speech Detection

Dynamically Refined Regularization for Improving Cross-corpora Hate Speech Detection

1 code implementation • Findings (ACL) 2022 • Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr

In this paper, we propose to automatically identify and reduce spurious correlations using attribution methods with dynamic refinement of the list of terms that need to be regularized during training.

Hate Speech Detection

Improving Automatic Hate Speech Detection with Multiword Expression Features

no code implementations • 1 Jun 2021 • Nicolas Zampieri, Irina Illina, Dominique Fohr

To incorporate MWE features, we create a three-branch deep neural network: one branch for USE, one for MWE categories, and one for MWE embeddings.

Hate Speech Detection • Sentence

DNN-Based Semantic Model for Rescoring N-best Speech Recognition List

no code implementations • 2 Nov 2020 • Dominique Fohr, Irina Illina

We propose to perform this through rescoring of the ASR N-best hypotheses list.

Automatic Speech Recognition • Automatic Speech Recognition (ASR) • +2

Projet AMIS : résumé et traduction automatique de vidéos (AMIS project: automatic summarization and translation of videos)

no code implementations • JEPTALNRECITAL 2020 • Mohamed Amine Menacer, Dominique Fohr, Denis Jouvet, Karima Abidi, David Langlois, Kamel Smaïli

Another objective of the project was to compare the opinions and sentiments expressed in several comparable videos.

Introduction d'informations sémantiques dans un système de reconnaissance de la parole (Introduction of semantic information into a speech recognition system)

no code implementations • JEPTALNRECITAL 2020 • Stéphane Level, Irina Illina, Dominique Fohr

Despite spectacular advances in recent years, Automatic Speech Recognition (ASR) systems still make errors, especially in noisy environments. To improve ASR, we propose contextualizing the ASR system, because semantic information is important for ASR performance.

Automatic Speech Recognition • Automatic Speech Recognition (ASR) • +1

Reconnaissance automatique de la parole : génération des prononciations non natives pour l'enrichissement du lexique (Automatic speech recognition: generation of non-native pronunciations for lexicon enrichment)

no code implementations • JEPTALNRECITAL 2020 • Ismael Bada, Dominique Fohr, Irina Illina

To account for this problem of erroneous pronunciations, our approach integrates non-native pronunciations into the lexicon and then uses this enriched lexicon for recognition.

Automatic Speech Recognition • Automatic Speech Recognition (ASR) • +1

Towards non-toxic landscapes: Automatic toxic comment detection using DNN

no code implementations • LREC 2020 • Ashwin Geet D'Sa, Irina Illina, Dominique Fohr

The contribution of this paper is the design of binary classification and regression-based approaches aiming to predict whether a comment is toxic or not.

Binary Classification

An enhanced automatic speech recognition system for Arabic

no code implementations • WS 2017 • Mohamed Amine Menacer, Odile Mella, Dominique Fohr, Denis Jouvet, David Langlois, Kamel Smaili

Although the classical techniques for Automatic Speech Recognition (ASR) can be applied efficiently to Arabic speech recognition, it is essential to take the language's specificities into account to improve system performance.

Arabic Speech Recognition • Automatic Speech Recognition • +2

Weakly-supervised text-to-speech alignment confidence measure

no code implementations • COLING 2016 • Guillaume Serrière, Christophe Cerisara, Dominique Fohr, Odile Mella

This work proposes a new confidence measure for evaluating the outputs of text-to-speech alignment systems, which are a key component of many applications, such as semi-automatic corpus anonymization, lip syncing, film dubbing, corpus preparation for speech synthesis, and acoustic model training for speech recognition.

speech-recognition • Speech Recognition • +1

Learning Word Importance with the Neural Bag-of-Words Model

1 code implementation • WS 2016 • Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès

Representation Learning • Sentiment Analysis • +1

How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News

no code implementations • LREC 2016 • Imran Sheikh, Irina Illina, Dominique Fohr

Out-Of-Vocabulary (OOV) words missed by Large Vocabulary Continuous Speech Recognition (LVCSR) systems can be recovered with the help of topic and semantic context of the OOV words captured from a diachronic text corpus.

Retrieval • speech-recognition • +1

The IFCASL Corpus of French and German Non-native and Native Read Speech

no code implementations • LREC 2016 • Juergen Trouvain, Anne Bonneau, Vincent Colotte, Camille Fauth, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella, Bernd Möbius, Frank Zimmerer

The IFCASL corpus is a French-German bilingual phonetic learner corpus designed, recorded and annotated in a project on individualized feedback in computer-assisted spoken language learning.

Learning to retrieve out-of-vocabulary words in speech recognition

no code implementations • 17 Nov 2015 • Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès

In this paper, we propose two neural network models targeted to retrieve OOV PNs relevant to an audio document: (a) Document level Continuous Bag of Words (D-CBOW), (b) Document level Continuous Bag of Weighted Words (D-CBOW2).

Retrieval • speech-recognition • +1

Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process

no code implementations • LREC 2014 • Camille Fauth, Anne Bonneau, Frank Zimmerer, Juergen Trouvain, Bistra Andreeva, Vincent Colotte, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella, Bernd Möbius

We present the design of a corpus of native and non-native speech for the language pair French-German, with a special emphasis on phonetic and prosodic aspects.

Génération des prononciations de noms propres à l'aide des Champs Aléatoires Conditionnels (Pronunciation generation for proper names using Conditional Random Fields) [in French]

no code implementations • JEPTALNRECITAL 2012 • Irina Illina, Dominique Fohr, Denis Jouvet

Speech Recognition

Détection de transcriptions incorrectes de parole non-native dans le cadre de l'apprentissage de langues étrangères (Detection of incorrect transcriptions of non-native speech in the context of foreign language learning) [in French]

no code implementations • JEPTALNRECITAL 2012 • Luiza Orosanu, Denis Jouvet, Dominique Fohr, Irina Illina, Anne Bonneau

CoALT: A Software for Comparing Automatic Labelling Tools

no code implementations • LREC 2012 • Dominique Fohr, Odile Mella

In this paper, we propose CoALT (Comparing Automatic Labelling Tools), a GPL-licensed software tool for comparing two automatic labellers or two speech-text alignment tools, ranking them, and displaying statistics about their differences.

Speech Recognition • Speech Synthesis
