Search Results for author: Irina Illina

Found 26 papers, 6 papers with code

Generalisability of Topic Models in Cross-corpora Abusive Language Detection

no code implementations • NAACL (NLP4IF) 2021 • Tulika Bose, Irina Illina, Dominique Fohr

Rapidly changing social media content calls for robust and generalisable abuse detection models.

Paper
Add Code

Label Propagation-Based Semi-Supervised Learning for Hate Speech Classification

no code implementations • EMNLP (insights) 2020 • Ashwin Geet D’Sa, Irina Illina, Dominique Fohr, Dietrich Klakow, Dana Ruiter

In this paper, label propagation-based semi-supervised learning is explored for the task of hate speech classification.

Classification

Paper
Add Code

Transformer versus LSTM Language Models trained on Uncertain ASR Hypotheses in Limited Data Scenarios

no code implementations • LREC 2022 • Imran Sheikh, Emmanuel Vincent, Irina Illina

Training of LSTM LMs in such limited data scenarios can benefit from alternate uncertain ASR hypotheses, as observed in our recent work.

Paper
Add Code

Identification of Multiword Expressions in Tweets for Hate Speech Detection

no code implementations • LREC 2022 • Nicolas Zampieri, Carlos Ramisch, Irina Illina, Dominique Fohr

In this article, we present joint experiments on these two related tasks on English Twitter data: first we focus on the MWE identification task, and then we observe the influence of MWE-based features on the HSD task.

Hate Speech Detection

Paper
Add Code

Unsupervised Domain Adaptation in Cross-corpora Abusive Language Detection

no code implementations • NAACL (SocialNLP) 2021 • Tulika Bose, Irina Illina, Dominique Fohr

The state-of-the-art abusive language detection models report great in-corpus performance, but underperform when evaluated on abusive comments that differ from the training scenario.

Abusive Language Language Modelling +1

Paper
Add Code

Identification des Expressions Polylexicales dans les Tweets (Identification of Multiword Expressions in Tweets)

no code implementations • JEP/TALN/RECITAL 2022 • Nicolas Zampieri, Carlos Ramisch, Irina Illina, Dominique Fohr

L’identification des expressions polylexicales (EP) dans les tweets est une tâche difficile en raison de la nature linguistique complexe des EP combinée à l’utilisation d’un langage non standard.

Paper
Add Code

SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays

no code implementations • 31 Jul 2023 • Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina

Speech enhancement in ad-hoc microphone arrays is often hindered by the asynchronization of the devices composing the microphone array.

Speech Enhancement

Paper
Add Code

Transferring Knowledge via Neighborhood-Aware Optimal Transport for Low-Resource Hate Speech Detection

no code implementations • 17 Oct 2022 • Tulika Bose, Irina Illina, Dominique Fohr

The concerning rise of hateful content on online platforms has increased the attention towards automatic hate speech detection, commonly formulated as a supervised classification task.

Hate Speech Detection

Paper
Add Code

Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection

no code implementations • COLING 2022 • Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr

State-of-the-art approaches for hate-speech detection usually exhibit poor performance in out-of-domain settings.

Domain Adaptation domain classification +1

Paper
Add Code

Placing M-Phasis on the Plurality of Hate: A Feature-Based Corpus of Hate Online

1 code implementation • LREC 2022 • Dana Ruiter, Liane Reiners, Ashwin Geet D'Sa, Thomas Kleinbauer, Dominique Fohr, Irina Illina, Dietrich Klakow, Christian Schemer, Angeliki Monnier

Even though hate speech (HS) online has been an important object of research in the last decade, most HS-related corpora over-simplify the phenomenon of hate by attempting to label user comments as "hate" or "neutral".

Hate Speech Detection

Paper
Code

Dynamically Refined Regularization for Improving Cross-corpora Hate Speech Detection

1 code implementation • Findings (ACL) 2022 • Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr

In this paper, we propose to automatically identify and reduce spurious correlations using attribution methods with dynamic refinement of the list of terms that need to be regularized during training.

Hate Speech Detection

Paper
Code

Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes

1 code implementation • 15 Jun 2021 • Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina

Speech enhancement promises higher efficiency in ad-hoc microphone arrays than in constrained microphone arrays thanks to the wide spatial coverage of the devices in the acoustic scene.

Speech Enhancement

Paper
Code

Improving Automatic Hate Speech Detection with Multiword Expression Features

no code implementations • 1 Jun 2021 • Nicolas Zampieri, Irina Illina, Dominique Fohr

To incorporate MWE features, we create a three-branch deep neural network: one branch for USE, one for MWE categories, and one for MWE embeddings.

Hate Speech Detection Sentence

Paper
Add Code

DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays

1 code implementation • 3 Nov 2020 • Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid

Deep neural network (DNN)-based speech enhancement algorithms in microphone arrays have now proven to be efficient solutions to speech understanding and speech recognition in noisy environments.

Noise Estimation Speech Enhancement +2

Paper
Code

Distributed speech separation in spatially unconstrained microphone arrays

1 code implementation • 2 Nov 2020 • Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid

We propose a distributed algorithm that can process spatial information in a spatially unconstrained microphone array.

Speech Separation

Paper
Code

DNN-Based Semantic Model for Rescoring N-best Speech Recognition List

no code implementations • 2 Nov 2020 • Dominique Fohr, Irina Illina

We propose to perform this through rescoring of the ASR N-best hypotheses list.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Adaptation de domaine non supervis\'ee pour la reconnaissance de la langue par r\'egularisation d'un r\'eseau de neurones (Unsupervised domain adaptation for language identification by regularization of a neural network)

no code implementations • JEPTALNRECITAL 2020 • Rapha{\"e}l Duroselle, Denis Jouvet, Irina Illina

Sur le corpus RATS, pour sept des huit canaux radio {\'e}tudi{\'e}s, l{'}approche permet, sans utiliser de donn{\'e}es annot{\'e}es du domaine cible, de surpasser la performance d{'}un syst{\`e}me entra{\^\i}n{\'e} de fa{\c{c}}on supervis{\'e}e avec des donn{\'e}es annot{\'e}es de ce domaine.

Language Identification Unsupervised Domain Adaptation

Paper
Add Code

Reconnaissance automatique de la parole : g\'en\'eration des prononciations non natives pour l'enrichissement du lexique (In this study we propose a method for lexicon adaptation in order to improve the automatic speech recognition (ASR) of non-native speakers)

no code implementations • JEPTALNRECITAL 2020 • Ismael Bada, Dominique Fohr, Irina Illina

Pour prendre en compte ce probl{\`e}me de prononciations erron{\'e}es, notre approche propose d{'}int{\'e}grer les prononciations non natives dans le lexique et par la suite d{'}utiliser ce lexique enrichi pour la reconnaissance.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Introduction d'informations s\'emantiques dans un syst\`eme de reconnaissance de la parole (Despite spectacular advances in recent years, the Automatic Speech Recognition (ASR) systems still make mistakes, especially in noisy environments)

no code implementations • JEPTALNRECITAL 2020 • St{\'e}phane Level, Irina Illina, Dominique Fohr

Malgr{\'e} les avanc{\'e}s spectaculaires ces derni{\`e}res ann{\'e}es, les syst{\`e}mes de Reconnaissance Automatique de Parole (RAP) commettent encore des erreurs, surtout dans des environnements bruit{\'e}s. Pour am{\'e}liorer la RAP, nous proposons de se diriger vers une contextualisation d{'}un syst{\`e}me RAP, car les informations s{\'e}mantiques sont importantes pour la performance de la RAP.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays

no code implementations • 13 Feb 2020 • Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid

Multichannel processing is widely used for speech enhancement but several limitations appear when trying to deploy these solutions to the real-world.

Speech Enhancement

Paper
Add Code

Towards non-toxic landscapes: Automatic toxic comment detection using DNN

no code implementations • LREC 2020 • Ashwin Geet D'Sa, Irina Illina, Dominique Fohr

The contribution of this paper is the design of binary classification and regression-based approaches aiming to predict whether a comment is toxic or not.

Binary Classification

Paper
Add Code

Learning Word Importance with the Neural Bag-of-Words Model

1 code implementation • WS 2016 • Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linar{\`e}s

Representation Learning Sentiment Analysis +1

Paper
Code

How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News

no code implementations • LREC 2016 • Imran Sheikh, Irina Illina, Dominique Fohr

Out-Of-Vocabulary (OOV) words missed by Large Vocabulary Continuous Speech Recognition (LVCSR) systems can be recovered with the help of topic and semantic context of the OOV words captured from a diachronic text corpus.

Retrieval speech-recognition +1

Paper
Add Code

Learning to retrieve out-of-vocabulary words in speech recognition

no code implementations • 17 Nov 2015 • Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès

In this paper, we propose two neural network models targeted to retrieve OOV PNs relevant to an audio document: (a) Document level Continuous Bag of Words (D-CBOW), (b) Document level Continuous Bag of Weighted Words (D-CBOW2).

Retrieval speech-recognition +1

Paper
Add Code

D\'etection de transcriptions incorrectes de parole non-native dans le cadre de l'apprentissage de langues \'etrang\`eres (Detection of incorrect transcriptions of non-native speech in the context of foreign language learning) [in French]

no code implementations • JEPTALNRECITAL 2012 • Luiza Orosanu, Denis Jouvet, Dominique Fohr, Irina Illina, Anne Bonneau

Paper
Add Code

G\'en\'eration des prononciations de noms propres \`a l'aide des Champs Al\'eatoires Conditionnels (Pronunciation generation for proper names using Conditional Random Fields) [in French]

no code implementations • JEPTALNRECITAL 2012 • Irina Illina, Dominique Fohr, Denis Jouvet

Speech Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.