Search Results for author: Andrei Popescu-Belis

Found 37 papers, 11 papers with code

On the Interaction of Regularization Factors in Low-resource Neural Machine Translation

1 code implementation • EAMT 2022 • None Àlex R. Atrio, Andrei Popescu-Belis

We explore the roles and interactions of the hyper-parameters governing regularization, and propose a range of values applicable to low-resource neural machine translation.

Low-Resource Neural Machine Translation Translation

Paper
Code

Constrained Language Models for Interactive Poem Generation

no code implementations • LREC 2022 • Andrei Popescu-Belis, Àlex Atrio, Valentin Minder, Aris Xanthos, Gabriel Luthier, Simon Mattei, Antonio Rodriguez

This paper describes a system for interactive poem generation, which combines neural language models (LMs) for poem generation with explicit constraints that can be set by users on form, topic, emotion, and rhyming scheme.

Paper
Add Code

The IICT-Yverdon System for the WMT 2021 Unsupervised MT and Very Low Resource Supervised MT Task

no code implementations • WMT (EMNLP) 2021 • Àlex R. Atrio, Gabriel Luthier, Axel Fahy, Giorgos Vernikos, Andrei Popescu-Belis, Ljiljana Dolamic

We then present the application of this system to the 2021 task for low-resource supervised Upper Sorbian (HSB) to German translation, in both directions.

Translation

Paper
Add Code

Don't Rank, Combine! Combining Machine Translation Hypotheses Using Quality Estimation

no code implementations • 12 Jan 2024 • Giorgos Vernikos, Andrei Popescu-Belis

We demonstrate that our approach generates novel translations in over half of the cases and consistently outperforms other methods across varying numbers of candidates (5-200).

Machine Translation Translation

Paper
Add Code

Assessing the Importance of Frequency versus Compositionality for Subword-based Tokenization in NMT

1 code implementation • 2 Jun 2023 • Benoist Wolleb, Romain Silvestri, Giorgos Vernikos, Ljiljana Dolamic, Andrei Popescu-Belis

Subword tokenization is the de facto standard for tokenization in neural language models and machine translation systems.

Machine Translation NMT +1

Paper
Code

Small Batch Sizes Improve Training of Low-Resource Neural MT

no code implementations • ICON 2021 • Àlex R. Atrio, Andrei Popescu-Belis

We study the role of an essential hyper-parameter that governs the training of Transformers for neural machine translation in a low-resource setting: the batch size.

Machine Translation Translation

Paper
Add Code

Subword Mapping and Anchoring across Languages

1 code implementation • Findings (EMNLP) 2021 • Giorgos Vernikos, Andrei Popescu-Belis

State-of-the-art multilingual systems rely on shared vocabularies that sufficiently cover all considered languages.

Cross-Lingual Natural Language Inference Machine Translation +1

Paper
Code

Chat or Learn: a Data-Driven Robust Question-Answering System

no code implementations • LREC 2020 • Gabriel Luthier, Andrei Popescu-Belis

We present our choices of data sets for training and testing the components, and present the experimental results that helped us optimize the parameters of the chatbot.

Chatbot coreference-resolution +2

Paper
Add Code

A Consolidated Dataset for Knowledge-based Question Generation using Predicate Mapping of Linked Data

no code implementations • LREC 2020 • Johanna Melly, Gabriel Luthier, Andrei Popescu-Belis

The new data set can be used to generate novel questions given an unseen Wikidata triple, by replacing the subjects of existing questions with the new one and then selecting the best candidate questions using semantic and syntactic criteria.

Question Generation Question-Generation

Paper
Add Code

Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction

no code implementations • WS 2016 • Liane Guillou, Christian Hardmeier, Preslav Nakov, Sara Stymne, Jörg Tiedemann, Yannick Versley, Mauro Cettolo, Bonnie Webber, Andrei Popescu-Belis

We describe the design, the evaluation setup, and the results of the 2016 WMT shared task on cross-lingual pronoun prediction.

Language Modelling POS

Paper
Add Code

Context in Neural Machine Translation: A Review of Models and Evaluations

no code implementations • 25 Jan 2019 • Andrei Popescu-Belis

This review paper discusses how context has been used in neural machine translation (NMT) in the past two years (2017-2018).

Machine Translation NMT +1

Paper
Add Code

Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation

1 code implementation • TACL 2018 • Xiao Pu, Nikolaos Pappas, James Henderson, Andrei Popescu-Belis

We show that the concatenation of these vectors, and the use of a sense selection mechanism based on the weighted average of sense vectors, outperforms several baselines including sense-aware ones.

Clustering Machine Translation +3

Paper
Code

Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German

1 code implementation • LREC 2018 • Pierre-Edouard Honnet, Andrei Popescu-Belis, Claudiu Musat, Michael Baeriswyl

The goal of this work is to design a machine translation (MT) system for a low-resource family of dialects, collectively known as Swiss German, which are widely spoken in Switzerland but seldom written.

Machine Translation Translation

134

Paper
Code

Self-Attentive Residual Decoder for Neural Machine Translation

1 code implementation • NAACL 2018 • Lesly Miculicich Werlen, Nikolaos Pappas, Dhananjay Ram, Andrei Popescu-Belis

Neural sequence-to-sequence networks with attention have achieved remarkable performance for machine translation.

Decoder Machine Translation +1

Paper
Code

Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT)

1 code implementation • WS 2017 • Lesly Miculicich Werlen, Andrei Popescu-Belis

In this paper, we define and assess a reference-based metric to evaluate the accuracy of pronoun translation (APT).

Machine Translation Translation

Paper
Code

Sense-Aware Statistical Machine Translation using Adaptive Context-Dependent Clustering

no code implementations • WS 2017 • Xiao Pu, Nikolaos Pappas, Andrei Popescu-Belis

Clustering Language Modelling +3

Paper
Add Code

Multilingual Hierarchical Attention Networks for Document Classification

2 code implementations • IJCNLP 2017 • Nikolaos Pappas, Andrei Popescu-Belis

Hierarchical attention networks have recently achieved remarkable performance for document classification in a given language.

Classification Computational Efficiency +3

Paper
Code

The SUMMA Platform Prototype

no code implementations • EACL 2017 • Renars Liepins, Ulrich Germann, Guntis Barzdins, Alex Birch, ra, Steve Renals, Susanne Weber, Peggy van der Kreeft, Herv{\'e} Bourlard, Jo{\~a}o Prieto, Ond{\v{r}}ej Klejch, Peter Bell, Alex Lazaridis, ros, Alfonso Mendes, Sebastian Riedel, Mariana S. C. Almeida, Pedro Balage, Shay B. Cohen, Tomasz Dwojak, Philip N. Garner, Andreas Giefer, Marcin Junczys-Dowmunt, Hina Imran, David Nogueira, Ahmed Ali, Mir, Sebasti{\~a}o a, Andrei Popescu-Belis, Lesly Miculicich Werlen, Nikos Papasarantopoulos, Abiola Obamuyide, Clive Jones, Fahim Dalvi, Andreas Vlachos, Yang Wang, Sibo Tong, Rico Sennrich, Nikolaos Pappas, Shashi Narayan, Marco Damonte, Nadir Durrani, Sameer Khurana, Ahmed Abdelali, Hassan Sajjad, Stephan Vogel, David Sheppey, Chris Hernon, Jeff Mitchell

We present the first prototype of the SUMMA Platform: an integrated platform for multilingual media monitoring.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

Consistent Translation of Repeated Nouns using Syntactic and Semantic Cues

no code implementations • EACL 2017 • Xiao Pu, Laura Mascarell, Andrei Popescu-Belis

We compare the automatic post-editing of noun translations with the re-ranking of the translation hypotheses based on the classifiers{'} output, and also use these methods in combination.

Automatic Post-Editing Re-Ranking +1

Paper
Add Code

Using Coreference Links to Improve Spanish-to-English Machine Translation

1 code implementation • WS 2017 • Lesly Miculicich Werlen, Andrei Popescu-Belis

In this paper, we present a proof-of-concept implementation of a coreference-aware decoder for document-level machine translation.

Coreference Resolution Decoder +4

Paper
Code

Machine Translation of Spanish Personal and Possessive Pronouns Using Anaphora Probabilities

1 code implementation • EACL 2017 • Ngoc Quang Luong, Andrei Popescu-Belis, Annette Rios Gonzales, Don Tuggener

We implement a fully probabilistic model to combine the hypotheses of a Spanish anaphora resolution system with those of a Spanish-English machine translation system.

Coreference Resolution Machine Translation +1

Paper
Code

Explicit Document Modeling through Weighted Multiple-Instance Learning

1 code implementation • Journal of Artificial Intelligence Research (JAIR) 2017 • Nikolaos Pappas, Andrei Popescu-Belis

Representing documents is a crucial component in many NLP tasks, for instance predicting aspect ratings in reviews.

Multiple Instance Learning Sentence +2

Paper
Code

Human versus Machine Attention in Document Classification: A Dataset with Crowdsourced Annotations

no code implementations • WS 2016 • Nikolaos Pappas, Andrei Popescu-Belis

Document Classification General Classification +2

Paper
Add Code

Improving Pronoun Translation by Modeling Coreference Uncertainty

no code implementations • WS 2016 • Ngoc Quang Luong, Andrei Popescu-Belis

Coreference Resolution Machine Translation +1

Paper
Add Code

Pronoun Language Model and Grammatical Heuristics for Aiding Pronoun Prediction

no code implementations • WS 2016 • Ngoc Quang Luong, Andrei Popescu-Belis

Language Modelling Machine Translation

Paper
Add Code

Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation

no code implementations • LREC 2016 • Jeevanthi Liyanapathirana, Andrei Popescu-Belis

This paper presents a solution to evaluate spoken post-editing of imperfect machine translation output by a human translator.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

A Contextual Language Model to Improve Machine Translation of Pronouns by Re-ranking Translation Hypotheses

no code implementations • WS 2016 • Ngoc Quang Luong, Andrei Popescu-Belis

Language Modelling Machine Translation +2

Paper
Add Code

Pronoun Translation and Prediction with or without Coreference Links

no code implementations • WS 2015 • Ngoc Quang Luong, Lesly Miculicich Werlen, Andrei Popescu-Belis

Machine Translation Translation

Paper
Add Code

Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German

no code implementations • IJCNLP 2015 • Xiao Pu, Laura Mascarell, Andrei Popescu-Belis, Mark Fishel, Ngoc-Quang Luong, Martin Volk

Machine Translation Translation

Paper
Add Code

Explaining the Stars: Weighted Multiple-Instance Learning for Aspect-Based Sentiment Analysis

no code implementations • EMNLP 2014 • Nikolaos Pappas, Andrei Popescu-Belis

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Paper
Add Code

Enforcing Topic Diversity in a Document Recommender for Conversations

no code implementations • COLING 2014 • Maryam Habibi, Andrei Popescu-Belis

Information Retrieval Keyword Extraction +1

Paper
Add Code

English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling

no code implementations • LREC 2014 • Sharid Lo{\'a}iciga, Thomas Meyer, Andrei Popescu-Belis

This paper presents a method for verb phrase (VP) alignment in an English-French parallel corpus and its use for improving statistical machine translation (SMT) of verb tenses.

Machine Translation POS +2

Paper
Add Code

Detecting Narrativity to Improve English to French Translation of Simple Past Verbs

no code implementations • WS 2013 • Thomas Meyer, Cristina Grisot, Andrei Popescu-Belis

Machine Translation Translation

Paper
Add Code

Diverse Keyword Extraction from Conversations

no code implementations • ACL 2013 • Maryam Habibi, Andrei Popescu-Belis

Keyword Extraction Recommendation Systems

Paper
Add Code

Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns

no code implementations • LREC 2012 • Andrei Popescu-Belis, Thomas Meyer, Jeevanthi Liyanapathirana, Bruno Cartoni, S Zufferey, rine

This paper describes methods and results for the annotation of two discourse-level phenomena, connectives and pronouns, over a multilingual parallel corpus.

Machine Translation Translation

Paper
Add Code

ISO 24617-2: A semantically-based standard for dialogue annotation

no code implementations • LREC 2012 • Harry Bunt, Alex, Jan ersson, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Volha Petukhova, Andrei Popescu-Belis, David Traum

This paper summarizes the latest, final version of ISO standard 24617-2 ``Semantic annotation framework, Part 2: Dialogue acts''''''''.

Paper
Add Code

Using Sense-labeled Discourse Connectives for Statistical Machine Translation

no code implementations • WS 2012 • Thomas Meyer, Andrei Popescu-Belis

Machine Translation Translation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.