1 code implementation • EAMT 2022 • None Àlex R. Atrio, Andrei Popescu-Belis
We explore the roles and interactions of the hyper-parameters governing regularization, and propose a range of values applicable to low-resource neural machine translation.
no code implementations • LREC 2022 • Andrei Popescu-Belis, Àlex Atrio, Valentin Minder, Aris Xanthos, Gabriel Luthier, Simon Mattei, Antonio Rodriguez
This paper describes a system for interactive poem generation, which combines neural language models (LMs) for poem generation with explicit constraints that can be set by users on form, topic, emotion, and rhyming scheme.
no code implementations • WMT (EMNLP) 2021 • Àlex R. Atrio, Gabriel Luthier, Axel Fahy, Giorgos Vernikos, Andrei Popescu-Belis, Ljiljana Dolamic
We then present the application of this system to the 2021 task for low-resource supervised Upper Sorbian (HSB) to German translation, in both directions.
no code implementations • 12 Jan 2024 • Giorgos Vernikos, Andrei Popescu-Belis
We demonstrate that our approach generates novel translations in over half of the cases and consistently outperforms other methods across varying numbers of candidates (5-200).
1 code implementation • 2 Jun 2023 • Benoist Wolleb, Romain Silvestri, Giorgos Vernikos, Ljiljana Dolamic, Andrei Popescu-Belis
Subword tokenization is the de facto standard for tokenization in neural language models and machine translation systems.
no code implementations • ICON 2021 • Àlex R. Atrio, Andrei Popescu-Belis
We study the role of an essential hyper-parameter that governs the training of Transformers for neural machine translation in a low-resource setting: the batch size.
1 code implementation • Findings (EMNLP) 2021 • Giorgos Vernikos, Andrei Popescu-Belis
State-of-the-art multilingual systems rely on shared vocabularies that sufficiently cover all considered languages.
Cross-Lingual Natural Language Inference Machine Translation +1
no code implementations • LREC 2020 • Gabriel Luthier, Andrei Popescu-Belis
We present our choices of data sets for training and testing the components, and present the experimental results that helped us optimize the parameters of the chatbot.
no code implementations • LREC 2020 • Johanna Melly, Gabriel Luthier, Andrei Popescu-Belis
The new data set can be used to generate novel questions given an unseen Wikidata triple, by replacing the subjects of existing questions with the new one and then selecting the best candidate questions using semantic and syntactic criteria.
no code implementations • WS 2016 • Liane Guillou, Christian Hardmeier, Preslav Nakov, Sara Stymne, Jörg Tiedemann, Yannick Versley, Mauro Cettolo, Bonnie Webber, Andrei Popescu-Belis
We describe the design, the evaluation setup, and the results of the 2016 WMT shared task on cross-lingual pronoun prediction.
no code implementations • 25 Jan 2019 • Andrei Popescu-Belis
This review paper discusses how context has been used in neural machine translation (NMT) in the past two years (2017-2018).
1 code implementation • TACL 2018 • Xiao Pu, Nikolaos Pappas, James Henderson, Andrei Popescu-Belis
We show that the concatenation of these vectors, and the use of a sense selection mechanism based on the weighted average of sense vectors, outperforms several baselines including sense-aware ones.
1 code implementation • LREC 2018 • Pierre-Edouard Honnet, Andrei Popescu-Belis, Claudiu Musat, Michael Baeriswyl
The goal of this work is to design a machine translation (MT) system for a low-resource family of dialects, collectively known as Swiss German, which are widely spoken in Switzerland but seldom written.
1 code implementation • NAACL 2018 • Lesly Miculicich Werlen, Nikolaos Pappas, Dhananjay Ram, Andrei Popescu-Belis
Neural sequence-to-sequence networks with attention have achieved remarkable performance for machine translation.
1 code implementation • WS 2017 • Lesly Miculicich Werlen, Andrei Popescu-Belis
In this paper, we define and assess a reference-based metric to evaluate the accuracy of pronoun translation (APT).
2 code implementations • IJCNLP 2017 • Nikolaos Pappas, Andrei Popescu-Belis
Hierarchical attention networks have recently achieved remarkable performance for document classification in a given language.
no code implementations • EACL 2017 • Renars Liepins, Ulrich Germann, Guntis Barzdins, Alex Birch, ra, Steve Renals, Susanne Weber, Peggy van der Kreeft, Herv{\'e} Bourlard, Jo{\~a}o Prieto, Ond{\v{r}}ej Klejch, Peter Bell, Alex Lazaridis, ros, Alfonso Mendes, Sebastian Riedel, Mariana S. C. Almeida, Pedro Balage, Shay B. Cohen, Tomasz Dwojak, Philip N. Garner, Andreas Giefer, Marcin Junczys-Dowmunt, Hina Imran, David Nogueira, Ahmed Ali, Mir, Sebasti{\~a}o a, Andrei Popescu-Belis, Lesly Miculicich Werlen, Nikos Papasarantopoulos, Abiola Obamuyide, Clive Jones, Fahim Dalvi, Andreas Vlachos, Yang Wang, Sibo Tong, Rico Sennrich, Nikolaos Pappas, Shashi Narayan, Marco Damonte, Nadir Durrani, Sameer Khurana, Ahmed Abdelali, Hassan Sajjad, Stephan Vogel, David Sheppey, Chris Hernon, Jeff Mitchell
We present the first prototype of the SUMMA Platform: an integrated platform for multilingual media monitoring.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • EACL 2017 • Xiao Pu, Laura Mascarell, Andrei Popescu-Belis
We compare the automatic post-editing of noun translations with the re-ranking of the translation hypotheses based on the classifiers{'} output, and also use these methods in combination.
1 code implementation • WS 2017 • Lesly Miculicich Werlen, Andrei Popescu-Belis
In this paper, we present a proof-of-concept implementation of a coreference-aware decoder for document-level machine translation.
1 code implementation • EACL 2017 • Ngoc Quang Luong, Andrei Popescu-Belis, Annette Rios Gonzales, Don Tuggener
We implement a fully probabilistic model to combine the hypotheses of a Spanish anaphora resolution system with those of a Spanish-English machine translation system.
1 code implementation • Journal of Artificial Intelligence Research (JAIR) 2017 • Nikolaos Pappas, Andrei Popescu-Belis
Representing documents is a crucial component in many NLP tasks, for instance predicting aspect ratings in reviews.
no code implementations • LREC 2016 • Jeevanthi Liyanapathirana, Andrei Popescu-Belis
This paper presents a solution to evaluate spoken post-editing of imperfect machine translation output by a human translator.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • LREC 2014 • Sharid Lo{\'a}iciga, Thomas Meyer, Andrei Popescu-Belis
This paper presents a method for verb phrase (VP) alignment in an English-French parallel corpus and its use for improving statistical machine translation (SMT) of verb tenses.
no code implementations • LREC 2012 • Andrei Popescu-Belis, Thomas Meyer, Jeevanthi Liyanapathirana, Bruno Cartoni, S Zufferey, rine
This paper describes methods and results for the annotation of two discourse-level phenomena, connectives and pronouns, over a multilingual parallel corpus.
no code implementations • LREC 2012 • Harry Bunt, Alex, Jan ersson, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Volha Petukhova, Andrei Popescu-Belis, David Traum
This paper summarizes the latest, final version of ISO standard 24617-2 ``Semantic annotation framework, Part 2: Dialogue acts''''''''.