Search Results for author: er

Found 337 papers, 23 papers with code

Metaphor Detection Using Contextual Word Embeddings From Transformers

no code implementations • WS 2020 • Jerry Liu, Nathan O{'}Hara, Alex Rubin, er, Rachel Draelos, Cynthia Rudin

The detection of metaphors can provide valuable information about a given text and is crucial to sentiment analysis and machine translation.

Machine Translation Sentiment Analysis +2

Paper
Add Code

Testing the role of metadata in metaphor identification

no code implementations • WS 2020 • Egon Stemle, Alex Onysko, er

The particular focus of our approach is on the potential influence that the metadata given in the ETS Corpus of Non-Native Written English might have on the automatic detection of metaphors in this dataset.

Paper
Add Code

Towards Stream Translation: Adaptive Computation Time for Simultaneous Machine Translation

no code implementations • WS 2020 • Felix Schneider, Alex Waibel, er

Simultaneous machine translation systems rely on a policy to schedule read and write operations in order to begin translating a source sentence before it is complete.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data

no code implementations • ACL 2020 • Emily M. Bender, Alex Koller, er

The success of the large neural language models on many NLP tasks is exciting.

Natural Language Understanding Position

Paper
Add Code

Frugal Paradigm Completion

no code implementations • ACL 2020 • Alex Erdmann, er, Tom Kenter, Markus Becker, Christian Schallhart

Lexica distinguishing all morphologically related forms of each lexeme are crucial to many language technologies, yet building them is expensive.

Paper
Add Code

FINDINGS OF THE IWSLT 2020 EVALUATION CAMPAIGN

no code implementations • WS 2020 • Ebrahim Ansari, Amittai Axelrod, Nguyen Bach, Ond{\v{r}}ej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Xing Shi, Sebastian St{\"u}ker, Marco Turchi, Alex Waibel, er, Changhan Wang

The evaluation campaign of the International Conference on Spoken Language Translation (IWSLT 2020) featured this year six challenge tracks: (i) Simultaneous speech translation, (ii) Video speech translation, (iii) Offline speech translation, (iv) Conversational speech translation, (v) Open domain translation, and (vi) Non-native speech translation.

Translation

Paper
Add Code

Enabling Low-Resource Transfer Learning across COVID-19 Corpora by Combining Event-Extraction and Co-Training

no code implementations • ACL 2020 • Alex Spangher, er, Nanyun Peng, Jonathan May, Emilio Ferrara

Event Extraction Transfer Learning

Paper
Add Code

KIT's IWSLT 2020 SLT Translation System

no code implementations • WS 2020 • Ngoc-Quan Pham, Felix Schneider, Tuan-Nam Nguyen, Thanh-Le Ha, Thai Son Nguyen, Maximilian Awiszus, Sebastian St{\"u}ker, Alex Waibel, er

This paper describes KIT{'}s submissions to the IWSLT2020 Speech Translation evaluation campaign.

Translation

Paper
Add Code

Modeling Word Formation in English--German Neural Machine Translation

no code implementations • ACL 2020 • Marion Weller-Di Marco, Alex Fraser, er

This paper studies strategies to model word formation in NMT using rich linguistic information, namely a word segmentation approach that goes beyond splitting into substrings by considering fusional morphology.

Machine Translation Morphological Analysis +3

Paper
Add Code

Towards Building an Automatic Transcription System for Language Documentation: Experiences from Muyu

no code implementations • LREC 2020 • Alex Zahrer, er, Andrej Zgank, Barbara Schuppler

The experiments are based on recordings from an ongoing documentation project for the endangered Muyu language in New Guinea.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Exploring Bilingual Word Embeddings for Hiligaynon, a Low-Resource Language

no code implementations • LREC 2020 • Leah Michel, Viktor Hangya, Alex Fraser, er

We use a publicly available Hiligaynon corpus with only 300K words, and match it with a comparable corpus in English.

Word Embeddings

Paper
Add Code

Open-Source High Quality Speech Datasets for Basque, Catalan and Galician

no code implementations • LREC 2020 • Oddur Kjartansson, Alex Gutkin, er, Alena Butryna, Isin Demirsahin, Clara Rivera

This paper introduces new open speech datasets for three of the languages of Spain: Basque, Catalan and Galician.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Is Language Modeling Enough? Evaluating Effective Embedding Combinations

no code implementations • LREC 2020 • Rudolf Schneider, Tom Oberhauser, Paul Grundmann, Felix Alex Gers, Alex Loeser, er, Steffen Staab

We present PubMedSection, a novel topic classification dataset focussed on the biomedical domain.

Entity Disambiguation General Classification +6

Paper
Add Code

Digital Language Infrastructures -- Documenting Language Actors

no code implementations • LREC 2020 • Verena Lyding, Alex K{\"o}nig, er, Monica Pretti

The major European language infrastructure initiatives like CLARIN (Hinrichs and Krauwer, 2014), DARIAH (Edmond et al., 2017) or Europeana (Europeana Foundation, 2015) have been built by focusing in the first place on institutions of larger scale, like specialized research departments and larger official units like national libraries, etc.

Paper
Add Code

Crowdsourcing Latin American Spanish for Low-Resource Text-to-Speech

no code implementations • LREC 2020 • Adriana Guevara-Rukoz, Isin Demirsahin, Fei He, Shan-Hui Cathy Chu, Supheakmungkol Sarin, Knot Pipatsrisawat, Alex Gutkin, er, Alena Butryna, Oddur Kjartansson

In this paper we present a multidialectal corpus approach for building a text-to-speech voice for a new dialect in a language with existing resources, focusing on various South American dialects of Spanish.

Paper
Add Code

Transfer of ISOSpace into a 3D Environment for Annotations and Applications

no code implementations • LREC 2020 • Alex Henlein, Giuseppe Abrami, Attila Kett, Alex Mehler, er

People{'}s visual perception is very pronounced and therefore it is usually no problem for them to describe the space around them in words.

Paper
Add Code

TextAnnotator: A UIMA Based Tool for the Simultaneous and Collaborative Annotation of Texts

no code implementations • LREC 2020 • Giuseppe Abrami, Manuel Stoeckel, Alex Mehler, er

The annotation of texts and other material in the field of digital humanities and Natural Language Processing (NLP) is a common task of research projects.

Paper
Add Code

Open-source Multi-speaker Speech Corpora for Building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu Speech Synthesis Systems

no code implementations • LREC 2020 • Fei He, Shan-Hui Cathy Chu, Oddur Kjartansson, Clara Rivera, Anna Katanova, Alex Gutkin, er, Isin Demirsahin, Cibu Johny, Martin Jansche, Supheakmungkol Sarin, Knot Pipatsrisawat

We present free high quality multi-speaker speech corpora for Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu, which are six of the twenty two official languages of India spoken by 374 million native speakers.

Speech Synthesis

Paper
Add Code

TheRuSLan: Database of Russian Sign Language

no code implementations • LREC 2020 • Ildar Kagirov, Denis Ivanko, Dmitry Ryumin, Alex Axyonov, er, Alexey Karpov

The database includes lexical units (single words and phrases) from Russian sign language within one subject area, namely, {``}food products at the supermarket{''}, and was collected using MS Kinect 2. 0 device including both FullHD video and the depth map modes, which provides new opportunities for the lexicographical description of the Russian sign language vocabulary and enhances research in the field of automatic gesture recognition.

Gesture Recognition Sign Language Recognition

Paper
Add Code

Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech

no code implementations • LREC 2020 • Yin May Oo, Theeraphol Wattanavekin, Chenfang Li, Pasindu De Silva, Supheakmungkol Sarin, Knot Pipatsrisawat, Martin Jansche, Oddur Kjartansson, Alex Gutkin, er

This paper introduces an open-source crowd-sourced multi-speaker speech corpus along with the comprehensive set of finite-state transducer (FST) grammars for performing text normalization for the Burmese (Myanmar) language.

Paper
Add Code

Discovering Biased News Articles Leveraging Multiple Human Annotations

no code implementations • LREC 2020 • Konstantina Lazaridou, Alex L{\"o}ser, er, Maria Mestre, Felix Naumann

Yet, political propaganda and one-sided views can be found in the news and can cause distrust in media.

Paper
Add Code

Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning

no code implementations • LREC 2020 • Lionel Nicolas, Verena Lyding, Claudia Borg, Corina Forascu, Kar{\"e}n Fort, Katerina Zdravkova, Iztok Kosem, Jaka {\v{C}}ibej, {\v{S}}pela Arhar Holdt, Alice Millour, Alex K{\"o}nig, er, Christos Rodosthenous, Federico Sangati, Umair ul Hassan, Anisia Katinskaia, Anabela Barreiro, Lavinia Aparaschivei, Yaakov HaCohen-Kerner

We introduce in this paper a generic approach to combine implicit crowdsourcing and language learning in order to mass-produce language resources (LRs) for any language for which a crowd of language learners can be involved.

Paper
Add Code

Developing a Corpus of Indirect Speech Act Schemas

no code implementations • LREC 2020 • Antonio Roque, Alex Tsuetaki, er, Vasanth Sarathy, Matthias Scheutz

Resolving Indirect Speech Acts (ISAs), in which the intended meaning of an utterance is not identical to its literal meaning, is essential to enabling the participation of intelligent systems in peoples{'} everyday lives.

Paper
Add Code

Open-source Multi-speaker Corpora of the English Accents in the British Isles

no code implementations • LREC 2020 • Isin Demirsahin, Oddur Kjartansson, Alex Gutkin, er, Clara Rivera

This paper presents a dataset of transcribed high-quality audio of English sentences recorded by volunteers speaking with different accents of the British Isles.

Paper
Add Code

Eidos: An Open-Source Auditory Periphery Modeling Toolkit and Evaluation of Cross-Lingual Phonemic Contrasts

no code implementations • LREC 2020 • Alex Gutkin, er

While the auditory periphery mechanisms responsible for transducing the sound pressure wave into the auditory nerve discharge are relatively well understood, the models that describe them are usually very complex because they try to faithfully simulate the behavior of several functionally distinct biological units involved in hearing.

Paper
Add Code

Modelling Frequency and Attestations for OntoLex-Lemon

no code implementations • LREC 2020 • Christian Chiarcos, Maxim Ionov, Jesse de Does, Katrien Depuydt, Anas Fahad Khan, S Stolk, er, Thierry Declerck, John Philip McCrae

Therefore, the OntoLex community has put forward the proposal for a novel module for frequency, attestation and corpus information (FrAC), that not only covers the requirements of digital lexicography, but also accommodates essential data structures for lexical information in natural language processing.

Paper
Add Code

Using Crowdsourced Exercises for Vocabulary Training to Expand ConceptNet

no code implementations • LREC 2020 • Christos Rodosthenous, Verena Lyding, Federico Sangati, Alex K{\"o}nig, er, Umair ul Hassan, Lionel Nicolas, Jolita Horbacauskiene, Anisia Katinskaia, Lavinia Aparaschivei

In this work, we report on a crowdsourcing experiment conducted using the V-TREL vocabulary trainer which is accessed via a Telegram chatbot interface to gather knowledge on word relations suitable for expanding ConceptNet.

Chatbot

Paper
Add Code

On the Influence of Coreference Resolution on Word Embeddings in Lexical-semantic Evaluation Tasks

no code implementations • LREC 2020 • Alex Henlein, Alex Mehler, er

The inclusion of CR as a pre-processing step is expected to lead to improvements in downstream tasks.

coreference-resolution Word Embeddings

Paper
Add Code

Reconstructing NER Corpora: a Case Study on Bulgarian

no code implementations • LREC 2020 • Iva Marinova, Laska Laskova, Petya Osenova, Kiril Simov, Alex Popov, er

The paper reports on the usage of deep learning methods for improving a Named Entity Recognition (NER) training corpus and for predicting and annotating new types in a test corpus.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

CLARIN: Distributed Language Resources and Technology in a European Infrastructure

no code implementations • LREC 2020 • Maria Eskevich, Franciska de Jong, Alex K{\"o}nig, er, Darja Fi{\v{s}}er, Dieter van Uytvanck, Tero Aalto, Lars Borin, Olga Gerassimenko, Jan Hajic, Henk van den Heuvel, Neeme Kahusk, Krista Liin, Martin Matthiesen, Stelios Piperidis, Kadri Vider

CLARIN is a European Research Infrastructure providing access to digital language resources and tools from across Europe and beyond to researchers in the humanities and social sciences.

Paper
Add Code

Beyond lexical semantics: notes on pragmatic frames

no code implementations • LREC 2020 • Oliver Czulo, Alex Ziem, er, Tiago Timponi Torrent

Framenets as an incarnation of frame semantics have been set up to deal with lexicographic issues (cf.

TAG

Paper
Add Code

Voting for POS tagging of Latin texts: Using the flair of FLAIR to better Ensemble Classifiers by Example of Latin

no code implementations • LREC 2020 • Manuel Stoeckel, Alex Henlein, Wahed Hemati, Alex Mehler, er

Since most of the available Latin word embeddings were trained on either few or inaccurate data, we trained several embeddings on better data in the first step.

Lemmatization Part-Of-Speech Tagging +3

Paper
Add Code

CAMeL Tools: An Open Source Python Toolkit for Arabic Natural Language Processing

1 code implementation • LREC 2020 • Ossama Obeid, Nasser Zalmout, Salam Khalifa, Dima Taji, Mai Oudah, Bashar Alhafni, Go Inoue, Fadhl Eryani, Alex Erdmann, er, Nizar Habash

We present CAMeL Tools, a collection of open-source tools for Arabic natural language processing in Python.

Arabic Sentiment Analysis Arabic Text Diacritization +6

381

Paper
Code

LMU Bilingual Dictionary Induction System with Word Surface Similarity Scores for BUCC 2020

no code implementations • LREC 2020 • Silvia Severini, Viktor Hangya, Alex Fraser, er, Hinrich Sch{\"u}tze

We participate in both the open and closed tracks of the shared task and we show improved results of our method compared to simple vector similarity based approaches.

Machine Translation Translation +2

Paper
Add Code

Consistent Unsupervised Estimators for Anchored PCFGs

no code implementations • TACL 2020 • Alex Clark, er, Nathana{\"e}l Fijalkow

Learning probabilistic context-free grammars (PCFGs) from strings is a classic problem in computational linguistics since Horning (1969).

Paper
Add Code

Automatic identification of writers' intentions: Comparing different methods for predicting relationship goals in online dating profile texts

no code implementations • WS 2019 • Chris van der Lee, van der Z, Tess en, Emiel Krahmer, Maria Mos, Alex Schouten, er

Results show that LIWC and machine learning models correlate with human evaluations in terms of content-related labels.

BIG-bench Machine Learning

Paper
Add Code

Fact Checking or Psycholinguistics: How to Distinguish Fake and True Claims?

no code implementations • WS 2019 • Aleks Wawer, er, Grzegorz Wojdyga, Justyna Sarzy{\'n}ska-Wawer

The goal of our paper is to compare psycholinguistic text features with fact checking approaches to distinguish lies from true statements.

Deception Detection Fact Checking

Paper
Add Code

BIOfid Dataset: Publishing a German Gold Standard for Named Entity Recognition in Historical Biodiversity Literature

no code implementations • CONLL 2019 • Sajawel Ahmed, Manuel Stoeckel, Christine Driller, Adrian Pachzelt, Alex Mehler, er

The Specialized Information Service Biodiversity Research (BIOfid) has been launched to mobilize valuable biological data from printed literature hidden in German libraries for over the past 250 years.

named-entity-recognition Named Entity Recognition +2

Paper
Add Code

Saarland at MRP 2019: Compositional parsing across all graphbanks

no code implementations • CONLL 2019 • Lucia Donatelli, Meaghan Fowlie, Jonas Groschwitz, Alex Koller, er, Matthias Lindemann, Mario Mina, Pia Wei{\ss}enhorn

We describe the Saarland University submission to the shared task on Cross-Framework Meaning Representation Parsing (MRP) at the 2019 Conference on Computational Natural Language Learning (CoNLL).

Paper
Add Code

Generating Abstractive Summaries with Finetuned Language Models

no code implementations • WS 2019 • Sebastian Gehrmann, Zachary Ziegler, Alex Rush, er

Neural abstractive document summarization is commonly approached by models that exhibit a mostly extractive behavior.

Document Summarization Inductive Bias +2

Paper
Add Code

A Personalized Data-to-Text Support Tool for Cancer Patients

no code implementations • WS 2019 • Saar Hommes, Chris van der Lee, Felix Clouth, Jeroen Vermunt, X Verbeek, er, Emiel Krahmer

In this paper, we present a novel data-to-text system for cancer patients, providing information on quality of life implications after treatment, which can be embedded in the context of shared decision making.

Decision Making

Paper
Add Code

Best practices for the human evaluation of automatically generated text

no code implementations • WS 2019 • Chris van der Lee, Albert Gatt, Emiel van Miltenburg, S Wubben, er, Emiel Krahmer

Currently, there is little agreement as to how Natural Language Generation (NLG) systems should be evaluated.

Text Generation

Paper
Add Code

Talking about what is not there: Generating indefinite referring expressions in Minecraft

no code implementations • WS 2019 • Arne K{\"o}hn, Alex Koller, er

When generating technical instructions, it is often necessary to describe an object that does not exist yet.

Object

Paper
Add Code

Combining Lexical Substitutes in Neural Word Sense Induction

no code implementations • RANLP 2019 • Nikolay Arefyev, Boris Sheludko, Alex Panchenko, er

Word Sense Induction (WSI) is the task of grouping of occurrences of an ambiguous word according to their meaning.

Clustering Word Sense Induction

Paper
Add Code

Predicting Sentiment of Polish Language Short Texts

no code implementations • RANLP 2019 • Aleks Wawer, er, Julita Sobiczewska

In the second we train models on all available data except the given test collection, which we use for testing (one vs rest cross-domain).

Sentiment Analysis

Paper
Add Code

Know Your Graph. State-of-the-Art Knowledge-Based WSD

no code implementations • RANLP 2019 • Alex Popov, er, Kiril Simov, Petya Osenova

This paper introduces several improvements over the current state of the art in knowledge-based word sense disambiguation.

Word Sense Disambiguation World Knowledge

Paper
Add Code

Graph Embeddings for Frame Identification

no code implementations • RANLP 2019 • Alex Popov, er, Jennifer Sikos

Lexical resources such as WordNet (Miller, 1995) and FrameNet (Baker et al., 1998) are organized as graphs, where relationships between words are made explicit via the structure of the resource.

Paper
Add Code

v-trel: Vocabulary Trainer for Tracing Word Relations - An Implicit Crowdsourcing Approach

no code implementations • RANLP 2019 • Verena Lyding, Christos Rodosthenous, Federico Sangati, Umair ul Hassan, Lionel Nicolas, Alex K{\"o}nig, er, Jolita Horbacauskiene, Anisia Katinskaia

In this paper, we present our work on developing a vocabulary trainer that uses exercises generated from language resources such as ConceptNet and crowdsources the responses of the learners to enrich the language resource.

Paper
Add Code

A Little Linguistics Goes a Long Way: Unsupervised Segmentation with Limited Language Specific Guidance

no code implementations • WS 2019 • Alex Erdmann, er, Salam Khalifa, Mai Oudah, Nizar Habash, Houda Bouamor

We present de-lexical segmentation, a linguistically motivated alternative to greedy or other unsupervised methods, requiring only minimal language specific input.

Paper
Add Code

Detection of Adverse Drug Reaction in Tweets Using a Combination of Heterogeneous Word Embeddings

no code implementations • WS 2019 • Segun Taofeek Aroyehun, Alex Gelbukh, er

This paper details our approach to the task of detecting reportage of adverse drug reaction in tweets as part of the 2019 social media mining for healthcare applications shared task.

Word Embeddings

Paper
Add Code

The LMU Munich Unsupervised Machine Translation System for WMT19

no code implementations • WS 2019 • Dario Stojanovski, Viktor Hangya, Matthias Huck, Alex Fraser, er

We describe LMU Munich{'}s machine translation system for Germanâ†’Czech translation which was used to participate in the WMT19 shared task on unsupervised news translation.

Denoising Language Modelling +3

Paper
Add Code

Improving Anaphora Resolution in Neural Machine Translation Using Curriculum Learning

no code implementations • WS 2019 • Dario Stojanovski, Alex Fraser, er

Machine Translation Translation

Paper
Add Code

Combining Local and Document-Level Context: The LMU Munich Neural Machine Translation System at WMT19

no code implementations • WS 2019 • Dario Stojanovski, Alex Fraser, er

We describe LMU Munich{'}s machine translation system for Englishâ†’German translation which was used to participate in the WMT19 shared task on supervised news translation.

Machine Translation Sentence +1

Paper
Add Code

The Meaning of ``Most'' for Visual Question Answering Models

no code implementations • WS 2019 • Alex Kuhnle, er, Ann Copestake

The correct interpretation of quantifier statements in the context of a visual scene requires non-trivial inference mechanisms.

Question Answering Visual Question Answering

Paper
Add Code

PROMT Systems for WMT 2019 Shared Translation Task

no code implementations • WS 2019 • Alex Molchanov, er

This paper describes the PROMT submissions for the WMT 2019 Shared News Translation Task.

Translation

Paper
Add Code

A Dataset for Noun Compositionality Detection for a Slavic Language

1 code implementation • WS 2019 • Dmitry Puzyrev, Artem Shelmanov, Alex Panchenko, er, Ekaterina Artemova

This paper presents the first gold-standard resource for Russian annotated with compositionality information of noun compounds.

Paper
Code

Large-Scale Transfer Learning for Natural Language Generation

1 code implementation • ACL 2019 • Sergey Golovanov, Rauf Kurbanov, Sergey Nikolenko, Kyryl Truskovskyi, Alex Tselousov, er, Thomas Wolf

Large-scale pretrained language models define state of the art in natural language processing, achieving outstanding performance on a variety of tasks.

Open-Domain Dialog Text Generation +1

Paper
Code

Improving Neural Entity Disambiguation with Graph Embeddings

no code implementations • ACL 2019 • {\"O}zge Sevgili, Alex Panchenko, er, Chris Biemann

Entity Disambiguation (ED) is the task of linking an ambiguous entity mention to a corresponding entry in a knowledge base.

Entity Disambiguation

Paper
Add Code

TARGER: Neural Argument Mining at Your Fingertips

1 code implementation • ACL 2019 • Artem Chernodub, Oleksiy Oliynyk, Philipp Heidenreich, Alex Bondarenko, Matthias Hagen, Chris Biemann, Alex Panchenko, er

We present TARGER, an open source neural argument mining framework for tagging arguments in free input texts and for keyword-based retrieval of arguments from an argument-tagged web-scale corpus.

Argument Mining Retrieval

250

Paper
Code

Better OOV Translation with Bilingual Terminology Mining

no code implementations • ACL 2019 • Matthias Huck, Viktor Hangya, Alex Fraser, er

In our experiments we use a system trained on Europarl and mine sentences containing medical terms from monolingual data.

Machine Translation NMT +2

Paper
Add Code

Unsupervised Parallel Sentence Extraction with Parallel Segment Detection Helps Machine Translation

1 code implementation • ACL 2019 • Viktor Hangya, Alex Fraser, er

Mining parallel sentences from comparable corpora is important.

Machine Translation Sentence +2

Paper
Code

Graph-Based Meaning Representations: Design and Processing

1 code implementation • ACL 2019 • Alex Koller, er, Stephan Oepen, Weiwei Sun

This tutorial is on representing and processing sentence meaning in the form of labeled directed graphs.

Sentence

113

Paper
Code

On the Compositionality Prediction of Noun Phrases using Poincar\'e Embeddings

no code implementations • ACL 2019 • Abhik Jana, Dima Puzyrev, Alex Panchenko, er, Pawan Goyal, Chris Biemann, Animesh Mukherjee

In particular, we use hypernymy information of the multiword and its constituents encoded in the form of the recently introduced Poincar{\'e} embeddings in addition to the distributional information to detect compositionality for noun phrases.

Paper
Add Code

Cross-lingual Annotation Projection Is Effective for Neural Part-of-Speech Tagging

no code implementations • WS 2019 • Matthias Huck, Diana Dutka, Alex Fraser, er

We tackle the important task of part-of-speech tagging using a neural model in the zero-resource scenario, where we have no access to gold-standard POS training data.

Part-Of-Speech Tagging POS +1

Paper
Add Code

Scalable Methods for Annotating Legal-Decision Corpora

no code implementations • WS 2019 • Lisa Ferro, John Aberdeen, Karl Branting, Craig Pfeifer, Alex Yeh, er, Amartya Chakraborty

Recent research has demonstrated that judicial and administrative decisions can be predicted by machine-learning models trained on prior decisions.

Paper
Add Code

Practical, Efficient, and Customizable Active Learning for Named Entity Recognition in the Digital Humanities

2 code implementations • NAACL 2019 • Alex Erdmann, er, David Joseph Wrisley, Benjamin Allen, Christopher Brown, Sophie Cohen-Bod{\'e}n{\`e}s, Micha Elsner, Yukun Feng, Brian Joseph, B{\'e}atrice Joyeux-Prunel, Marie-Catherine de Marneffe

Scholars in inter-disciplinary fields like the Digital Humanities are increasingly interested in semantic annotation of specialized corpora.

Active Learning General Classification +3

Paper
Code

HHU at SemEval-2019 Task 6: Context Does Matter - Tackling Offensive Language Identification and Categorization with ELMo

no code implementations • SEMEVAL 2019 • Alex Oberstrass, er, Julia Romberg, Anke Stoll, Stefan Conrad

We present our results for OffensEval: Identifying and Categorizing Offensive Language in Social Media (SemEval 2019 - Task 6).

Language Identification

Paper
Add Code

CIC at SemEval-2019 Task 5: Simple Yet Very Efficient Approach to Hate Speech Detection, Aggressive Behavior Detection, and Target Classification in Twitter

no code implementations • SEMEVAL 2019 • Iqra Ameer, Muhammad Hammad Fahim Siddiqui, Grigori Sidorov, Alex Gelbukh, er

The goal of this paper is to detect (A) Hate speech against immigrants and women, (B) Aggressive behavior and target classification, both for English and Spanish.

Hate Speech Detection

Paper
Add Code

Neural GRANNy at SemEval-2019 Task 2: A combined approach for better modeling of semantic relationships in semantic frame induction

no code implementations • SEMEVAL 2019 • Nikolay Arefyev, Boris Sheludko, Adis Davletov, Dmitry Kharchev, Alex Nevidomsky, Alex Panchenko, er

We describe our solutions for semantic frame and role induction subtasks of SemEval 2019 Task 2.

Clustering Language Modelling +1

Paper
Add Code

Automated learning of templates for data-to-text generation: comparing rule-based, statistical and neural methods

1 code implementation • WS 2018 • Chris van der Lee, Emiel Krahmer, S Wubben, er

The current study investigated novel techniques and methods for trainable approaches to data-to-text generation.

Data-to-Text Generation Machine Translation +1

Paper
Code

Debugging Sequence-to-Sequence Models with Seq2Seq-Vis

no code implementations • WS 2018 • Hendrik Strobelt, Sebastian Gehrmann, Michael Behrisch, Adam Perer, Hanspeter Pfister, Alex Rush, er

Neural attention-based sequence-to-sequence models (seq2seq) (Sutskever et al., 2014; Bahdanau et al., 2014) have proven to be accurate and robust for many sequence prediction tasks.

Attribute Translation

Paper
Add Code

Sentence Packaging in Text Generation from Semantic Graphs as a Community Detection Problem

no code implementations • WS 2018 • Alex Shvets, er, Simon Mille, Leo Wanner

An increasing amount of research tackles the challenge of text generation from abstract ontological or semantic structures, which are in their very nature potentially large connected graphs.

Community Detection Sentence +2

Paper
Add Code

Applications of NLG in practical conversational AI settings

no code implementations • WS 2018 • S Wubben, er

Slot Filling Text Generation

Paper
Add Code

Enriching the WebNLG corpus

1 code implementation • WS 2018 • Thiago Castro Ferreira, Diego Moussallem, Emiel Krahmer, S Wubben, er

This paper describes the enrichment of WebNLG corpus (Gardent et al., 2017a, b), with the aim to further extend its usefulness as a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation.

Machine Translation Referring Expression +3

Paper
Code

E2E NLG Challenge Submission: Towards Controllable Generation of Diverse Natural Language

no code implementations • WS 2018 • Henry Elder, Sebastian Gehrmann, Alex O{'}Connor, er, Qun Liu

In natural language generation (NLG), the task is to generate utterances from a more abstract input, such as structured data.

Machine Translation Task-Oriented Dialogue Systems +1

Paper
Add Code

Complementary Strategies for Low Resourced Morphological Modeling

no code implementations • WS 2018 • Alex Erdmann, er, Nizar Habash

Morphologically rich languages are challenging for natural language processing tasks due to data sparsity.

Morphological Analysis Word Embeddings

Paper
Add Code

The LMU Munich Unsupervised Machine Translation Systems

no code implementations • WS 2018 • Dario Stojanovski, Viktor Hangya, Matthias Huck, Alex Fraser, er

We describe LMU Munich{'}s unsupervised machine translation systems for Englishâ†”German translation.

Denoising Language Modelling +3

Paper
Add Code

Training for Diversity in Image Paragraph Captioning

1 code implementation • EMNLP 2018 • Luke Melas-Kyriazi, Alex Rush, er, George Han

Image paragraph captioning models aim to produce detailed descriptions of a source image.

Ranked #2 on Image Paragraph Captioning on Image Paragraph Captioning

Image Captioning Image Paragraph Captioning +4

Paper
Code

Coreference and Coherence in Neural Machine Translation: A Study Using Oracle Experiments

no code implementations • WS 2018 • Dario Stojanovski, Alex Fraser, er

We show that NMT models taking advantage of context oracle signals can achieve considerable gains in BLEU, of up to 7. 02 BLEU for coreference and 1. 89 BLEU for coherence on subtitles translation.

Coreference Resolution Language Modelling +4

Paper
Add Code

Automatic Identification of Drugs and Adverse Drug Reaction Related Tweets

no code implementations • WS 2018 • Segun Taofeek Aroyehun, Alex Gelbukh, er

We describe our submissions to the Third Social Media Mining for Health Applications Shared Task.

Paper
Add Code

IARM: Inter-Aspect Relation Modeling with Memory Networks in Aspect-Based Sentiment Analysis

1 code implementation • EMNLP 2018 • Navonil Majumder, Soujanya Poria, Alex Gelbukh, er, Md. Shad Akhtar, Erik Cambria, Asif Ekbal

Sentiment analysis has immense implications in e-commerce through user feedback mining.

Ranked #30 on Aspect-Based Sentiment Analysis (ABSA) on SemEval-2014 Task-4

Aspect-Based Sentiment Analysis Extract Aspect +3

Paper
Code

An Unsupervised System for Parallel Corpus Filtering

no code implementations • WS 2018 • Viktor Hangya, Alex Fraser, er

In this paper we describe LMU Munich{'}s submission for the \textit{WMT 2018 Parallel Corpus Filtering} shared task which addresses the problem of cleaning noisy parallel corpora.

Domain Adaptation Language Modelling +6

Paper
Add Code

LMU Munich's Neural Machine Translation Systems at WMT 2018

no code implementations • WS 2018 • Matthias Huck, Dario Stojanovski, Viktor Hangya, Alex Fraser, er

The systems were used for our participation in the WMT18 biomedical translation task and in the shared task on machine translation of news.

Domain Adaptation Translation +1

Paper
Add Code

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2018

no code implementations • WS 2018 • Ngoc-Quan Pham, Jan Niehues, Alex Waibel, er

We present our experiments in the scope of the news translation task in WMT 2018, in directions: Englishâ†’German.

Decoder Machine Translation +2

Paper
Add Code

PROMT Systems for WMT 2018 Shared Translation Task

no code implementations • WS 2018 • Alex Molchanov, er

This paper describes the PROMT submissions for the WMT 2018 Shared News Translation Task.

Machine Translation Translation

Paper
Add Code

Transfer Learning for Entity Recognition of Novel Classes

1 code implementation • COLING 2018 • Juan Diego Rodriguez, Adam Caldwell, Alex Liu, er

Our results empirically demonstrate when each of the published approaches tends to do well.

Entity Extraction using GAN Named Entity Recognition (NER) +2

Paper
Code

KIT Lecture Translator: Multilingual Speech Translation with One-Shot Learning

no code implementations • COLING 2018 • Florian Dessloch, Thanh-Le Ha, Markus M{\"u}ller, Jan Niehues, Thai-Son Nguyen, Ngoc-Quan Pham, Elizabeth Salesky, Matthias Sperber, Sebastian St{\"u}ker, Thomas Zenkel, Alex Waibel, er

{\%} Combining these techniques, we are able to provide an adapted speech translation system for several European languages.

Automatic Speech Recognition (ASR) Machine Translation +3

Paper
Add Code

Evaluating the text quality, human likeness and tailoring component of PASS: A Dutch data-to-text system for soccer

no code implementations • COLING 2018 • Chris van der Lee, Bart Verduijn, Emiel Krahmer, S Wubben, er

We present an evaluation of PASS, a data-to-text system that generates Dutch soccer reports from match statistics which are automatically tailored towards fans of one club or the other.

Text Generation

Paper
Add Code

Aggression Detection in Social Media: Using Deep Neural Networks, Data Augmentation, and Pseudo Labeling

no code implementations • COLING 2018 • Segun Taofeek Aroyehun, Alex Gelbukh, er

On this task, we investigate the efficacy of deep neural network models of varying complexity.

Data Augmentation Feature Engineering +1

Paper
Add Code

LTV: Labeled Topic Vector

no code implementations • COLING 2018 • Daniel Baumartz, Tolga Uslu, Alex Mehler, er

In this paper we present LTV, a website and API that generates labeled topic classifications based on the Dewey Decimal Classification (DDC), an international standard for topic classification in libraries.

General Classification Semantic Textual Similarity +1

Paper
Add Code

Aspect-based summarization of pros and cons in unstructured product reviews

1 code implementation • COLING 2018 • Florian Kunneman, S Wubben, er, Antal Van den Bosch, Emiel Krahmer

In the second evaluation, the gold-standard pros and cons were assessed along with the system output.

Aspect-Based Sentiment Analysis (ABSA)

Paper
Code

OpenNMT System Description for WNMT 2018: 800 words/sec on a single-core CPU

no code implementations • WS 2018 • Jean Senellart, Dakun Zhang, Bo wang, Guillaume Klein, Ramatch, Jean-Pierre irin, Josep Crego, Alex Rush, er

We present a system description of the OpenNMT Neural Machine Translation entry for the WNMT 2018 evaluation.

Machine Translation Neural Architecture Search +3

Paper
Add Code

Discourse Coherence: Concurrent Explicit and Implicit Relations

no code implementations • ACL 2018 • Hannah Rohde, Alex Johnson, er, Nathan Schneider, Bonnie Webber

Theories of discourse coherence posit relations between discourse segments as a key feature of coherent text.

Discourse Parsing Implicit Relations

Paper
Add Code

Surface Realization Shared Task 2018 (SR18): The Tilburg University Approach

1 code implementation • WS 2018 • Thiago Castro Ferreira, S Wubben, er, Emiel Krahmer

This study describes the approach developed by the Tilburg University team to the shallow task of the Multilingual Surface Realization Shared Task 2018 (SR18).

Machine Translation Translation

Paper
Code

The Annotated Transformer

1 code implementation • WS 2018 • Alex Rush, er

A major goal of open-source NLP is to quickly and accurately reproduce the results of new work, in a manner that the community can easily use and modify.

5,090

Paper
Code

DeepPavlov: Open-Source Library for Dialogue Systems

no code implementations • ACL 2018 • Mikhail Burtsev, Alex Seliverstov, er, Rafael Airapetyan, Mikhail Arkhipov, Dilyara Baymurzina, Nickolay Bushkov, Olga Gureenkova, Taras Khakhulin, Yuri Kuratov, Denis Kuznetsov, Alexey Litinsky, Varvara Logacheva, Alexey Lymar, Valentin Malykh, Maxim Petrov, Vadim Polulyakh, Leonid Pugachev, Alexey Sorokin, Maria Vikhreva, Marat Zaynutdinov

It supports modular as well as end-to-end approaches to implementation of conversational agents.

General Classification intent-classification +5

Paper
Add Code

Two Methods for Domain Adaptation of Bilingual Tasks: Delightfully Simple and Broadly Applicable

1 code implementation • ACL 2018 • Viktor Hangya, Fabienne Braune, Alex Fraser, er, Hinrich Sch{\"u}tze

Bilingual tasks, such as bilingual lexicon induction and cross-lingual classification, are crucial for overcoming data sparsity in the target language.

Bilingual Lexicon Induction Classification +7

Paper
Code

Addressing Noise in Multidialectal Word Embeddings

no code implementations • ACL 2018 • Alex Erdmann, er, Nasser Zalmout, Nizar Habash

Arabic dialects lack large corpora and are noisy, being linguistically disparate with no standardized spelling.

Sentence Transliteration +1

Paper
Add Code

Modeling Second-Language Learning from a Psychological Perspective

no code implementations • WS 2018 • Alex Rich, er, Pamela Osborn Popp, David Halpern, Anselm Rothe, Todd Gureckis

Psychological research on learning and memory has tended to emphasize small-scale laboratory studies.

Language Acquisition

Paper
Add Code

Complex Word Identification: Convolutional Neural Network vs. Feature Engineering

no code implementations • WS 2018 • Segun Taofeek Aroyehun, Jason Angel, Daniel Alej P{\'e}rez Alvarez, ro, Alex Gelbukh, er

We describe the systems of NLP-CIC team that participated in the Complex Word Identification (CWI) 2018 shared task.

Complex Word Identification Feature Engineering +1

Paper
Add Code

Noise-Robust Morphological Disambiguation for Dialectal Arabic

no code implementations • NAACL 2018 • Nasser Zalmout, Alex Erdmann, er, Nizar Habash

User-generated text tends to be noisy with many lexical and orthographic inconsistencies, making natural language processing (NLP) tasks more challenging.

Lexical Normalization Morphological Analysis +3

Paper
Add Code

PunFields at SemEval-2018 Task 3: Detecting Irony by Tools of Humor Analysis

no code implementations • SEMEVAL 2018 • Elena Mikhalkova, Yuri Karyakin, Alex Voronov, er, Dmitry Grigoriev, Artem Leoznov

The paper describes our search for a universal algorithm of detecting intentional lexical ambiguity in different forms of creative language.

Paper
Add Code

Detecting Figurative Word Occurrences Using Recurrent Neural Networks

no code implementations • WS 2018 • Agnieszka Mykowiecka, Aleks Wawer, er, Malgorzata Marciniak

The paper addresses detection of figurative usage of words in English text.

Word Embeddings

Paper
Add Code

Using Language Learner Data for Metaphor Detection

1 code implementation • WS 2018 • Egon Stemle, Alex Onysko, er

This article describes the system that participated in the shared task on metaphor detection on the Vrije University Amsterdam Metaphor Corpus (VUA).

Language Identification Word Embeddings

Paper
Code

Multi-Module Recurrent Neural Networks with Transfer Learning

no code implementations • WS 2018 • Filip Skurniak, Maria Janicka, Aleks Wawer, er

This paper describes multiple solutions designed and tested for the problem of word-level metaphor detection.

Machine Translation Transfer Learning +2

Paper
Add Code

Literal, Metphorical or Both? Detecting Metaphoricity in Isolated Adjective-Noun Phrases

no code implementations • WS 2018 • Agnieszka Mykowiecka, Malgorzata Marciniak, Aleks Wawer, er

The paper addresses the classification of isolated Polish adjective-noun phrases according to their metaphoricity.

General Classification Machine Translation +2

Paper
Add Code

ClaiRE at SemEval-2018 Task 7: Classification of Relations using Embeddings

no code implementations • SEMEVAL 2018 • Lena Hettinger, Alex Dallmann, er, Albin Zehe, Thomas Niebler, Andreas Hotho

In this paper we describe our system for SemEval-2018 Task 7 on classification of semantic relations in scientific literature for clean (subtask 1. 1) and noisy data (subtask 1. 2).

Classification General Classification +4

Paper
Add Code

Evaluating bilingual word embeddings on the long tail

1 code implementation • NAACL 2018 • Fabienne Braune, Viktor Hangya, Tobias Eder, Alex Fraser, er

Bilingual word embeddings are useful for bilingual lexicon induction, the task of mining translations of given words.

Bilingual Lexicon Induction Word Embeddings

Paper
Code

UMD at SemEval-2018 Task 10: Can Word Embeddings Capture Discriminative Attributes?

no code implementations • SEMEVAL 2018 • Alex Zhang, er, Marine Carpuat

We describe the University of Maryland{'}s submission to SemEval-018 Task 10, {``}Capturing Discriminative Attributes{''}: given word triples (w1, w2, d), the goal is to determine whether d is a discriminating attribute belonging to w1 but not w2.

Attribute Binary Classification +2

Paper
Add Code

A High-Quality Gold Standard for Citation-based Tasks

no code implementations • LREC 2018 • Michael F{\"a}rber, Alex Thiemann, er, Adam Jatowt

Entity Linking Entity Resolution +1

Paper
Add Code

FastSense: An Efficient Word Sense Disambiguation Classifier

no code implementations • LREC 2018 • Tolga Uslu, Alex Mehler, er, Daniel Baumartz, Wahed Hemati

Entity Linking Text Classification +1

Paper
Add Code

The Effects of Unimodal Representation Choices on Multimodal Learning

no code implementations • LREC 2018 • Fern Ito, o Tadao, Helena de Medeiros Caseli, J Moreira, er

Image Classification

Paper
Add Code

A UIMA Database Interface for Managing NLP-related Text Annotations

1 code implementation • LREC 2018 • Giuseppe Abrami, Alex Mehler, er

Paper
Code

Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech

no code implementations • LREC 2018 • Jaka Aris Eko Wibawa, Supheakmungkol Sarin, Chenfang Li, Knot Pipatsrisawat, Keshan Sodimana, Oddur Kjartansson, Alex Gutkin, er, Martin Jansche, Linne Ha

Automatic Speech Recognition (ASR) Speech Synthesis

Paper
Add Code

The Linguistic Category Model in Polish (LCM-PL)

no code implementations • LREC 2018 • Aleks Wawer, er, Justyna Sarzy{\'n}ska

Paper
Add Code

WikiDragon: A Java Framework For Diachronic Content And Network Analysis Of MediaWikis

1 code implementation • LREC 2018 • R{\"u}diger Gleim, Alex Mehler, er, Sung Y. Song

Named Entity Recognition (NER) Word Sense Disambiguation

Paper
Code

Unified Guidelines and Resources for Arabic Dialect Orthography

no code implementations • LREC 2018 • Nizar Habash, Fadhl Eryani, Salam Khalifa, Owen Rambow, Dana Abdulrahim, Alex Erdmann, er, Reem Faraj, Wajdi Zaghouani, Houda Bouamor, Nasser Zalmout, Sara Hassan, Faisal Al-Shargi, Sakhar Alkhereyf, Basma Abdulkareem, Esk, Ramy er, Mohammad Salameh, Hind Saddiki

Speech Recognition Transliteration

Paper
Add Code

KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus

no code implementations • LREC 2018 • Thanh-Le Ha, Jan Niehues, Matthias Sperber, Ngoc Quan Pham, Alex Waibel, er

Cross-Lingual Document Classification Document Classification +8

Paper
Add Code

The MADAR Arabic Dialect Corpus and Lexicon

no code implementations • LREC 2018 • Houda Bouamor, Nizar Habash, Mohammad Salameh, Wajdi Zaghouani, Owen Rambow, Dana Abdulrahim, Ossama Obeid, Salam Khalifa, Fadhl Eryani, Alex Erdmann, er, Kemal Oflazer

Transliteration

Paper
Add Code

A database of German definitory contexts from selected web sources

no code implementations • LREC 2018 • Adrien Barbaresi, Lothar Lemnitzer, Alex Geyken, er

Paper
Add Code

FonBund: A Library for Combining Cross-lingual Phonological Segment Data

1 code implementation • LREC 2018 • Alex Gutkin, er, Martin Jansche, Tatiana Merkulova

Language Modelling Speech Synthesis

365

Paper
Code

TreeAnnotator: Versatile Visual Annotation of Hierarchical Text Relations

no code implementations • LREC 2018 • Philipp Helfrich, Elias Rieb, Giuseppe Abrami, Andy L{\"u}cking, Alex Mehler, er

Lemmatization

Paper
Add Code

Neural Morphological Tagging of Lemma Sequences for Machine Translation

no code implementations • WS 2018 • Costanza Conforti, Matthias Huck, Alex Fraser, er

LEMMA Machine Translation +2

Paper
Add Code

NITMZ-JU at IJCNLP-2017 Task 4: Customer Feedback Analysis

no code implementations • IJCNLP 2017 • Somnath Banerjee, Partha Pakray, Riyanka Manna, Dipankar Das, Alex Gelbukh, er

In this paper, we describe a deep learning framework for analyzing the customer feedback as part of our participation in the shared task on Customer Feedback Analysis at the 8th International Joint Conference on Natural Language Processing (IJCNLP 2017).

Text Classification

Paper
Add Code

Event Ordering with a Generalized Model for Sieve Prediction Ranking

no code implementations • IJCNLP 2017 • Bill McDowell, Nathanael Chambers, Alex Ororbia II, er, David Reitter

Within this prediction reranking framework, we propose an alternative scoring function, showing an 8. 8{\%} relative gain over the original CAEVO.

Word Embeddings

Paper
Add Code

Target-side Word Segmentation Strategies for Neural Machine Translation

no code implementations • WS 2017 • Matthias Huck, Simon Riess, Alex Fraser, er

Machine Translation Translation

Paper
Add Code

Linguistic realisation as machine translation: Comparing different MT models for AMR-to-text generation

no code implementations • WS 2017 • Thiago Castro Ferreira, Iacer Calixto, S Wubben, er, Emiel Krahmer

In this paper, we study AMR-to-text generation, framing it as a translation task and comparing two different MT approaches (Phrase-based and Neural MT).

AMR-to-Text Generation Machine Translation +2

Paper
Add Code

Parsing Minimalist Languages with Interpreted Regular Tree Grammars

no code implementations • WS 2017 • Meaghan Fowlie, Alex Koller, er

Paper
Add Code

Rhetorical relations markers in Russian RST Treebank

no code implementations • WS 2017 • Svetlana Toldova, Dina Pisarevskaya, Margarita Ananyeva, Maria Kobozeva, Alex Nasedkin, er, Sofia Nikiforova, Irina Pavlova, Alexey Shelepov

Coreference Resolution Question Answering +1

Paper
Add Code

Coarse-to-Fine Attention Models for Document Summarization

no code implementations • WS 2017 • Jeffrey Ling, Alex Rush, er

Sequence-to-sequence models with attention have been successful for a variety of NLP problems, but their speed does not scale well for tasks with long source sequences such as document summarization.

Ranked #25 on Document Summarization on CNN / Daily Mail

Document Summarization Machine Translation +1

Paper
Add Code

A Feature Structure Algebra for FTAG

no code implementations • WS 2017 • Alex Koller, er

Paper
Add Code

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2017

no code implementations • WS 2017 • Ngoc-Quan Pham, Jan Niehues, Thanh-Le Ha, Eunah Cho, Matthias Sperber, Alex Waibel, er

Domain Adaptation Machine Translation +1

Paper
Add Code

Entity-Centric Information Access with Human in the Loop for the Biomedical Domain

no code implementations • RANLP 2017 • Seid Muhie Yimam, Steffen Remus, Alex Panchenko, er, Andreas Holzinger, Chris Biemann

In this paper, we describe the concept of entity-centric information access for the biomedical domain.

Management

Paper
Add Code

Word Sense Disambiguation with Recurrent Neural Networks

no code implementations • RANLP 2017 • Alex Popov, er

This paper presents a neural network architecture for word sense disambiguation (WSD).

Word Sense Disambiguation

Paper
Add Code

LMU Munich's Neural Machine Translation Systems for News Articles and Health Information Texts

no code implementations • WS 2017 • Matthias Huck, Fabienne Braune, Alex Fraser, er

Machine Translation Translation

Paper
Add Code

Coarse-To-Fine Parsing for Expressive Grammar Formalisms

no code implementations • WS 2017 • Christoph Teichmann, Alex Koller, er, Jonas Groschwitz

We generalize coarse-to-fine parsing to grammar formalisms that are more expressive than PCFGs and/or describe languages of trees or graphs.

Paper
Add Code

Integrated sentence generation using charts

no code implementations • WS 2017 • Alex Koller, er, Nikos Engonopoulos

Integrating surface realization and the generation of referring expressions into a single algorithm can improve the quality of the generated sentences.

Sentence Text Generation

Paper
Add Code

Detecting Metaphorical Phrases in the Polish Language

no code implementations • RANLP 2017 • Aleks Wawer, er, Agnieszka Mykowiecka

In this paper we describe experiments with automated detection of metaphors in the Polish language.

Machine Translation Natural Language Inference +1

Paper
Add Code

PASS: A Dutch data-to-text system for soccer, targeted towards specific audiences

no code implementations • WS 2017 • Chris van der Lee, Emiel Krahmer, S Wubben, er

We present PASS, a data-to-text system that generates Dutch soccer reports from match statistics.

Data-to-Text Generation

Paper
Add Code

Parameter Free Hierarchical Graph-Based Clustering for Analyzing Continuous Word Embeddings

no code implementations • WS 2017 • Thomas Alex Trost, er, Dietrich Klakow

Word embeddings are high-dimensional vector representations of words and are thus difficult to interpret.

Clustering Dimensionality Reduction +1

Paper
Add Code

A Multimodal Dialogue System for Medical Decision Support inside Virtual Reality

no code implementations • WS 2017 • Alex Prange, er, Margarita Chikobava, Peter Poller, Michael Barz, Daniel Sonntag

We present a multimodal dialogue system that allows doctors to interact with a medical decision support system in virtual reality (VR).

Paper
Add Code

Conjunctive Categorial Grammars

no code implementations • WS 2017 • Stepan Kuznetsov, Alex Okhotin, er

Paper
Add Code

Annotating tense, mood and voice for English, French and German

no code implementations • ACL 2017 • Anita Ramm, Sharid Lo{\'a}iciga, Annemarie Friedrich, Alex Fraser, er

Paper
Add Code

Generating Contrastive Referring Expressions

no code implementations • ACL 2017 • Mart{\'\i}n Villalba, Christoph Teichmann, Alex Koller, er

The referring expressions (REs) produced by a natural language generation (NLG) system can be misunderstood by the hearer, even when they are semantically correct.

Text Generation

Paper
Add Code

Statistical Models for Unsupervised, Semi-Supervised Supervised Transliteration Mining

no code implementations • CL 2017 • Hassan Sajjad, Helmut Schmid, Alex Fraser, er, Hinrich Sch{\"u}tze

After training, the unlabeled data is disambiguated based on the posterior probabilities of the two sub-models.

Transliteration

Paper
Add Code

Coreference Resolution for Swedish and German using Distant Supervision

no code implementations • WS 2017 • Alex Wallin, er, Pierre Nugues

coreference-resolution Knowledge Graphs +1

Paper
Add Code

The ContrastMedium Algorithm: Taxonomy Induction From Noisy Knowledge Graphs With Just A Few Links

no code implementations • EACL 2017 • Stefano Faralli, Alex Panchenko, er, Chris Biemann, Simone Paolo Ponzetto

In this paper, we present ContrastMedium, an algorithm that transforms noisy semantic networks into full-fledged, clean taxonomies.

Knowledge Graphs Open Information Extraction

Paper
Add Code

A tool for extracting sense-disambiguated example sentences through user feedback

no code implementations • EACL 2017 • Beto Boullosa, Richard Eckart de Castilho, Alex Geyken, er, Lothar Lemnitzer, Iryna Gurevych

This paper describes an application system aimed to help lexicographers in the extraction of example sentences for a given headword based on its different senses.

Clustering General Classification

Paper
Add Code

Generating flexible proper name references in text: Data, models and evaluation

no code implementations • EACL 2017 • Thiago Castro Ferreira, Emiel Krahmer, S Wubben, er

The model relies on the REGnames corpus, a dataset with 53, 102 proper name references to 1, 000 people in different discourse contexts.

Text Generation

Paper
Add Code

Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction and Disambiguation

no code implementations • EACL 2017 • Alex Panchenko, er, Eugen Ruppert, Stefano Faralli, Simone Paolo Ponzetto, Chris Biemann

On the example of word sense induction and disambiguation (WSID), we show that it is possible to develop an interpretable model that matches the state-of-the-art models in accuracy.

Word Embeddings Word Sense Induction

Paper
Add Code

TextImager as a Generic Interface to R

no code implementations • EACL 2017 • Tolga Uslu, Wahed Hemati, Alex Mehler, er, Daniel Baumartz

R is a very powerful framework for statistical modeling.

Paper
Add Code

Alto: Rapid Prototyping for Parsing and Translation

no code implementations • EACL 2017 • Johannes Gontrum, Jonas Groschwitz, Alex Koller, er, Christoph Teichmann

We present Alto, a rapid prototyping tool for new grammar formalisms.

Machine Translation Semantic Parsing +1

Paper
Add Code

Supervised and Unsupervised Word Sense Disambiguation on Word Embedding Vectors of Unambigous Synonyms

no code implementations • WS 2017 • Aleks Wawer, er, Agnieszka Mykowiecka

This paper compares two approaches to word sense disambiguation using word embeddings trained on unambiguous synonyms.

Word Embeddings Word Sense Disambiguation

Paper
Add Code

Producing Unseen Morphological Variants in Statistical Machine Translation

no code implementations • EACL 2017 • Matthias Huck, Ale{\v{s}} Tamchyna, Ond{\v{r}}ej Bojar, Alex Fraser, er

Translating into morphologically rich languages is difficult.

Machine Translation Translation

Paper
Add Code

Audience Segmentation in Social Media

no code implementations • EACL 2017 • Verena Henrich, Alex Lang, er

Understanding the social media audience is becoming increasingly important for social media analysis.

Segmentation Sentiment Analysis

Paper
Add Code

Addressing Problems across Linguistic Levels in SMT: Combining Approaches to Model Morphology, Syntax and Lexical Choice

no code implementations • EACL 2017 • Marion Weller-Di Marco, Alex Fraser, er, Sabine Schulte im Walde

Many errors in phrase-based SMT can be attributed to problems on three linguistic levels: morphological complexity in the target language, structural differences and lexical choice.

Word Alignment Word Sense Disambiguation

Paper
Add Code

Understanding the Semantics of Narratives of Interpersonal Violence through Reader Annotations and Physiological Reactions

no code implementations • WS 2017 • Alex Calderwood, er, Elizabeth A. Pruett, Raymond Ptucha, Christopher Homan, Cecilia Ovesdotter Alm

Interpersonal violence (IPV) is a prominent sociological problem that affects people of all demographic backgrounds.

coreference-resolution Semantic Role Labeling +1

Paper
Add Code

Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation

no code implementations • WS 2017 • Alex Panchenko, er, Stefano Faralli, Simone Paolo Ponzetto, Chris Biemann

We introduce a new method for unsupervised knowledge-based word sense disambiguation (WSD) based on a resource that links two types of sense-aware lexical networks: one is induced from a corpus using distributional semantics, the other is manually constructed.

Machine Translation Translation +2

Paper
Add Code

A constrained graph algebra for semantic parsing with AMRs

no code implementations • WS 2017 • Jonas Groschwitz, Meaghan Fowlie, Mark Johnson, Alex Koller, er

Semantic Parsing

Paper
Add Code

Towards grounding computational linguistic approaches to readability: Modeling reader-text interaction for easy and difficult texts

no code implementations • WS 2016 • Sowmya Vajjala, Detmar Meurers, Alex Eitel, er, Katharina Scheiter

Computational approaches to readability assessment are generally built and evaluated using gold standard corpora labeled by publishers or teachers rather than being grounded in observations about human performance.

Paper
Add Code

A Comparison Between Morphological Complexity Measures: Typological Data vs. Language Corpora

no code implementations • WS 2016 • Christian Bentz, Tatyana Ruzsics, Alex Koplenig, er, Tanja Samard{\v{z}}i{\'c}

Language complexity is an intriguing phenomenon argued to play an important role in both language learning and processing.

Machine Translation Word Alignment

Paper
Add Code

Interactive Relation Extraction in Main Memory Database Systems

no code implementations • COLING 2016 • Rudolf Schneider, Cordula Guder, Torsten Kilias, Alex L{\"o}ser, er, Jens Graupmann, Oleks Kozachuk, R

We present INDREX-MM, a main memory database system for interactively executing two interwoven tasks, declarative relation extraction from text and their exploitation with SQL.

Open Information Extraction Relation +1

Paper
Add Code

TextImager: a Distributed UIMA-based System for NLP

no code implementations • COLING 2016 • Wahed Hemati, Tolga Uslu, Alex Mehler, er

More and more disciplines require NLP tools for performing automatic text analyses on various levels of linguistic resolution.

Sentiment Analysis Text Classification

Paper
Add Code

Unsupervised Abbreviation Detection in Clinical Narratives

no code implementations • WS 2016 • Markus Kreuzthaler, Michel Oleynik, Alex Avian, er, Stefan Schulz

The disambiguation of period characters is therefore an important task for sentence and abbreviation detection.

Feature Engineering Sentence

Paper
Add Code

A Proposition-Based Abstractive Summariser

no code implementations • COLING 2016 • Yimai Fang, Haoyue Zhu, Ewa Muszy{\'n}ska, Alex Kuhnle, er, Simone Teufel

It is a further development of an existing summariser that has an incremental, proposition-based content selection process but lacks a natural language (NL) generator for the final output.

Language Modelling Sentence +1

Paper
Add Code

Challenges and Solutions for Latin Named Entity Recognition

no code implementations • WS 2016 • Alex Erdmann, er, Christopher Brown, Brian Joseph, Mark Janse, Petra Ajaka, Micha Elsner, Marie-Catherine de Marneffe

Although spanning thousands of years and genres as diverse as liturgy, historiography, lyric and other forms of prose and poetry, the body of Latin texts is still relatively sparse compared to English.

Active Learning Domain Adaptation +5

Paper
Add Code

TASTY: Interactive Entity Linking As-You-Type

no code implementations • COLING 2016 • Sebastian Arnold, Robert Dziuba, Alex L{\"o}ser, er

We introduce TASTY (Tag-as-you-type), a novel text editor for interactive entity linking as part of the writing process.

Entity Linking TAG +2

Paper
Add Code

Abstractive Compression of Captions with Attentive Recurrent Neural Networks

no code implementations • WS 2016 • S Wubben, er, Emiel Krahmer, Antal Van den Bosch, Suzan Verberne

Machine Translation Sentence Compression +1

Paper
Add Code

Towards proper name generation: a corpus analysis

no code implementations • WS 2016 • Thiago Castro Ferreira, S Wubben, er, Emiel Krahmer

Text Generation

Paper
Add Code

A Wizard-of-Oz Study on A Non-Task-Oriented Dialog Systems That Reacts to User Engagement

no code implementations • WS 2016 • Zhou Yu, Leah Nicolich-Henkin, Alan W. black, Alex Rudnicky, er

Machine Translation

Paper
Add Code

Strategy and Policy Learning for Non-Task-Oriented Conversational Systems

no code implementations • WS 2016 • Zhou Yu, Ziyu Xu, Alan W. black, Alex Rudnicky, er

Machine Translation

Paper
Add Code

A Framework for Discriminative Rule Selection in Hierarchical Moses

no code implementations • WS 2016 • Fabienne Braune, Alex Fraser, er, Hal Daum{\'e} III, Ale{\v{s}} Tamchyna

Machine Translation

Paper
Add Code

HPI Question Answering System in BioASQ 2016

no code implementations • WS 2016 • Frederik Schulze, Ricarda Sch{\"u}ler, Tim Draeger, Daniel Dummer, Alex Ernst, er, Pedro Flemming, Cindy Perscheid, Mariana Neves

Question Answering

Paper
Add Code

Adaptive Importance Sampling from Finite State Automata

no code implementations • WS 2016 • Christoph Teichmann, Kasimir Wansing, Alex Koller, er

Paper
Add Code

PROMT Translation Systems for WMT 2016 Translation Tasks

no code implementations • WS 2016 • Alex Molchanov, er, Fedor Bykov

Machine Translation Translation

Paper
Add Code

Modeling verbal inflection for English to German SMT

no code implementations • WS 2016 • Anita Ramm, Alex Fraser, er

Machine Translation

Paper
Add Code

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2016

no code implementations • WS 2016 • Thanh-Le Ha, Eunah Cho, Jan Niehues, Mohammed Mediani, Matthias Sperber, Alex Allauzen, re, Alex Waibel, er

Machine Translation Translation

Paper
Add Code

The Edinburgh/LMU Hierarchical Machine Translation System for WMT 2016

no code implementations • WS 2016 • Matthias Huck, Alex Fraser, er, Barry Haddow

Machine Translation Translation

Paper
Add Code

The QT21/HimL Combined Machine Translation System

no code implementations • WS 2016 • Jan-Thorsten Peter, Tamer Alkhouli, Hermann Ney, Matthias Huck, Fabienne Braune, Alex Fraser, er, Ale{\v{s}} Tamchyna, Ond{\v{r}}ej Bojar, Barry Haddow, Rico Sennrich, Fr{\'e}d{\'e}ric Blain, Lucia Specia, Jan Niehues, Alex Waibel, Alex Allauzen, re, Lauriane Aufrant, Franck Burlot, Elena Knyazeva, Thomas Lavergne, Fran{\c{c}}ois Yvon, M{\=a}rcis Pinnis, Stella Frank

Ranked #12 on Machine Translation on WMT2016 English-Romanian

Machine Translation Translation

Paper
Add Code

Modeling Complement Types in Phrase-Based SMT

no code implementations • WS 2016 • Marion Weller-Di Marco, Alex Fraser, er, Sabine Schulte im Walde

Machine Translation

Paper
Add Code

new/s/leak -- Information Extraction and Visualization for Investigative Data Journalists

no code implementations • ACL 2016 • Seid Muhie Yimam, Heiner Ulrich, von L, Tatiana esberger, Marcel Rosenbach, Michaela Regneri, Alex Panchenko, er, Franziska Lehmann, Uli Fahrer, Chris Biemann, Kathrin Ballweg

Paper
Add Code

Towards more variation in text generation: Developing and evaluating variation models for choice of referential form

no code implementations • ACL 2016 • Thiago Castro Ferreira, Emiel Krahmer, S Wubben, er

Text Generation

Paper
Add Code

Text2voronoi: An Image-driven Approach to Differential Diagnosis

no code implementations • WS 2016 • Alex Mehler, er, Tolga Uslu, Wahed Hemati

Text Categorization

Paper
Add Code

CUNI-LMU Submissions in WMT2016: Chimera Constrained and Beaten

no code implementations • WS 2016 • Ale{\v{s}} Tamchyna, Roman Sudarikov, Ond{\v{r}}ej Bojar, Alex Fraser, er

Machine Translation

Paper
Add Code

Efficient techniques for parsing with tree automata

no code implementations • ACL 2016 • Jonas Groschwitz, Alex Koller, er, Mark Johnson

Machine Translation Semantic Parsing

Paper
Add Code

Individual Variation in the Choice of Referential Form

no code implementations • NAACL 2016 • Thiago Castro Ferreira, Emiel Krahmer, S Wubben, er

Text Generation

Paper
Add Code

NRU-HSE at SemEval-2016 Task 4: Comparative Analysis of Two Iterative Methods Using Quantification Library

no code implementations • SEMEVAL 2016 • Nikolay Karpov, Alex Porshnev, er, Kirill Rudakov

Document Classification Opinion Mining +2

Paper
Add Code

JUNITMZ at SemEval-2016 Task 1: Identifying Semantic Similarity Using Levenshtein Ratio

no code implementations • SEMEVAL 2016 • S. Sarkar, ip, Dipankar Das, Partha Pakray, Alex Gelbukh, er

Information Retrieval Machine Translation +3

Paper
Add Code

Towards Semantic-based Hybrid Machine Translation between Bulgarian and English

no code implementations • WS 2016 • Kiril Simov, Petya Osenova, Alex Popov, er

Common Sense Reasoning Language Modelling +2

Paper
Add Code

TAXI at SemEval-2016 Task 13: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling

no code implementations • SEMEVAL 2016 • Alex Panchenko, er, Stefano Faralli, Eugen Ruppert, Steffen Remus, Hubert Naets, C{\'e}drick Fairon, Simone Paolo Ponzetto, Chris Biemann

Language Modelling

Paper
Add Code

SatiricLR: a Language Resource of Satirical News Articles

1 code implementation • LREC 2016 • Alice Frain, S Wubben, er

We test the viability of our data on the task of classification of satire.

General Classification

Paper
Code

Best of Both Worlds: Making Word Sense Embeddings Interpretable

no code implementations • LREC 2016 • Alex Panchenko, er

Word sense embeddings represent a word sense as a low-dimensional numeric vector.

Paper
Add Code

Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus

no code implementations • LREC 2016 • Andy Luecking, Alex Mehler, er, D{\'e}sir{\'e}e Walther, Marcel Mauri, Dennis Kurf{\"u}rst

The stimulus terms have been compiled mainly from image schemata from psycholinguistics, since such schemata provide a panoply of abstract contents derived from natural language use.

Descriptive

Paper
Add Code

OPFI: A Tool for Opinion Finding in Polish

no code implementations • LREC 2016 • Aleks Wawer, er

The paper contains a description of OPFI: Opinion Finder for the Polish Language, a freely available tool for opinion target extraction.

Dependency Parsing

Paper
Add Code

Could Speaker, Gender or Age Awareness be beneficial in Speech-based Emotion Recognition?

no code implementations • LREC 2016 • Maxim Sidorov, Alex Schmitt, er, Eugene Semenkin, Wolfgang Minker

Emotion Recognition (ER) is an important part of dialogue analysis which can be used in order to improve the quality of Spoken Dialogue Systems (SDSs).

Emotion Recognition Spoken Dialogue Systems

Paper
Add Code

Resources for building applications with Dependency Minimal Recursion Semantics

no code implementations • LREC 2016 • Ann Copestake, Guy Emerson, Michael Wayne Goodman, Matic Horvat, Alex Kuhnle, er, Ewa Muszy{\'n}ska

We describe resources aimed at increasing the usability of the semantic representations utilized within the DELPH-IN (Deep Linguistic Processing with HPSG) consortium.

Paper
Add Code

TLT-CRF: A Lexicon-supported Morphological Tagger for Latin Based on Conditional Random Fields

no code implementations • LREC 2016 • Tim vor der Br{\"u}ck, Alex Mehler, er

We present a morphological tagger for Latin, called TTLab Latin Tagger based on Conditional Random Fields (TLT-CRF) which uses a large Latin lexicon.

POS

Paper
Add Code

TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics

no code implementations • LREC 2016 • Andy Luecking, Armin Hoenen, Alex Mehler, er

In order to introduce TGermaCorp in comparison to more homogeneous corpora of contemporary everyday language, quantitative assessments of syntactic and lexical diversity are provided.

LEMMA POS

Paper
Add Code

Lemmatization and Morphological Tagging in German and Latin: A Comparison and a Survey of the State-of-the-art

no code implementations • LREC 2016 • Steffen Eger, R{\"u}diger Gleim, Alex Mehler, er

This paper relates to the challenge of morphological tagging and lemmatization in morphologically rich languages by example of German and Latin.

Lemmatization Morphological Tagging +2

Paper
Add Code

TTS for Low Resource Languages: A Bangla Synthesizer

no code implementations • LREC 2016 • Alex Gutkin, er, Linne Ha, Martin Jansche, Knot Pipatsrisawat, Richard Sproat

We present a text-to-speech (TTS) system designed for the dialect of Bengali spoken in Bangladesh.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.