Search Results for author: Ruslan Mitkov

Found 47 papers, 9 papers with code

A Comparison between Named Entity Recognition Models in the Biomedical Domain

1 code implementation TRITON 2021 Maria Carmela Cariello, Alessandro Lenci, Ruslan Mitkov

The domain-specialised application of Named Entity Recognition (NER) is known as Biomedical NER (BioNER), which aims to identify and classify biomedical concepts that are of interest to researchers, such as genes, proteins, chemical compounds, drugs, mutations, diseases, and so on.

named-entity-recognition Named Entity Recognition +2

Benchmarking ASR Systems Based on Post-Editing Effort and Error Analysis

no code implementations TRITON 2021 Martha Maria Papadopoulou, Anna Zaretskaya, Ruslan Mitkov

This paper offers a comparative evaluation of four commercial ASR systems which are evaluated according to the post-editing effort required to reach “publishable” quality and according to the number of errors they produce.

Benchmarking

Interactive Models for Post-Editing

no code implementations TRITON 2021 Marie Escribe, Ruslan Mitkov

Despite the increasingly good quality of Machine Translation (MT) systems, MT outputs require corrections.

Automatic Post-Editing Translation

Fake News Detection for Portuguese with Deep Learning

no code implementations TRITON 2021 Lígia Venturott, Ruslan Mitkov

The exponential growth of the internet and social media in the past decade gave way to the increase in dissemination of false or misleading information.

Fact Checking Fake News Detection

Cross-Lingual Named Entity Recognition via FastAlign: a Case Study

no code implementations TRITON 2021 Ali Hatami, Ruslan Mitkov, Gloria Corpas Pastor

In this paper, we compare the performance of two alignment methods, Grow-diag-final-and and Intersect Symmetrisation heuristics, to exploit the annotation projection of English-Brazilian Portuguese bilingual corpus to detect named entities in Brazilian Portuguese.

Machine Translation named-entity-recognition +5

Towards New Generation Translation Memory Systems

no code implementations RANLP 2021 Nikola Spasovski, Ruslan Mitkov

Despite the enormous popularity of Translation Memory systems and the active research in the field, their language processing features still suffer from certain limitations.

Translation

Paragraph Similarity Matches for Generating Multiple-choice Test Items

no code implementations RANLP 2021 Halyna Maslak, Ruslan Mitkov

Although the research on the automatic or semi-automatic generation of multiple-choice test items has been conducted since the beginning of this millennium, most approaches focus on generating questions from a single sentence.

Management Multiple-choice +5

Translationese in Russian Literary Texts

no code implementations EMNLP (LaTeCHCLfL, CLFL, LaTeCH) 2021 Maria Kunilovskaya, Ekaterina Lapshinova-Koltunski, Ruslan Mitkov

We expect that literary translations from typologically distant languages should exhibit more translationese, and the fingerprints of individual source languages (and their families) are traceable in translations.

Specificity

DORE: A Dataset For Portuguese Definition Generation

no code implementations26 Mar 2024 Anna Beatriz Dimas Furtado, Tharindu Ranasinghe, Frédéric Blain, Ruslan Mitkov

In this research, we fill this gap by introducing DORE; the first dataset for Definition MOdelling for PoRtuguEse containing more than 100, 000 definitions.

Definition Modelling Text Generation

Can Model Fusing Help Transformers in Long Document Classification? An Empirical Study

1 code implementation18 Jul 2023 Damith Premasiri, Tharindu Ranasinghe, Ruslan Mitkov

Text classification is an area of research which has been studied over the years in Natural Language Processing (NLP).

Document Classification text-classification

Transformer-based Detection of Multiword Expressions in Flower and Plant Names

no code implementations16 Sep 2022 Damith Premasiri, Amal Haddad Haddad, Tharindu Ranasinghe, Ruslan Mitkov

In this paper, we explore state-of-the-art neural transformers in the task of detecting MWEs in flower and plant names.

Machine Translation Translation

DTW at Qur'an QA 2022: Utilising Transfer Learning with Transformers for Question Answering in a Low-resource Domain

1 code implementation12 May 2022 Damith Premasiri, Tharindu Ranasinghe, Wajdi Zaghouani, Ruslan Mitkov

The goal of the Qur'an QA 2022 shared task is to fill this gap by producing state-of-the-art question answering and reading comprehension research on Qur'an.

Ensemble Learning Machine Reading Comprehension +3

Biographical: A Semi-Supervised Relation Extraction Dataset

no code implementations2 May 2022 Alistair Plum, Tharindu Ranasinghe, Spencer Jones, Constantin Orasan, Ruslan Mitkov

The dataset, which is aimed towards digital humanities (DH) and historical research, is automatically compiled by aligning sentences from Wikipedia articles with matching structured data from sources including Pantheon and Wikidata.

Knowledge Graphs named-entity-recognition +6

TransQuest: Translation Quality Estimation with Cross-lingual Transformers

1 code implementation COLING 2020 Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Recent years have seen big advances in the field of sentence-level quality estimation (QE), largely as a result of using neural-based architectures.

Sentence Transfer Learning +1

TransQuest at WMT2020: Sentence-Level Direct Assessment

1 code implementation WMT (EMNLP) 2020 Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

This paper presents the team TransQuest's participation in Sentence-Level Direct Assessment shared task in WMT 2020.

Data Augmentation Sentence

Classifying Referential and Non-referential It Using Gaze

1 code implementation EMNLP 2018 Victoria Yaneva, Le An Ha, Richard Evans, Ruslan Mitkov

When processing a text, humans and machines must disambiguate between different uses of the pronoun it, including non-referential, nominal anaphoric or clause anaphoric ones.

POS

Intelligent Translation Memory Matching and Retrieval with Sentence Encoders

no code implementations EAMT 2020 Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Matching and retrieving previously translated segments from a Translation Memory is the key functionality in Translation Memories systems.

Retrieval Sentence +1

Semantic Textual Similarity with Siamese Neural Networks

no code implementations RANLP 2019 Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Calculating the Semantic Textual Similarity (STS) is an important research area in natural language processing which plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction.

Information Retrieval Question Answering +3

Enhancing Unsupervised Sentence Similarity Methods with Deep Contextualised Word Representations

no code implementations RANLP 2019 Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Calculating Semantic Textual Similarity (STS) plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction.

Contextualised Word Representations Information Retrieval +6

What Influences the Features of Post-editese? A Preliminary Study

no code implementations RANLP 2019 Sheila Castilho, Nat{\'a}lia Resende, Ruslan Mitkov

While a number of studies have shown evidence of translationese phenomena, that is, statistical differences between original texts and translated texts (Gellerstam, 1986), results of studies searching for translationese features in postedited texts (what has been called {''}posteditese{''} (Daems et al., 2017)) have presented mixed results.

Translation

Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions

2 code implementations NAACL 2019 Omid Rohanian, Shiva Taslimipoor, Samaneh Kouchaki, Le An Ha, Ruslan Mitkov

We introduce a new method to tag Multiword Expressions (MWEs) using a linguistically interpretable language-independent deep learning architecture.

TAG

WLV at SemEval-2018 Task 3: Dissecting Tweets in Search of Irony

no code implementations SEMEVAL 2018 Omid Rohanian, Shiva Taslimipoor, Richard Evans, Ruslan Mitkov

This paper describes the systems submitted to SemEval 2018 Task 3 {``}Irony detection in English tweets{''} for both subtasks A and B.

Sentiment Analysis

Effects of Lexical Properties on Viewing Time per Word in Autistic and Neurotypical Readers

no code implementations WS 2017 Sanja {\v{S}}tajner, Victoria Yaneva, Ruslan Mitkov, Simone Paolo Ponzetto

Eye tracking studies from the past few decades have shaped the way we think of word complexity and cognitive load: words that are long, rare and ambiguous are more difficult to read.

Lexical Simplification

Translation Memory Systems Have a Long Way to Go

no code implementations RANLP 2017 Andrea Silvestre Baquero, Ruslan Mitkov

These tools operate on fuzzy (surface) matching mostly and cannot benefit from already translated texts which are synonymous to (or paraphrased versions of) the text to be translated.

Machine Translation Sentence +1

A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults

no code implementations LREC 2016 Victoria Yaneva, Irina Temnikova, Ruslan Mitkov

This division of the groups informs researchers on whether particular fixations were elicited from skillful or less-skillful readers and allows a fair between-group comparison for two levels of reading ability.

Multiple-choice POS +2

Evaluating the Readability of Text Simplification Output for Readers with Cognitive Disabilities

no code implementations LREC 2016 Victoria Yaneva, Irina Temnikova, Ruslan Mitkov

This paper presents an approach for automatic evaluation of the readability of text simplification output for readers with cognitive disabilities.

Reading Comprehension Text Simplification

Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach

no code implementations LREC 2012 Sanja {\v{S}}tajner, Ruslan Mitkov

In British English, we compared the complexity of texts published in 1931, 1961 and 1991, while in American English we compared the complexity of texts published in 1961 and 1992.

Machine Translation Sentence +1

CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations

no code implementations LREC 2012 Irina Temnikova, Constantin Orasan, Ruslan Mitkov

This article presents a new linguistic resource in the form of Controlled Language (CL) guidelines for manual text simplification in the CM domain which aims to address high TC in the CM domain and produce clear messages to be used in crisis situations.

Machine Translation Management +3

Cannot find the paper you are looking for? You can Submit a new open access paper.