Search Results for author: Ruslan Mitkov

Found 47 papers, 9 papers with code

A Comparison between Named Entity Recognition Models in the Biomedical Domain

1 code implementation • TRITON 2021 • Maria Carmela Cariello, Alessandro Lenci, Ruslan Mitkov

The domain-specialised application of Named Entity Recognition (NER) is known as Biomedical NER (BioNER), which aims to identify and classify biomedical concepts that are of interest to researchers, such as genes, proteins, chemical compounds, drugs, mutations, diseases, and so on.

named-entity-recognition Named Entity Recognition +2

Paper
Code

Benchmarking ASR Systems Based on Post-Editing Effort and Error Analysis

no code implementations • TRITON 2021 • Martha Maria Papadopoulou, Anna Zaretskaya, Ruslan Mitkov

This paper offers a comparative evaluation of four commercial ASR systems which are evaluated according to the post-editing effort required to reach “publishable” quality and according to the number of errors they produce.

Benchmarking

Paper
Add Code

Interactive Models for Post-Editing

no code implementations • TRITON 2021 • Marie Escribe, Ruslan Mitkov

Despite the increasingly good quality of Machine Translation (MT) systems, MT outputs require corrections.

Automatic Post-Editing Translation

Paper
Add Code

Fake News Detection for Portuguese with Deep Learning

no code implementations • TRITON 2021 • Lígia Venturott, Ruslan Mitkov

The exponential growth of the internet and social media in the past decade gave way to the increase in dissemination of false or misleading information.

Fact Checking Fake News Detection

Paper
Add Code

Cross-Lingual Named Entity Recognition via FastAlign: a Case Study

no code implementations • TRITON 2021 • Ali Hatami, Ruslan Mitkov, Gloria Corpas Pastor

In this paper, we compare the performance of two alignment methods, Grow-diag-final-and and Intersect Symmetrisation heuristics, to exploit the annotation projection of English-Brazilian Portuguese bilingual corpus to detect named entities in Brazilian Portuguese.

Machine Translation named-entity-recognition +5

Paper
Add Code

DTW at Qur’an QA 2022: Utilising Transfer Learning with Transformers for Question Answering in a Low-resource Domain

1 code implementation • OSACT (LREC) 2022 • Damith Premasiri, Tharindu Ranasinghe, Wajdi Zaghouani, Ruslan Mitkov

The goal of the Qur’an QA 2022 shared task is to fill this gap by producing state-of-the-art question answering and reading comprehension research on Qur’an.

Ensemble Learning Machine Reading Comprehension +3

Paper
Code

Towards New Generation Translation Memory Systems

no code implementations • RANLP 2021 • Nikola Spasovski, Ruslan Mitkov

Despite the enormous popularity of Translation Memory systems and the active research in the field, their language processing features still suffer from certain limitations.

Translation

Paper
Add Code

Paragraph Similarity Matches for Generating Multiple-choice Test Items

no code implementations • RANLP 2021 • Halyna Maslak, Ruslan Mitkov

Although the research on the automatic or semi-automatic generation of multiple-choice test items has been conducted since the beginning of this millennium, most approaches focus on generating questions from a single sentence.

Management Multiple-choice +5

Paper
Add Code

Fiction in Russian Translation: A Translationese Study

no code implementations • RANLP 2021 • Maria Kunilovskaya, Ekaterina Lapshinova-Koltunski, Ruslan Mitkov

The texts are represented with frequency-based features that capture structural and lexical properties of language.

Binary Classification Language Identification +1

Paper
Add Code

Translationese in Russian Literary Texts

no code implementations • EMNLP (LaTeCHCLfL, CLFL, LaTeCH) 2021 • Maria Kunilovskaya, Ekaterina Lapshinova-Koltunski, Ruslan Mitkov

We expect that literary translations from typologically distant languages should exhibit more translationese, and the fingerprints of individual source languages (and their families) are traceable in translations.

Specificity

Paper
Add Code

DORE: A Dataset For Portuguese Definition Generation

no code implementations • 26 Mar 2024 • Anna Beatriz Dimas Furtado, Tharindu Ranasinghe, Frédéric Blain, Ruslan Mitkov

In this research, we fill this gap by introducing DORE; the first dataset for Definition MOdelling for PoRtuguEse containing more than 100, 000 definitions.

Definition Modelling Text Generation

Paper
Add Code

Can Model Fusing Help Transformers in Long Document Classification? An Empirical Study

1 code implementation • 18 Jul 2023 • Damith Premasiri, Tharindu Ranasinghe, Ruslan Mitkov

Text classification is an area of research which has been studied over the years in Natural Language Processing (NLP).

Document Classification text-classification

Paper
Code

Deep Learning Methods for Extracting Metaphorical Names of Flowers and Plants

no code implementations • 18 May 2023 • Amal Haddad Haddad, Damith Premasiri, Tharindu Ranasinghe, Ruslan Mitkov

The domain of Botany is rich with metaphorical terms.

Machine Translation Translation

Paper
Add Code

Transformer-based Detection of Multiword Expressions in Flower and Plant Names

no code implementations • 16 Sep 2022 • Damith Premasiri, Amal Haddad Haddad, Tharindu Ranasinghe, Ruslan Mitkov

In this paper, we explore state-of-the-art neural transformers in the task of detecting MWEs in flower and plant names.

Machine Translation Translation

Paper
Add Code

DTW at Qur'an QA 2022: Utilising Transfer Learning with Transformers for Question Answering in a Low-resource Domain

1 code implementation • 12 May 2022 • Damith Premasiri, Tharindu Ranasinghe, Wajdi Zaghouani, Ruslan Mitkov

The goal of the Qur'an QA 2022 shared task is to fill this gap by producing state-of-the-art question answering and reading comprehension research on Qur'an.

Ensemble Learning Machine Reading Comprehension +3

Paper
Code

Biographical: A Semi-Supervised Relation Extraction Dataset

no code implementations • 2 May 2022 • Alistair Plum, Tharindu Ranasinghe, Spencer Jones, Constantin Orasan, Ruslan Mitkov

The dataset, which is aimed towards digital humanities (DH) and historical research, is automatically compiled by aligning sentences from Wikipedia articles with matching structured data from sources including Pantheon and Wikidata.

Knowledge Graphs named-entity-recognition +6

Paper
Add Code

An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers

1 code implementation • ACL 2021 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Most studies on word-level Quality Estimation (QE) of machine translation focus on language-specific models.

Machine Translation Translation

102

Paper
Code

RGCL at SemEval-2020 Task 6: Neural Approaches to DefinitionExtraction

no code implementations • SEMEVAL 2020 • Tharindu Ranasinghe, Alistair Plum, Constantin Orasan, Ruslan Mitkov

This paper presents the RGCL team submission to SemEval 2020 Task 6: DeftEval, subtasks 1 and 2.

Sentence

Paper
Add Code

TransQuest: Translation Quality Estimation with Cross-lingual Transformers

1 code implementation • COLING 2020 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Recent years have seen big advances in the field of sentence-level quality estimation (QE), largely as a result of using neural-based architectures.

Sentence Transfer Learning +1

102

Paper
Code

RGCL at SemEval-2020 Task 6: Neural Approaches to Definition Extraction

no code implementations • 13 Oct 2020 • Tharindu Ranasinghe, Alistair Plum, Constantin Orasan, Ruslan Mitkov

This paper presents the RGCL team submission to SemEval 2020 Task 6: DeftEval, subtasks 1 and 2.

Definition Extraction Sentence

Paper
Add Code

TransQuest at WMT2020: Sentence-Level Direct Assessment

1 code implementation • WMT (EMNLP) 2020 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

This paper presents the team TransQuest's participation in Sentence-Level Direct Assessment shared task in WMT 2020.

Data Augmentation Sentence

102

Paper
Code

Classifying Referential and Non-referential It Using Gaze

1 code implementation • EMNLP 2018 • Victoria Yaneva, Le An Ha, Richard Evans, Ruslan Mitkov

When processing a text, humans and machines must disambiguate between different uses of the pronoun it, including non-referential, nominal anaphoric or clause anaphoric ones.

POS

Paper
Code

Intelligent Translation Memory Matching and Retrieval with Sentence Encoders

no code implementations • EAMT 2020 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Matching and retrieving previously translated segments from a Translation Memory is the key functionality in Translation Memories systems.

Retrieval Sentence +1

Paper
Add Code

Semantic Textual Similarity with Siamese Neural Networks

no code implementations • RANLP 2019 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Calculating the Semantic Textual Similarity (STS) is an important research area in natural language processing which plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction.

Information Retrieval Question Answering +3

Paper
Add Code

Enhancing Unsupervised Sentence Similarity Methods with Deep Contextualised Word Representations

no code implementations • RANLP 2019 • Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

Calculating Semantic Textual Similarity (STS) plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction.

Contextualised Word Representations Information Retrieval +6

Paper
Add Code

What Influences the Features of Post-editese? A Preliminary Study

no code implementations • RANLP 2019 • Sheila Castilho, Nat{\'a}lia Resende, Ruslan Mitkov

While a number of studies have shown evidence of translationese phenomena, that is, statistical differences between original texts and translated texts (Gellerstam, 1986), results of studies searching for translationese features in postedited texts (what has been called {''}posteditese{''} (Daems et al., 2017)) have presented mixed results.

Translation

Paper
Add Code

RGCL-WLV at SemEval-2019 Task 12: Toponym Detection

no code implementations • SEMEVAL 2019 • Alistair Plum, Tharindu Ranasinghe, Pablo Calleja, Constantin Or{\u{a}}san, Ruslan Mitkov

This article describes the system submitted by the RGCL-WLV team to the SemEval 2019 Task 12: Toponym resolution in scientific papers.

BIG-bench Machine Learning Toponym Resolution

Paper
Add Code

Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions

2 code implementations • NAACL 2019 • Omid Rohanian, Shiva Taslimipoor, Samaneh Kouchaki, Le An Ha, Ruslan Mitkov

We introduce a new method to tag Multiword Expressions (MWEs) using a linguistically interpretable language-independent deep learning architecture.

TAG

Paper
Code

Wolves at SemEval-2018 Task 10: Semantic Discrimination based on Knowledge and Association

no code implementations • SEMEVAL 2018 • Shiva Taslimipoor, Omid Rohanian, Le An Ha, Gloria Corpas Pastor, Ruslan Mitkov

This paper describes the system submitted to SemEval 2018 shared task 10 {`}Capturing Dicriminative Attributes{'}.

Attribute Relation +2

Paper
Add Code

WLV at SemEval-2018 Task 3: Dissecting Tweets in Search of Irony

no code implementations • SEMEVAL 2018 • Omid Rohanian, Shiva Taslimipoor, Richard Evans, Ruslan Mitkov

This paper describes the systems submitted to SemEval 2018 Task 3 {``}Irony detection in English tweets{''} for both subtasks A and B.

Sentiment Analysis

Paper
Add Code

Proceedings of the International Conference Recent Advances in Natural Language Processing, {RANLP} 2017

no code implementations • RANLP 2017 • Ruslan Mitkov, Galia Angelova

Paper
Add Code

Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017

no code implementations • RANLP 2017 • Ruslan Mitkov, Galia Angelova

Paper
Add Code

Effects of Lexical Properties on Viewing Time per Word in Autistic and Neurotypical Readers

no code implementations • WS 2017 • Sanja {\v{S}}tajner, Victoria Yaneva, Ruslan Mitkov, Simone Paolo Ponzetto

Eye tracking studies from the past few decades have shaped the way we think of word complexity and cognitive load: words that are long, rare and ambiguous are more difficult to read.

Lexical Simplification

Paper
Add Code

Translation Memory Systems Have a Long Way to Go

no code implementations • RANLP 2017 • Andrea Silvestre Baquero, Ruslan Mitkov

These tools operate on fuzzy (surface) matching mostly and cannot benefit from already translated texts which are synonymous to (or paraphrased versions of) the text to be translated.

Machine Translation Sentence +1

Paper
Add Code

Investigating the Opacity of Verb-Noun Multiword Expression Usages in Context

no code implementations • WS 2017 • Shiva Taslimipoor, Omid Rohanian, Ruslan Mitkov, Afsaneh Fazly

This study investigates the supervised token-based identification of Multiword Expressions (MWEs).

Paper
Add Code

WOLVESAAR at SemEval-2016 Task 1: Replicating the Success of Monolingual Word Alignment and Neural Embeddings for Semantic Textual Similarity

no code implementations • SEMEVAL 2016 • Hannah Bechara, Rohit Gupta, Liling Tan, Constantin Or{\u{a}}san, Ruslan Mitkov, Josef van Genabith

Information Retrieval Machine Translation +4

Paper
Add Code

A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults

no code implementations • LREC 2016 • Victoria Yaneva, Irina Temnikova, Ruslan Mitkov

This division of the groups informs researchers on whether particular fixations were elicited from skillful or less-skillful readers and allows a fair between-group comparison for two levels of reading ability.

Multiple-choice POS +2

Paper
Add Code

Evaluating the Readability of Text Simplification Output for Readers with Cognitive Disabilities

no code implementations • LREC 2016 • Victoria Yaneva, Irina Temnikova, Ruslan Mitkov

This paper presents an approach for automatic evaluation of the readability of text simplification output for readers with cognitive disabilities.

Reading Comprehension Text Simplification

Paper
Add Code

Improving Translation Memory Matching through Clause Splitting

no code implementations • WS 2015 • Katerina Raisa Timonera, Ruslan Mitkov

Translation

Paper
Add Code

MiniExperts: An SVM Approach for Measuring Semantic Textual Similarity

no code implementations • SEMEVAL 2015 • Hanna B{\'e}chara, Hernani Costa, Shiva Taslimipoor, Rohit Gupta, Constantin Orasan, Gloria Corpas Pastor, Ruslan Mitkov

Information Retrieval Semantic Textual Similarity

Paper
Add Code

The Fewer, the Better? A Contrastive Study about Ways to Simplify

no code implementations • WS 2014 • Ruslan Mitkov, Sanja {\v{S}}tajner

Text Simplification

Paper
Add Code

One Step Closer to Automatic Evaluation of Text Simplification Systems

no code implementations • WS 2014 • Sanja {\v{S}}tajner, Ruslan Mitkov, Horacio Saggion

Decision Making Information Retrieval +2

Paper
Add Code

Automatic Question Generation in Multimedia-Based Learning

no code implementations • COLING 2012 • Yvonne Skalban, Le An Ha, Lucia Specia, Ruslan Mitkov

Question Generation Question-Generation

Paper
Add Code

A review corpus annotated for negation, speculation and their scope

no code implementations • LREC 2012 • Natalia Konstantinova, Sheila C.M. de Sousa, Noa P. Cruz, Manuel J. Ma{\~n}a, Maite Taboada, Ruslan Mitkov

This paper presents a freely available resource for research on handling negation and speculation in review texts.

Natural Language Inference Negation +4

Paper
Add Code

Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach

no code implementations • LREC 2012 • Sanja {\v{S}}tajner, Ruslan Mitkov

In British English, we compared the complexity of texts published in 1931, 1961 and 1991, while in American English we compared the complexity of texts published in 1961 and 1992.

Machine Translation Sentence +1

Paper
Add Code

CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations

no code implementations • LREC 2012 • Irina Temnikova, Constantin Orasan, Ruslan Mitkov

This article presents a new linguistic resource in the form of Controlled Language (CL) guidelines for manual text simplification in the CM domain which aims to address high TC in the CM domain and produce clear messages to be used in crisis situations.

Machine Translation Management +3

Paper
Add Code

Elliphant: Improved Automatic Detection of Zero Subjects and Impersonal Constructions in Spanish

no code implementations • EACL 2012 • Luz Rello, Ricardo Baeza-Yates, Ruslan Mitkov

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.