Search Results for author: Md Mahfuz ibn Alam

Found 11 papers, 6 papers with code

Findings of the WMT Shared Task on Machine Translation Using Terminologies

no code implementations • WMT (EMNLP) 2021 • Md Mahfuz ibn Alam, Ivana Kvapilíková, Antonios Anastasopoulos, Laurent Besacier, Georgiana Dinu, Marcello Federico, Matthias Gallé, Kweonwoo Jung, Philipp Koehn, Vassilina Nikoulina

Language domains that require very careful use of terminology are abundant and reflect a significant part of the translation industry.

Machine Translation Translation

Paper
Add Code

Fine-Tuning MT systems for Robustness to Second-Language Speaker Variations

1 code implementation • EMNLP (WNUT) 2020 • Md Mahfuz ibn Alam, Antonios Anastasopoulos

The performance of neural machine translation (NMT) systems only trained on a single language variant degrades when confronted with even slightly different language variations.

Machine Translation NMT +1

Paper
Code

Language and Speech Technology for Central Kurdish Varieties

1 code implementation • 4 Mar 2024 • Sina Ahmadi, Daban Q. Jaff, Md Mahfuz ibn Alam, Antonios Anastasopoulos

Kurdish, an Indo-European language spoken by over 30 million speakers, is considered a dialect continuum and known for its diversity in language varieties.

Automatic Speech Recognition Language Identification +3

Paper
Code

A Case Study on Filtering for End-to-End Speech Translation

no code implementations • 2 Feb 2024 • Md Mahfuz ibn Alam, Antonios Anastasopoulos

It is relatively easy to mine a large parallel corpus for any machine learning task, such as speech-to-text or speech-to-speech translation.

Speech-to-Speech Translation Translation

Paper
Add Code

A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages

no code implementations • 2 Feb 2024 • Md Mahfuz ibn Alam, Sina Ahmadi, Antonios Anastasopoulos

In this paper, we propose strategies to synthesize parallel data relying on morpho-syntactic information and using bilingual lexicons along with a small amount of seed parallel data.

Data Augmentation Machine Translation

Paper
Add Code

CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation

no code implementations • 26 May 2023 • Md Mahfuz ibn Alam, Sina Ahmadi, Antonios Anastasopoulos

Neural machine translation (NMT) systems exhibit limited robustness in handling source-side linguistic variations.

Machine Translation NMT +1

Paper
Add Code

BIG-C: a Multimodal Multi-Purpose Dataset for Bemba

1 code implementation • 26 May 2023 • Claytone Sikasote, Eunice Mukonde, Md Mahfuz ibn Alam, Antonios Anastasopoulos

We present BIG-C (Bemba Image Grounded Conversations), a large multimodal dataset for Bemba.

Machine Translation speech-recognition +2

Paper
Code

LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages

1 code implementation • 23 May 2023 • Milind Agarwal, Md Mahfuz ibn Alam, Antonios Anastasopoulos

Second, we propose a novel misprediction-resolution hierarchical model, LIMIt, for language identification that reduces error by 55% (from 0. 71 to 0. 32) on our compiled children's stories dataset and by 40% (from 0. 23 to 0. 14) on the FLORES-200 benchmark.

Language Identification Translation

Paper
Code

GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters

no code implementations • 25 Apr 2023 • Md Mahfuz ibn Alam, Ruoyu Xie, Fahim Faisal, Antonios Anastasopoulos

This report describes GMU's sentiment analysis system for the SemEval-2023 shared task AfriSenti-SemEval.

Language Modelling Sentiment Analysis

Paper
Add Code

SD-QA: Spoken Dialectal Question Answering for the Real World

1 code implementation • Findings (EMNLP) 2021 • Fahim Faisal, Sharlina Keshava, Md Mahfuz ibn Alam, Antonios Anastasopoulos

Question answering (QA) systems are now available through numerous commercial applications for a wide variety of domains, serving millions of users that interact with them via speech interfaces.

Fairness Question Answering +2

Paper
Code

On the Evaluation of Machine Translation for Terminology Consistency

1 code implementation • 22 Jun 2021 • Md Mahfuz ibn Alam, Antonios Anastasopoulos, Laurent Besacier, James Cross, Matthias Gallé, Philipp Koehn, Vassilina Nikoulina

As neural machine translation (NMT) systems become an important part of professional translator pipelines, a growing body of work focuses on combining NMT with terminologies.

Domain Adaptation Machine Translation +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.