no code implementations • NAACL (SUKI) 2022 • Samyadeep Basu, Amr Sharaf, Karine Ip Kiun Chong, Alex Fischer, Vishal Rohra, Michael Amoake, Hazem El-Hammamy, Ehi Nosakhare, Vijay Ramani, Benjamin Han
Intent classification (IC) and slot filling (SF) are two fundamental tasks in modern Natural Language Understanding (NLU) systems.
1 code implementation • 16 Jan 2024 • Haoran Xu, Amr Sharaf, Yunmo Chen, Weiting Tan, Lingfeng Shen, Benjamin Van Durme, Kenton Murray, Young Jin Kim
However, even top-performing 13B LLM-based translation models such as ALMA do not match the performance of state-of-the-art conventional encoder-decoder translation models or larger-scale LLMs such as GPT-4.
1 code implementation • 20 Sep 2023 • Haoran Xu, Young Jin Kim, Amr Sharaf, Hany Hassan Awadalla
In this study, we propose a novel fine-tuning approach for LLMs that is specifically designed for the translation task, eliminating the need for the abundant parallel data that traditional translation models usually depend on.
no code implementations • 24 May 2023 • Vikas Raunak, Amr Sharaf, Yiren Wang, Hany Hassan Awadallah, Arul Menezes
In this work, we formalize the task of direct translation post-editing with Large Language Models (LLMs) and explore the use of GPT-4 to automatically post-edit NMT outputs across several language pairs.
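The post-editing setup can be illustrated with a small prompt-construction helper. This is a hypothetical template written for illustration only, not the paper's actual prompt; the function name, wording, and default language pair are all assumptions:

```python
def build_postedit_prompt(source, translation, src_lang="German", tgt_lang="English"):
    """Build a hypothetical post-editing prompt: the LLM is shown the
    source sentence and an existing NMT output, and asked to improve
    the translation with minimal changes (illustrative wording only)."""
    return (
        f"Improve the following {src_lang}-to-{tgt_lang} machine translation. "
        f"Fix errors and awkward phrasing, changing as little as possible.\n"
        f"Source: {source}\n"
        f"Translation: {translation}\n"
        f"Improved translation:"
    )
```

The resulting string would be sent to an LLM such as GPT-4; the key design point is that the model sees both the source and the draft translation, so it edits rather than re-translates.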
1 code implementation • 18 Feb 2023 • Amr Hendy, Mohamed Abdelrehim, Amr Sharaf, Vikas Raunak, Mohamed Gabr, Hitokazu Matsushita, Young Jin Kim, Mohamed Afify, Hany Hassan Awadalla
In this paper, we present a comprehensive evaluation of GPT models for machine translation, covering aspects such as the quality of different GPT models compared with state-of-the-art research and commercial systems, the effect of prompting strategies, and robustness to domain shift and document-level translation.
no code implementations • 21 Oct 2021 • Samyadeep Basu, Amr Sharaf, Nicolo Fusi, Soheil Feizi
To address the issue of sub-par performance on hard episodes, we investigate and benchmark different meta-training strategies based on adversarial training and curriculum learning.
no code implementations • 29 Sep 2021 • Liam H Fowl, Micah Goldblum, Arjun Gupta, Amr Sharaf, Tom Goldstein
We validate and deploy this metric on both images and text.
no code implementations • 17 Sep 2021 • Samyadeep Basu, Karine Ip Kiun Chong, Amr Sharaf, Alex Fischer, Vishal Rohra, Michael Amoake, Hazem El-Hammamy, Ehi Nosakhare, Vijay Ramani, Benjamin Han
Intent classification (IC) and slot filling (SF) are two fundamental tasks in modern Natural Language Understanding (NLU) systems.
1 code implementation • 14 Oct 2020 • Renkun Ni, Micah Goldblum, Amr Sharaf, Kezhi Kong, Tom Goldstein
Conventional image classifiers are trained by randomly sampling mini-batches of images.
no code implementations • 13 Oct 2020 • Liam Fowl, Micah Goldblum, Arjun Gupta, Amr Sharaf, Tom Goldstein
We validate and deploy this metric on both images and text.
1 code implementation • ACL 2020 • Kianté Brantley, Amr Sharaf, Hal Daumé III
Imitation learning algorithms provide state-of-the-art results on many structured prediction tasks by learning near-optimal search policies.
no code implementations • WS 2020 • Amr Sharaf, Hany Hassan, Hal Daumé III
We frame the adaptation of NMT systems as a meta-learning problem, where we learn to adapt to new unseen domains based on simulated offline meta-training domain adaptation tasks.
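The meta-learning framing above can be sketched with a first-order MAML-style loop on toy tasks. Everything here is a deliberate simplification for illustration: scalar linear "domains" stand in for NMT domains, and the learning rates, task slopes, and function names are assumptions, not the paper's setup:

```python
import random

def make_task(slope, n=10):
    """A toy 'domain': scalar linear regression y = slope * x."""
    xs = [random.uniform(-1, 1) for _ in range(n)]
    return [(x, slope * x) for x in xs]

def loss_grad(w, batch):
    """Gradient d/dw of the mean squared error of the model y = w * x."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)

def meta_train(num_steps=2000, inner_lr=0.1, outer_lr=0.01):
    """First-order meta-learning: find an initialization from which one
    inner gradient step adapts well to a sampled 'domain'."""
    random.seed(0)
    w_meta = 0.0
    for _ in range(num_steps):
        slope = random.choice([1.0, 2.0, 3.0])        # simulated training domains
        support, query = make_task(slope), make_task(slope)
        w_adapted = w_meta - inner_lr * loss_grad(w_meta, support)  # inner step
        # first-order meta-update: move the initialization toward points
        # from which the adapted model does well on held-out query data
        w_meta -= outer_lr * loss_grad(w_adapted, query)
    return w_meta
```

At test time, adaptation to a new unseen domain is just the inner step applied from `w_meta`; the meta-initialization ends up positioned so that a single step reaches any of the training domains quickly.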
no code implementations • 25 Sep 2019 • Amr Sharaf, Hal Daumé III
We develop a meta-learning algorithm, MELEE, that learns an exploration policy based on simulated, synthetic contextual bandit tasks.
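The idea of learning an exploration policy from simulated bandit tasks can be reduced to a minimal sketch. This is a heavy simplification of the approach, not MELEE itself: contexts are dropped, the "policy" is just an exploration rate, and meta-training collapses to selecting the rate that earns the most reward across synthetic tasks:

```python
import random

def run_bandit(explore_prob, num_rounds=500, num_arms=3, seed=0):
    """One synthetic (context-free) bandit task: each arm pays Bernoulli
    reward with a hidden probability; the agent exploits its running
    value estimates except when the exploration rule fires."""
    rng = random.Random(seed)
    arm_probs = [rng.random() for _ in range(num_arms)]   # hidden payoffs
    counts = [0] * num_arms
    values = [0.0] * num_arms
    total = 0
    for _ in range(num_rounds):
        if rng.random() < explore_prob:
            arm = rng.randrange(num_arms)                 # explore
        else:
            arm = max(range(num_arms), key=lambda a: values[a])  # exploit
        reward = 1 if rng.random() < arm_probs[arm] else 0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # running mean
        total += reward
    return total

def meta_select(rates=(0.0, 0.05, 0.2, 0.5), num_tasks=20):
    """'Meta-training' in its simplest form: pick the exploration rate
    that accumulates the most reward across simulated tasks."""
    def avg(rate):
        return sum(run_bandit(rate, seed=s) for s in range(num_tasks)) / num_tasks
    return max(rates, key=avg)
```

The full algorithm learns a richer exploration policy as a function of the learner's state, but the structure is the same: exploration behavior is optimized offline on synthetic tasks, then deployed on real ones.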
no code implementations • ICLR 2019 • Amr Sharaf, Hal Daumé III
We describe MELEE, a meta-learning algorithm for learning a good exploration policy in the interactive contextual bandit setting.
no code implementations • 27 Nov 2018 • Amr Sharaf, Arpit Gupta, Hancheng Ge, Chetan Naik, Lambert Mathias
In the cross-lingual setup, we assume access to annotated resources and a well-trained model in the source language, with little to no annotated data in the target language.
1 code implementation • ICLR 2018 • Hal Daumé III, John Langford, Amr Sharaf
We consider reinforcement learning and bandit structured prediction problems with very sparse loss feedback: only at the end of an episode.
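The sparse-feedback setting can be made concrete with a toy credit-assignment sketch: only the episode's total loss is observed, and per-step estimates are fit so that their sum matches it, recovering a dense per-step signal. The hidden per-action losses, horizon, and learning rate below are assumptions for illustration, not the paper's method:

```python
import random

def learn_stepwise_credit(num_episodes=3000, horizon=5, lr=0.01, seed=0):
    """Each episode reveals only one number, the total loss, at the end.
    Fit per-action loss estimates by SGD so their sum predicts that
    total, turning sparse episodic feedback into per-step credit."""
    rng = random.Random(seed)
    true_loss = {0: 0.2, 1: 1.0}   # hidden per-step losses (assumed)
    est = {0: 0.0, 1: 0.0}         # learned per-action estimates
    for _ in range(num_episodes):
        actions = [rng.randrange(2) for _ in range(horizon)]
        episodic = sum(true_loss[a] for a in actions)    # sole feedback
        predicted = sum(est[a] for a in actions)
        err = predicted - episodic
        for a in actions:          # gradient step on squared prediction error
            est[a] -= lr * err
    return est
```

Because the episodic loss is exactly a sum of per-step contributions here, the estimates converge to the hidden per-action losses, which a policy learner could then use as immediate feedback at every step.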
no code implementations • WS 2017 • Amr Sharaf, Hal Daumé III
We present an algorithm for structured prediction under online bandit feedback.
no code implementations • WS 2017 • Amr Sharaf, Shi Feng, Khanh Nguyen, Kianté Brantley, Hal Daumé III
We describe the University of Maryland machine translation systems submitted to the WMT17 German-English Bandit Learning Task.