TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Grammatical Error Correction	UA-GEC	mT5 large + 10M synth	F0.5	68.09	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-low-resource-approach-to-the-grammatical/grammatical-error-correction-on-ua-gec)](https://paperswithcode.com/sota/grammatical-error-correction-on-ua-gec?p=a-low-resource-approach-to-the-grammatical)`

A Low-Resource Approach to the Grammatical Error Correction of Ukrainian

EACL 2023 · Frank Palma Gomez, Alla Rozovskaya, and Dan Roth ·

We present our system that participated in the shared task on the grammatical error correction of Ukrainian. We have implemented two approaches that make use of large pre-trained language models and synthetic data, that have been used for error correction of English as well as low-resource languages. The first approach is based on fine-tuning a large multilingual language model (mT5) in two stages: first, on synthetic data, and then on gold data. The second approach trains a (smaller) seq2seq Transformer model pre-trained on synthetic data and fine-tuned on gold data. Our mT5-based model scored first in “GEC only” track, and a very close second in the “GEC+Fluency” track. Our two key innovations are (1) finetuning in stages, first on synthetic, and then on gold data; and (2) a high-quality corruption method based on roundtrip machine translation to complement existing noisification approaches.

PDF Abstract