Consists of millions of entries in which the MT element of the training triplets has been obtained by translating the source side of publicly-available parallel corpora, and using the target side as an artificial human post-edit. Translations are obtained both with phrase-based and neural models.
Source: eSCAPE: a Large-scale Synthetic Corpus for Automatic Post-EditingPaper | Code | Results | Date | Stars |
---|