Paper tables with annotated results for Order Matters in the Presence of Dataset Imbalance for Multilingual Learning

Paper

Order Matters in the Presence of Dataset Imbalance for Multilingual Learning

In this paper, we empirically study the optimization dynamics of multi-task learning, particularly focusing on those that govern a collection of tasks with significant data imbalance. We present a simple yet effective method of pre-training on high-resource tasks, followed by fine-tuning on a mixture of high/low-resource tasks. We provide a thorough empirical study and analysis of this method's benefits showing that it achieves consistent improvements relative to the performance trade-off profile of standard static weighting. We analyze under what data regimes this method is applicable and show its improvements empirically in neural machine translation (NMT) and multi-lingual language modeling.

PDF Paper record

Results in Papers With Code

(↓ scroll down to see all results)

Order Matters in the Presence of Dataset Imbalance for Multilingual Learning

Reader Guidelines

Editor Guidelines