no code implementations • 14 Nov 2023 • Yong Xie, Karan Aggarwal, Aitzaz Ahmad
We further explore simple but effective data selection strategies for continual pre-training.