1 code implementation • Findings (EMNLP) 2021 • Ruiqi Zhong, Kristy Lee, Zheng Zhang, Dan Klein
However, the next-word prediction training objective is still misaligned with the target zero-shot learning objective.
Language Modelling • Natural Language Inference • +2