1 code implementation • 11 Jan 2024 • Steffi Chern, Zhen Fan, Andy Liu
While state-of-the-art language models have achieved impressive results, they remain susceptible to inference-time adversarial attacks, such as adversarial prompts generated by red teams arXiv:2209. 07858.
1 code implementation • 2 Mar 2023 • Andy Liu, Hao Zhu, Emmy Liu, Yonatan Bisk, Graham Neubig
We also find some evidence that increasing task difficulty in the training process results in more fluent and precise utterances in evaluation.