1 code implementation • 13 May 2024 • Hung Tuan Le, Long Truong To, Manh Trong Nguyen, Kiet Van Nguyen
BM25 and InfoXLM (Large) achieved the best results in two tasks, with BM25 achieving an accuracy of 88. 30% for SUPPORTS, 86. 93% for REFUTES, and only 56. 67% for the NEI label in the evidence retrieval task, InfoXLM (Large) achieved an F1 score of 86. 51%.