1 code implementation • 21 Sep 2023 • Jennifer A Bishop, Qianqian Xie, Sophia Ananiadou
We create a human-annotated data set for evaluating automatic factuality metrics, LongSciVerify, which contains fine-grained factual consistency annotations for long document summaries from the scientific domain.