Search Results for author: Frank Binder

Found 1 paper, 1 paper with code

CRASS: A Novel Data Set and Benchmark to Test Counterfactual Reasoning of Large Language Models

1 code implementation • LREC 2022 • Jörg Frohberg, Frank Binder

We introduce the CRASS (counterfactual reasoning assessment) data set and benchmark utilizing questionized counterfactual conditionals as a novel and powerful tool to evaluate large language models.

counterfactual, Counterfactual Reasoning, +1
