Search Results for author: Guy Kaplan

Found 1 papers, 1 papers with code

State of What Art? A Call for Multi-Prompt LLM Evaluation

1 code implementation31 Dec 2023 Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky

Recent advances in large language models (LLMs) have led to the development of various evaluation benchmarks.

Cannot find the paper you are looking for? You can Submit a new open access paper.