Search Results for author: Guy Kaplan

State of What Art? A Call for Multi-Prompt LLM Evaluation

Recent advances in large language models (LLMs) have led to the development of various evaluation benchmarks.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.