Search Results for author: Piyush Mathur

Found 2 papers, 0 papers with code

A Literature Review and Framework for Human Evaluation of Generative Large Language Models in Healthcare

no code implementations4 May 2024 Thomas Yu CHow Tam, Sonish Sivarajkumar, Sumit Kapoor, Alisa V Stolyar, Katelyn Polanska, Karleigh R McCarthy, Hunter Osterhoudt, Xizhi Wu, Shyam Visweswaran, Sunyang Fu, Piyush Mathur, Giovanni E. Cacciamani, Cong Sun, Yifan Peng, Yanshan Wang

This review provides a comprehensive overview of the human evaluation approaches used in diverse healthcare applications. This analysis examines the human evaluation of LLMs across various medical specialties, addressing factors such as evaluation dimensions, sample types, and sizes, the selection and recruitment of evaluators, frameworks and metrics, the evaluation process, and statistical analysis of the results.

Cannot find the paper you are looking for? You can Submit a new open access paper.