Search Results for author: Polina Tsvilodub

Found 5 papers, 0 papers with code

Predictions from language models for multiple-choice tasks are not robust under variation of scoring methods

no code implementations • 1 Mar 2024 • Polina Tsvilodub, Hening Wang, Sharon Grosch, Michael Franke

This paper systematically compares different methods of deriving item-level predictions of language models for multiple-choice tasks.

Multiple-choice

Evaluating Pragmatic Abilities of Image Captioners on A3DS

no code implementations • 22 May 2023 • Polina Tsvilodub, Michael Franke

Evaluating grounded neural language model performance with respect to pragmatic qualities like the trade-off between truthfulness, contrastivity and overinformativity of generated utterances remains a challenge in the absence of data collected from humans.

Language Modelling

Overinformative Question Answering by Humans and Machines

no code implementations • 11 May 2023 • Polina Tsvilodub, Michael Franke, Robert D. Hawkins, Noah D. Goodman

When faced with a polar question, speakers often provide overinformative answers going beyond a simple "yes" or "no".

Question Answering
