Search Results for author: Polina Tsvilodub

Found 5 papers, 0 papers with code

Paper
Add Code

Experimental Pragmatics with Machines: Testing LLM Predictions for the Inferences of Plain and Embedded Disjunctions

no code implementations • 9 May 2024 • Polina Tsvilodub, Paul Marty, Sonia Ramotowska, Jacopo Romoli, Michael Franke

Human communication is based on a variety of inferences that we draw from sentences, often going beyond what is literally said.

Implicatures

Paper
Add Code

Predictions from language models for multiple-choice tasks are not robust under variation of scoring methods

no code implementations • 1 Mar 2024 • Polina Tsvilodub, Hening Wang, Sharon Grosch, Michael Franke

This paper systematically compares different methods of deriving item-level predictions of language models for multiple-choice tasks.

Multiple-choice

Paper
Add Code

Evaluating Pragmatic Abilities of Image Captioners on A3DS

no code implementations • 22 May 2023 • Polina Tsvilodub, Michael Franke

Evaluating grounded neural language model performance with respect to pragmatic qualities like the trade off between truthfulness, contrastivity and overinformativity of generated utterances remains a challenge in absence of data collected from humans.

Language Modelling

Paper
Add Code

Overinformative Question Answering by Humans and Machines

no code implementations • 11 May 2023 • Polina Tsvilodub, Michael Franke, Robert D. Hawkins, Noah D. Goodman

When faced with a polar question, speakers often provide overinformative answers going beyond a simple "yes" or "no".

Question Answering

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.

Search Results for author: Polina Tsvilodub

Found 5 papers, 0 papers with code

Modeling German Word Order Acquisition via Bayesian Inference

Experimental Pragmatics with Machines: Testing LLM Predictions for the Inferences of Plain and Embedded Disjunctions

Predictions from language models for multiple-choice tasks are not robust under variation of scoring methods

Evaluating Pragmatic Abilities of Image Captioners on A3DS

Overinformative Question Answering by Humans and Machines