Search Results for author: Qianqi Yan

Found 2 papers, 1 paper with code

Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA

1 code implementation • 30 May 2024 • Qianqi Yan, Xuehai He, Xiang Yue, Xin Eric Wang

This study reveals that state-of-the-art models, when subjected to simple probing evaluation, perform worse than random guessing on medical diagnosis questions.

Medical Diagnosis, Medical Visual Question Answering (+3 more)

Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA

no code implementations • 29 Jan 2024 • Yue Fan, Jing Gu, Kaiwen Zhou, Qianqi Yan, Shan Jiang, Ching-Chen Kuo, Xinze Guan, Xin Eric Wang

Our evaluation shows that questions in the MultipanelVQA benchmark pose significant challenges to the state-of-the-art Large Vision Language Models (LVLMs) tested, even though humans can attain approximately 99% accuracy on these questions.

Benchmarking, Image Comprehension (+4 more)
