no code implementations • 28 Feb 2024 • Xiujie Song, Mengyue Wu, Kenny Q. Zhu, Chunhao Zhang, Yanyi Chen
Large Vision Language Models (LVLMs), despite their recent success, are hardly comprehensively tested for their cognitive abilities.
Question Answering Visual Question Answering