Search Results for author: Kazuki Hayashi

Found 2 papers, 0 papers with code

Artwork Explanation in Large-scale Vision Language Models

no code implementations29 Feb 2024 Kazuki Hayashi, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

To address this issue, we propose a new task: the artwork explanation generation task, along with its evaluation dataset and metric for quantitatively assessing the understanding and utilization of knowledge about artworks.

Explanation Generation Text Generation

Evaluating Image Review Ability of Vision Language Models

no code implementations19 Feb 2024 Shigeki Saito, Kazuki Hayashi, Yusuke Ide, Yusuke Sakai, Kazuma Onishi, Toma Suzuki, Seiji Gobara, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

Large-scale vision language models (LVLMs) are language models that are capable of processing images and text inputs by a single model.

Image Captioning

Cannot find the paper you are looking for? You can Submit a new open access paper.