Search Results for author: Daivat Bhatt

Multi-Modal Image Captioning for the Visually Impaired

In this work, we propose altering AoANet, a state-of-the-art image captioning model, to leverage the text detected in the image as an input feature.

