no code implementations • 24 Aug 2022 • Min Wang, Ata Mahjoubfar, Anupama Joshi
We see that using the same transformer for encoding the question and decoding the answer, as in language models, achieves maximum accuracy, showing that visual language models (VLMs) make the best visual question answering systems for our dataset.
no code implementations • 25 May 2022 • Dipendra Jha, Ata Mahjoubfar, Anupama Joshi
On-Shelf Availability (OSA) of products in retail stores is a critical business criterion in the fast moving consumer goods and retails sector.