Search Results for author: Prajwal Gatti

Found 2 papers, 0 papers with code

Towards Scene-Text to Scene-Text Translation

no code implementations6 Aug 2023 Onkar Susladkar, Prajwal Gatti, Anand Mishra

In this work, we study the task of ``visually" translating scene text from a source language (e. g., English) to a target language (e. g., Chinese).

Scene Text Editing Translation

COFAR: Commonsense and Factual Reasoning in Image Search

no code implementations16 Oct 2022 Prajwal Gatti, Abhirama Subramanyam Penamakuri, Revant Teotia, Anand Mishra, Shubhashis Sengupta, Roshni Ramnani

To enable both commonsense and factual reasoning in the image search, we present a unified framework, namely Knowledge Retrieval-Augmented Multimodal Transformer (KRAMT), that treats the named visual entities in an image as a gateway to encyclopedic knowledge and leverages them along with natural language query to ground relevant knowledge.

Image Retrieval Retrieval +1

Cannot find the paper you are looking for? You can Submit a new open access paper.