1 code implementation • 24 Oct 2020 • Zan-Xia Jin, Heran Wu, Chun Yang, Fang Zhou, Jingyan Qin, Lei Xiao, Xu-Cheng Yin
Text-based visual question answering (VQA) requires to read and understand text in an image to correctly answer a given question.
Optical Character Recognition Optical Character Recognition (OCR) +2