no code implementations • 8 May 2023 • Jiajun Wei, Hongjian Zhan, Xiao Tu, Yue Lu, Umapada Pal
Inspired by ITC, the SITM network combines the visual features and the text features of all candidates to identify the candidate with the minimum distance in the feature space.