1 code implementation • 18 Nov 2023 • Shobhit Agarwal, Yevgeniy R. Semenov, William Lotter
By leveraging a pre-trained joint embedding space between images and text, our approach estimates a new classification task as a linear combination of words, resulting in a weight for each word that indicates its alignment with the vision-based classifier.
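The idea of expressing a classifier as a weighted sum of words can be illustrated with a minimal sketch. The abstract does not say how the weights are fitted, so the ordinary least-squares fit below is an assumption rather than the authors' procedure. The random vectors stand in for text embeddings from a pre-trained joint image-text model, and the word list and all function names (`fit_word_weights`, `solve_linear_system`) are hypothetical.

```python
import random


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def solve_linear_system(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for k in range(col, n + 1):
                M[r][k] -= factor * M[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        tail = sum(M[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def fit_word_weights(word_embeddings, classifier_vector):
    """Least-squares weights w with sum_i w[i] * word_embeddings[i] ~ classifier_vector.

    Assumption: the classifier is a single direction in the shared embedding
    space, and the fit solves the normal equations G w = E c, where G is the
    Gram matrix of the word embeddings.
    """
    gram = [[dot(e_i, e_j) for e_j in word_embeddings] for e_i in word_embeddings]
    rhs = [dot(e_i, classifier_vector) for e_i in word_embeddings]
    return solve_linear_system(gram, rhs)


# Synthetic stand-ins: 4 hypothetical words embedded in a 16-dimensional space.
random.seed(0)
words = ["dark", "round", "striped", "smooth"]
dim = 16
E = [[random.gauss(0.0, 1.0) for _ in range(dim)] for _ in words]

# Build a toy "vision-based classifier" that is exactly a known word mixture.
true_weights = [1.5, 0.0, -0.7, 0.2]
classifier = [sum(w * e[k] for w, e in zip(true_weights, E)) for k in range(dim)]

weights = fit_word_weights(E, classifier)

# Words with larger absolute weight align more strongly with the classifier.
for word, w in sorted(zip(words, weights), key=lambda p: -abs(p[1])):
    print(f"{word:>8s} {w:+.3f}")
```

In this toy setting the classifier is constructed from the word embeddings, so the fit recovers the mixture up to floating-point error. A real classifier would generally not lie in the span of the chosen words, and the weights would then describe only its best approximation within that vocabulary.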