no code implementations • 10 Oct 2022 • Pedro Rodriguez, Mahmoud Azab, Becka Silvert, Renato Sanchez, Linzy Labson, Hardik Shah, Seungwhan Moon
Searching troves of videos with textual descriptions is a core multimodal retrieval task.
Retrieval Text to Video Retrieval +2