no code implementations • 1 Apr 2022 • Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid
To close this gap, we propose a new video mining pipeline that transfers captions from image captioning datasets to video clips with no additional manual effort.
Ranked #6 on Zero-shot Text to Audio Retrieval on AudioCaps
no code implementations • ICCV 2017 • Santiago Manen, Michael Gygli, Dengxin Dai, Luc van Gool
We further validate our approach by crowdsourcing the PathTrack dataset, with more than 15,000 person trajectories in 720 sequences.