2 code implementations • 8 Jun 2021 • Ioannis Kazakos, Carles Ventura, Miriam Bellver, Carina Silberer, Xavier Giro-i-Nieto
Recent advances in deep learning have brought significant progress in visual grounding tasks such as language-guided video object segmentation.
2 code implementations • 1 Oct 2020 • Miriam Bellver, Carles Ventura, Carina Silberer, Ioannis Kazakos, Jordi Torres, Xavier Giro-i-Nieto
The task of video object segmentation with referring expressions (language-guided VOS) is to, given a linguistic phrase and a video, generate binary masks for the object to which the phrase refers.
Ranked #1 on Referring Expression Segmentation on A2Dre test