1 code implementation • NeurIPS 2023 • Ziyi Bai, Ruiping Wang, Xilin Chen
Instead of that, we train an Encoder-Decoder to generate a set of dynamic event memories at the glancing stage.
Ranked #1 on Video Question Answering on AGQA 2.0 balanced
no code implementations • ICCV 2021 • Difei Gao, Ruiping Wang, Ziyi Bai, Xilin Chen
Visual understanding goes well beyond the study of images or videos on the web.