Long Term Anticipation

0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

HierVL: Learning Hierarchical Video-Language Embeddings

no code yet • CVPR 2023

Video-language embeddings are a promising avenue for injecting semantics into visual representations, but existing methods capture only short-term associations between seconds-long video clips and their accompanying text.