1 code implementation • 2 Nov 2023 • Haosen Yang, Chuofan Ma, Bin Wen, Yi Jiang, Zehuan Yuan, Xiatian Zhu
Understanding the semantics of individual regions or patches within unconstrained images, such as in open-world object detection, represents a critical yet challenging task in computer vision.
1 code implementation • 9 Oct 2022 • Haosen Yang, Deng Huang, Bin Wen, Jiannan Wu, Hongxun Yao, Yi Jiang, Xiatian Zhu, Zehuan Yuan
As a result, our model can effectively extract both static appearance and dynamic motion spontaneously, leading to superior spatiotemporal representation learning capability.
2 code implementations • 5 Mar 2022 • Qishuai Diao, Yi Jiang, Bin Wen, Jia Sun, Zehuan Yuan
Fine-Grained Visual Classification (FGVC) is the task of recognizing objects that belong to multiple subordinate categories of a super-category.
Ranked #1 on Fine-Grained Image Classification on NABirds (using extra training data)
no code implementations • 1 Feb 2020 • Bin Wen, Jie Luo, Xianglong Liu, Lei Huang
Extracting a graph representation of visual scenes in images is a challenging task in computer vision.
no code implementations • 16 Apr 2019 • Bin Wen, Jianhou Gan, Juan L. G. Guirao, Wei Gao
With the rise of knowledge management and the knowledge economy, the knowledge elements that directly link and embody the knowledge system have become a research focus and hotspot in certain areas.