no code implementations • ECCV 2020 • Guo-Sen Xie, Li Liu, Fan Zhu, Fang Zhao, Zheng Zhang, Yazhou Yao, Jie Qin, Ling Shao
To exploit the progressive interactions among these regions, we represent them as a region graph, on which the parts relation reasoning is performed with graph convolutions, thus leading to our PRR branch.
no code implementations • 4 May 2024 • Meiqi Cao, Rui Yan, Xiangbo Shu, Guangzhao Dai, Yazhou Yao, Guo-Sen Xie
Therefore, we propose a novel Adapt-Focused bi-Propagating Prototype learning (AdaFPP) framework to jointly recognize individual, group, and global activities in panoramic activity scenes by learning an adapt-focused detector and multi-granularity prototypes as the pretext tasks in an end-to-end way.
no code implementations • 30 Apr 2024 • Wang Zhang, TingTing Li, Yuntian Zhang, Gensheng Pei, Xiruo Jiang, Yazhou Yao
Nevertheless, annotated data is frequently limited in the field of remote sensing image matching.
1 code implementation • 21 Apr 2024 • Gensheng Pei, Yazhou Yao, Jianbo Jiao, Wenguan Wang, Liqiang Nie, Jinhui Tang
To achieve this objective, we present a unified self-supervised approach to learn visual representations of static-dynamic feature similarity.
no code implementations • 23 Mar 2024 • Zhenhuang Cai, Chuanyi Zhang, Dan Huang, Yuanbo Chen, Xiuyun Guan, Yazhou Yao
Comprehensive experimental results on synthetic and real-world datasets demonstrate the superiority of GRIP over the existing state-of-the-art methods.
1 code implementation • 10 Mar 2024 • Xinhao Cai, Qiuxia Lai, Yuwei Wang, Wenguan Wang, Zeren Sun, Yazhou Yao
Object detection in remote sensing images (RSIs) often suffers from several increasing challenges, including the large variation in object scales and the diverse-ranging context.
1 code implementation • 29 Feb 2024 • Gensheng Pei, Tao Chen, Xiruo Jiang, Huafeng Liu, Zeren Sun, Yazhou Yao
In this paper, we propose a new approach termed as \textbf{VideoMAC}, which combines video masked autoencoders with resource-friendly ConvNets.
no code implementations • 17 Feb 2024 • Huafeng Liu, Mengmeng Sheng, Zeren Sun, Yazhou Yao, Xian-Sheng Hua, Heng-Tao Shen
Specifically, we propose Class-Balance-based sample Selection (CBS) to prevent the tail class samples from being neglected during training.
1 code implementation • 20 Jan 2024 • Tao Chen, Yazhou Yao, Xingguo Huang, Zechao Li, Liqiang Nie, Jinhui Tang
In this paper, we propose spatial structure constraints (SSC) for weakly supervised semantic segmentation to alleviate the unwanted object over-activation of attention expansion.
1 code implementation • 15 Dec 2023 • Gensheng Pei, Fumin Shen, Yazhou Yao, Tao Chen, Xian-Sheng Hua, Heng-Tao Shen
However, existing optical flow-based methods have a significant dependency on optical flow, which results in poor performance when the optical flow estimation fails for a particular scene.
no code implementations • 15 Dec 2023 • Mengmeng Sheng, Zeren Sun, Zhenhuang Cai, Tao Chen, Yichao Zhou, Yazhou Yao
There has been significant attention devoted to the effectiveness of various domains, such as semi-supervised learning, contrastive learning, and meta-learning, in enhancing the performance of methods for noisy label learning (NLL) tasks.
1 code implementation • 16 Jul 2023 • Yin Tang, Tao Chen, Xiruo Jiang, Yazhou Yao, Guo-Sen Xie, Heng-Tao Shen
Existing methods have demonstrated that the domain agent-based attention mechanism is effective in FSVOS by learning the correlation between support images and query frames.
1 code implementation • 9 May 2023 • Tao Chen, Yazhou Yao, Jinhui Tang
Weakly supervised semantic segmentation (WSSS) models relying on class activation maps (CAMs) have achieved desirable performance comparing to the non-CAMs-based counterparts.
1 code implementation • 23 Apr 2023 • Rui Chen, Tao Chen, Qiong Wang, Yazhou Yao
The most common approach is to generate pseudo-labels for unlabeled images to augment the training data.
1 code implementation • 8 Apr 2023 • Gensheng Pei, Yazhou Yao, Fumin Shen, Dan Huang, Xingguo Huang, Heng-Tao Shen
Zero-shot video object segmentation (ZS-VOS) aims to segment foreground objects in a video sequence without prior knowledge of these objects.
1 code implementation • 4 Apr 2023 • Junzhu Mao, Yazhou Yao, Zeren Sun, Xingguo Huang, Fumin Shen, Heng-Tao Shen
Then we combine the similarity and first-order gradients of key tokens along the query dimension for token importance estimation and remove redundant key and value tokens to further reduce the inference complexity.
1 code implementation • 19 Jan 2023 • Huafeng Liu, Pai Peng, Tao Chen, Qiong Wang, Yazhou Yao, Xian-Sheng Hua
Few-shot semantic segmentation is the task of learning to locate each pixel of the novel class in the query image with only a few annotated support images.
Ranked #2 on Few-Shot Semantic Segmentation on COCO-20i (10-shot)
1 code implementation • 18 Jul 2022 • Gensheng Pei, Fumin Shen, Yazhou Yao, Guo-Sen Xie, Zhenmin Tang, Jinhui Tang
Optical flow is an easily conceived and precious cue for advancing unsupervised video object segmentation (UVOS).
1 code implementation • 20 Jun 2022 • Tao Chen, Yazhou Yao, Lei Zhang, Qiong Wang, Guo-Sen Xie, Fumin Shen
Specifically, we propose a saliency guided class-agnostic distance module to pull the intra-category features closer by aligning features to their class prototypes.
no code implementations • 22 Mar 2022 • Tiantian He, Zhibin Li, Yongshun Gong, Yazhou Yao, Xiushan Nie, Yilong Yin
Non-linear activation functions, e. g., Sigmoid, ReLU, and Tanh, have achieved great success in neural networks (NNs).
no code implementations • CVPR 2022 • Zeren Sun, Fumin Shen, Dan Huang, Qiong Wang, Xiangbo Shu, Yazhou Yao, Jinhui Tang
Label noise has been a practical challenge in deep learning due to the strong capability of deep neural networks in fitting all training data.
1 code implementation • ICCV 2021 • Zeren Sun, Yazhou Yao, Xiu-Shen Wei, Yongshun Zhang, Fumin Shen, Jianxin Wu, Jian Zhang, Heng-Tao Shen
Learning from the web can ease the extreme dependence of deep learning on large-scale manually labeled datasets.
1 code implementation • CVPR 2021 • Yazhou Yao, Tao Chen, GuoSen Xie, Chuanyi Zhang, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang
To further mine the non-salient region objects, we propose to exert the segmentation network's self-correction ability.
no code implementations • CVPR 2021 • Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang, Zhenmin Tang
Due to the memorization effect in Deep Neural Networks (DNNs), training with noisy labels usually results in inferior model performance.
1 code implementation • 22 Feb 2021 • Tao Chen, GuoSen Xie, Yazhou Yao, Qiong Wang, Fumin Shen, Zhenmin Tang, Jian Zhang
Then we utilize the fused prototype to guide the final segmentation of the query image.
1 code implementation • 23 Jan 2021 • Huafeng Liu, Chuanyi Zhang, Yazhou Yao, Xiushen Wei, Fumin Shen, Jian Zhang, Zhenmin Tang
Labeling objects at a subordinate level typically requires expert knowledge, which is not always available when using random annotators.
no code implementations • ICCV 2021 • Guo-Sen Xie, Huan Xiong, Jie Liu, Yazhou Yao, Ling Shao
Specifically, we first generate N pairs (key and value) of multi-resolution query features guided by the support feature and its mask.
no code implementations • 29 Dec 2020 • Xiu-Shen Wei, Yu-Yan Xu, Yazhou Yao, Jia Wei, Si Xi, Wenyuan Xu, Weidong Zhang, Xiaoxin Lv, Dengpan Fu, Qing Li, Baoying Chen, Haojie Guo, Taolue Xue, Haipeng Jing, Zhiheng Wang, Tianming Zhang, Mingwen Zhang
WebFG 2020 is an international challenge hosted by Nanjing University of Science and Technology, University of Edinburgh, Nanjing University, The University of Adelaide, Waseda University, etc.
1 code implementation • NeurIPS 2020 • Zhibin Li, Jian Zhang, Yongshun Gong, Yazhou Yao, Qiang Wu
We present a model that utilizes linear models with variance and low-rank constraints, to help it generalize better and reduce the number of parameters.
1 code implementation • 6 Aug 2020 • Zeren Sun, Xian-Sheng Hua, Yazhou Yao, Xiu-Shen Wei, Guosheng Hu, Jian Zhang
To this end, we propose a certainty-based reusable sample selection and correction approach, termed as CRSSC, for coping with label noise in training deep FG models with web images.
1 code implementation • 6 Aug 2020 • Chuanyi Zhang, Yazhou Yao, Xiangbo Shu, Zechao Li, Zhenmin Tang, Qi Wu
To this end, we propose a data-driven meta-set based approach to deal with noisy web images for fine-grained recognition.
1 code implementation • 2 May 2020 • Benyi Hu, Ren-Jie Song, Xiu-Shen Wei, Yazhou Yao, Xian-Sheng Hua, Yuehu Liu
Despite significant progress of applying deep learning methods to the field of content-based image retrieval, there has not been a software library that covers these methods in a unified manner.
1 code implementation • 9 Mar 2020 • Tianfei Zhou, Shunzhou Wang, Yi Zhou, Yazhou Yao, Jianwu Li, Ling Shao
In this paper, we present a novel Motion-Attentive Transition Network (MATNet) for zero-shot video object segmentation, which provides a new way of leveraging motion information to reinforce spatio-temporal object representation.
Ranked #12 on Unsupervised Video Object Segmentation on FBMS test
no code implementations • 7 Jun 2019 • Yazhou Yao, Jian Zhang, Xian-Sheng Hua, Fumin Shen, Zhenmin Tang
Recent successes in visual recognition can be primarily attributed to feature representation, learning algorithms, and the ever-increasing size of labeled training data.
no code implementations • 27 May 2019 • Yazhou Yao, Zeren Sun, Fumin Shen, Li Liu, Li-Min Wang, Fan Zhu, Lizhong Ding, Gangshan Wu, Ling Shao
To address this issue, we present an adaptive multi-model framework that resolves polysemy by visual disambiguation.
no code implementations • 26 May 2019 • Huafeng Liu, Xiaofeng Han, Xiangrui Li, Yazhou Yao, Pu Huang, Zhenming Tang
We project the LiDAR point clouds onto the image plane to generate LiDAR images and feed them into one of the branches of the network.
no code implementations • 26 May 2019 • Huafeng Liu, Yazhou Yao, Zeren Sun, Xiangrui Li, Ke Jia, Zhenmin Tang
Robust road segmentation is a key challenge in self-driving research.
no code implementations • 22 Aug 2017 • Yazhou Yao, Jian Zhang, Fumin Shen, Li Liu, Fan Zhu, Dongxiang Zhang, Heng-Tao Shen
To eliminate manual annotation, in this work, we propose a novel image dataset construction framework by employing multiple textual queries.
no code implementations • 16 Mar 2017 • Yazhou Yao, Jian Zhang, Fumin Shen, Xian-Sheng Hua, Wankou Yang, Zhenmin Tang
To tackle these problems, in this work, we exploit general corpus information to automatically select and subsequently classify web images into semantic rich (sub-)categories.
no code implementations • 22 Nov 2016 • Yazhou Yao, Jian Zhang, Fumin Shen, Xian-Sheng Hua, Jingsong Xu, Zhenmin Tang
To reduce the cost of manual labelling, there has been increased research interest in automatically constructing image datasets by exploiting web images.