Search Results for author: Wei Suo

Found 5 papers, 2 papers with code

CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning

1 code implementation • 15 Mar 2024 • Yukun Li, Guansong Pang, Wei Suo, Chenchen Jing, Yuling Xi, Lingqiao Liu, Hao Chen, Guoqiang Liang, Peng Wang

Large pre-trained VLMs like CLIP have demonstrated superior zero-shot recognition ability, and a number of recent studies leverage this ability to mitigate catastrophic forgetting in CL, but they focus on closed-set CL in a single domain dataset.

Class Incremental Learning, Incremental Learning, +1

S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning

no code implementations • CVPR 2023 • Wei Suo, Mengyang Sun, Weisong Liu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu

The VQA Natural Language Explanation (VQA-NLE) task aims to explain the decision-making process of VQA models in natural language.

Decision Making, Visual Question Answering (VQA)

A Simple and Robust Correlation Filtering Method for Text-based Person Search

1 code implementation • ECCV 2022 • Wei Suo, Mengyang Sun, Kai Niu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu

Text-based person search aims to associate pedestrian images with natural language descriptions.

Ranked #8 on Text based Person Retrieval on ICFG-PEDES

Denoising, Person Search, +3

Dual-Level Decoupled Transformer for Video Captioning

no code implementations • 6 May 2022 • Yiqi Gao, Xinglin Hou, Wei Suo, Mengyang Sun, Tiezheng Ge, Yuning Jiang, Peng Wang

As for the latter, "couple" means treating the generation of visual semantic and syntax-related words equally.

Descriptive Sentence +1

Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention

no code implementations • 5 May 2021 • Wei Suo, Mengyang Sun, Peng Wang, Qi Wu

Referring Expression Comprehension (REC) has become one of the most important tasks in visual reasoning, since it is an essential step for many vision-and-language tasks such as visual question answering.

Question Answering, Referring Expression, +3
