no code implementations • CVPR 2023 • Hui Wu, Min Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li
Then, a dynamic mixer is introduced to aggregate these features into compact embedding for efficient search.
no code implementations • 24 Oct 2023 • Yunyao Mao, Jiajun Deng, Wengang Zhou, Zhenbo Lu, Wanli Ouyang, Houqiang Li
Different from existing distillation solutions that transfer the knowledge of a pre-trained and fixed teacher to the student, in CMD, the knowledge is continuously updated and bidirectionally distilled between modalities during pre-training.
no code implementations • 17 Aug 2023 • Yuechen Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li
Visual storytelling aims to generate a narrative based on a sequence of images, necessitating both vision-language alignment and coherent story generation.
1 code implementation • 15 Oct 2022 • Yonghui Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li
To this end, we propose UDoc-GAN, the first framework to address the problem of document illumination correction under the unpaired setting.
no code implementations • 28 Sep 2022 • Yang He, Yuheng Jia, Liyang Hu, Chengchuan An, Zhenbo Lu, Jingxin Xia
In this study, we proposed a Parameter-Free Non-Convex Tensor Completion model (TC-PFNC) for traffic data recovery, in which a log-based relaxation term was designed to approximate tensor algebraic rank.
1 code implementation • 26 Aug 2022 • Yunyao Mao, Wengang Zhou, Zhenbo Lu, Jiajun Deng, Houqiang Li
In this work, we formulate the cross-modal interaction as a bidirectional knowledge distillation problem.
1 code implementation • 8 May 2022 • Qing Li, Wengang Zhou, Zhenbo Lu, Houqiang Li
Actor-critic Reinforcement Learning (RL) algorithms have achieved impressive performance in continuous control tasks.
1 code implementation • 22 Feb 2022 • Zeyu Fang, Jian Zhao, Mingyu Yang, Wengang Zhou, Zhenbo Lu, Houqiang Li
In our approach, we regard each camera as an agent and address AMOT with a multi-agent reinforcement learning solution.