no code implementations • COLING 2022 • Xin Guan, Biwei Cao, Qingqing Gao, Zheng Yin, Bo Liu, Jiuxin Cao
In this paper, we propose a novel model, Co-Reasoning Network (CORN), which adopts a bidirectional multi-level connection structure based on Co-Attention Transformer.
no code implementations • 7 May 2024 • Ziqing Zhu, Guan Yuan, Tao Zhou, Jiuxin Cao
With the generated alignment matrices, the method could enhance the fusion degree of the global community by detecting overlapping user communities across networks.
no code implementations • 2 Jan 2024 • Xuelin Zhu, Jian Liu, Dongqi Tang, Jiawei Ge, Weijia Liu, Bo Liu, Jiuxin Cao
Identifying labels that did not appear during training, known as multi-label zero-shot learning, is a non-trivial task in computer vision.
no code implementations • 7 Dec 2023 • Xuelin Zhu, Jiuxin Cao, Jian Liu, Dongqi Tang, Furong Xu, Weijia Liu, Jiawei Ge, Bo Liu, Qingpei Guo, Tianyi Zhang
Pre-trained vision-language models have notably accelerated progress of open-world concept recognition.
no code implementations • 28 Nov 2023 • Jiawei Ge, Xiangmei Chen, Jiuxin Cao, Xuelin Zhu, Bo Liu
However, current VL trackers have not fully exploited the power of VL learning, as they suffer from limitations such as heavily relying on off-the-shelf backbones for feature extraction, ineffective VL fusion designs, and the absence of VL-related loss functions.
no code implementations • 18 Sep 2023 • Tianyi Song, Jiuxin Cao, Kun Wang, Bo Liu, Xiaofeng Zhang
The current state-of-the-art method combines the features of historical captions, historical frames, and the current captions as conditions for generating the current frame.
no code implementations • 10 May 2023 • Xin Guan, Biwei Cao, Qingqing Gao, Zheng Yin, Bo Liu, Jiuxin Cao
Commonsense question answering (QA) research requires machines to answer questions based on commonsense knowledge.
no code implementations • 31 Mar 2023 • Biwei Cao, Lulu Hua, Jiuxin Cao, Jie Gui, Bo Liu, James Tin-Yau Kwok
Different from popular methods which take full advantage of the propagation topology structure, in this paper, we propose a novel framework for fake news detection from perspectives of semantic, emotion and data enhancement, which excavates the emotional evolution patterns of news participants during the propagation process, and a dual deep interaction channel network of semantic and emotion is designed to obtain a more comprehensive and fine-grained news representation with the consideration of comments.
no code implementations • ICCV 2023 • Xuelin Zhu, Jian Liu, Weijia Liu, Jiawei Ge, Bo Liu, Jiuxin Cao
Multi-label image classification refers to assigning a set of labels for an image.
no code implementations • 16 Nov 2022 • Biwei Cao, Jiuxin Cao, Jie Gui, Jiayun Shen, Bo Liu, Lei He, Yuan Yan Tang, James Tin-Yau Kwok
Such approaches, however, ignore the VE's unique nature of relation inference between the premise and hypothesis.
1 code implementation • ACMMM 2022 • Xuelin Zhu, Jiuxin Cao, Jiawei Ge, Weijia Liu, Bo Liu
Specifically, in each layer of TSFormer, a cross-modal attention module is developed to aggregate visual features from spatial stream into semantic stream and update label semantics via a residual connection.
1 code implementation • 7 Jun 2021 • Jie Gui, Xiaofeng Cong, Yuan Cao, Wenqi Ren, Jun Zhang, Jing Zhang, Jiuxin Cao, DaCheng Tao
With the development of convolutional neural networks, hundreds of deep learning based dehazing methods have been proposed.