1 code implementation • 27 Nov 2022 • Huaishao Luo, Junwei Bao, Youzheng Wu, Xiaodong He, Tianrui Li
The pre-trained model can capture enriched visual concepts for images by learning from a large scale of text-image data.
Ranked #1 on Semantic Segmentation on PASCAL VOC
no code implementations • 11 Oct 2022 • Haoning Zhang, Junwei Bao, Haipeng Sun, Huaishao Luo, Wenye Li, Shuguang Cui
The unlabeled data of the DST task is incorporated into the self-training iterations, where the pseudo labels are predicted by a DST model trained on limited labeled data in advance.
no code implementations • 2 Dec 2021 • Huaishao Luo, Lei Ji, Yanyong Huang, Bin Wang, Shenggong Ji, Tianrui Li
This paper proposes a fusion model named ScaleVLAD to gather multi-Scale representation from text, video, and audio with shared Vectors of Locally Aggregated Descriptors to improve unaligned multimodal sentiment analysis.
no code implementations • ACL 2021 • Kun Yan, Lei Ji, Huaishao Luo, Ming Zhou, Nan Duan, Shuai Ma
Moreover, the controllability and explainability of LoopCAG are validated by analyzing spatial and temporal sensitivity during the generation process.
Ranked #1 on Image Captioning on Localized Narratives
1 code implementation • Findings (ACL) 2021 • Lin Su, Nan Duan, Edward Cui, Lei Ji, Chenfei Wu, Huaishao Luo, Yongfei Liu, Ming Zhong, Taroon Bharti, Arun Sacheti
Comparing with existing multimodal datasets such as MSCOCO and Flicker30K for image-language tasks, YouCook2 and MSR-VTT for video-language tasks, GEM is not only the largest vision-language dataset covering image-language tasks and video-language tasks at the same time, but also labeled in multiple languages.
5 code implementations • 18 Apr 2021 • Huaishao Luo, Lei Ji, Ming Zhong, Yang Chen, Wen Lei, Nan Duan, Tianrui Li
In this paper, we propose a CLIP4Clip model to transfer the knowledge of the CLIP model to video-language retrieval in an end-to-end manner.
Ranked #1 on Text to Video Retrieval on MSR-VTT
no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Huaishao Luo, Yu Shi, Ming Gong, Linjun Shou, Tianrui Li
In this paper, we propose a novel approach that extends the probability vector to a probability matrix.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Huaishao Luo, Lei Ji, Tianrui Li, Nan Duan, Daxin Jiang
Specifically, a cascaded labeling module is developed to enhance the interchange between aspect terms and improve the attention of sentiment tokens when labeling sentiment polarities.
Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +4
2 code implementations • 15 Feb 2020 • Huaishao Luo, Lei Ji, Botian Shi, Haoyang Huang, Nan Duan, Tianrui Li, Jason Li, Taroon Bharti, Ming Zhou
However, most of the existing multimodal models are pre-trained for understanding tasks, leading to a pretrain-finetune discrepancy for generation tasks.
Ranked #2 on Action Segmentation on COIN (using extra training data)
1 code implementation • ACL 2019 • Huaishao Luo, Tianrui Li, Bing Liu, Junbo Zhang
This paper focuses on two related subtasks of aspect-based sentiment analysis, namely aspect term extraction and aspect sentiment classification, which we call aspect term-polarity co-extraction.
Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +3
3 code implementations • 22 Dec 2018 • Bin Wang, Jie Lu, Zheng Yan, Huaishao Luo, Tianrui Li, Yu Zheng, Guangquan Zhang
We cast the weather forecasting problem as an end-to-end deep learning problem and solve it by proposing a novel negative log-likelihood error (NLE) loss function.
1 code implementation • 21 May 2018 • Huaishao Luo, Tianrui Li, Bing Liu, Bin Wang, Herwig Unger
The key idea is to explicitly incorporate both representations gained separately from the bottom-up and top-down propagation on the given dependency syntactic tree.
Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1