Search Results for author: Huaishao Luo

Found 13 papers, 8 papers with code

Hashing based Efficient Inference for Image-Text Matching

no code implementations • Findings (ACL) 2021 • Rong-Cheng Tu, Lei Ji, Huaishao Luo, Botian Shi, Heyan Huang, Nan Duan, Xian-Ling Mao

Image-text matching Text Matching

SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation

1 code implementation • 27 Nov 2022 • Huaishao Luo, Junwei Bao, Youzheng Wu, Xiaodong He, Tianrui Li

The pre-trained model can capture enriched visual concepts for images by learning from large-scale text-image data.

Ranked #1 on Semantic Segmentation on PASCAL VOC

Open Vocabulary Semantic Segmentation Segmentation +1

CSS: Combining Self-training and Self-supervised Learning for Few-shot Dialogue State Tracking

no code implementations • 11 Oct 2022 • Haoning Zhang, Junwei Bao, Haipeng Sun, Huaishao Luo, Wenye Li, Shuguang Cui

The unlabeled data of the DST task is incorporated into the self-training iterations, where the pseudo labels are predicted by a DST model trained on limited labeled data in advance.

Dialogue State Tracking Machine Reading Comprehension +2

ScaleVLAD: Improving Multimodal Sentiment Analysis via Multi-Scale Fusion of Locally Descriptors

no code implementations • 2 Dec 2021 • Huaishao Luo, Lei Ji, Yanyong Huang, Bin Wang, Shenggong Ji, Tianrui Li

This paper proposes a fusion model named ScaleVLAD to gather multi-Scale representation from text, video, and audio with shared Vectors of Locally Aggregated Descriptors to improve unaligned multimodal sentiment analysis.

Multimodal Sentiment Analysis

Control Image Captioning Spatially and Temporally

no code implementations • ACL 2021 • Kun Yan, Lei Ji, Huaishao Luo, Ming Zhou, Nan Duan, Shuai Ma

Moreover, the controllability and explainability of LoopCAG are validated by analyzing spatial and temporal sensitivity during the generation process.

Ranked #1 on Image Captioning on Localized Narratives

Contrastive Learning Image Captioning +1

GEM: A General Evaluation Benchmark for Multimodal Tasks

1 code implementation • Findings (ACL) 2021 • Lin Su, Nan Duan, Edward Cui, Lei Ji, Chenfei Wu, Huaishao Luo, Yongfei Liu, Ming Zhong, Taroon Bharti, Arun Sacheti

Compared with existing multimodal datasets such as MSCOCO and Flickr30K for image-language tasks, and YouCook2 and MSR-VTT for video-language tasks, GEM is not only the largest vision-language dataset covering image-language and video-language tasks at the same time, but is also labeled in multiple languages.

CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval

5 code implementations • 18 Apr 2021 • Huaishao Luo, Lei Ji, Ming Zhong, Yang Chen, Wen Lei, Nan Duan, Tianrui Li

In this paper, we propose a CLIP4Clip model to transfer the knowledge of the CLIP model to video-language retrieval in an end-to-end manner.

Ranked #1 on Text to Video Retrieval on MSR-VTT

Retrieval Text Retrieval +4

MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension

no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Huaishao Luo, Yu Shi, Ming Gong, Linjun Shou, Tianrui Li

In this paper, we propose a novel approach that extends the probability vector to a probability matrix.

Machine Reading Comprehension Position +1

GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Huaishao Luo, Lei Ji, Tianrui Li, Nan Duan, Daxin Jiang

Specifically, a cascaded labeling module is developed to enhance the interchange between aspect terms and improve the attention of sentiment tokens when labeling sentiment polarities.

Ranked #2 on Sentiment Analysis on SemEval 2014 Task 4 Subtask 1+2

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +4

UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

2 code implementations • 15 Feb 2020 • Huaishao Luo, Lei Ji, Botian Shi, Haoyang Huang, Nan Duan, Tianrui Li, Jason Li, Taroon Bharti, Ming Zhou

However, most of the existing multimodal models are pre-trained for understanding tasks, leading to a pretrain-finetune discrepancy for generation tasks.

Ranked #2 on Action Segmentation on COIN (using extra training data)

Action Segmentation Decoder +3

DOER: Dual Cross-Shared RNN for Aspect Term-Polarity Co-Extraction

1 code implementation • ACL 2019 • Huaishao Luo, Tianrui Li, Bing Liu, Junbo Zhang

This paper focuses on two related subtasks of aspect-based sentiment analysis, namely aspect term extraction and aspect sentiment classification, which we call aspect term-polarity co-extraction.

Ranked #6 on Aspect-Based Sentiment Analysis (ABSA) on SemEval 2014 Task 4 Laptop

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +3

Deep Uncertainty Quantification: A Machine Learning Approach for Weather Forecasting

3 code implementations • 22 Dec 2018 • Bin Wang, Jie Lu, Zheng Yan, Huaishao Luo, Tianrui Li, Yu Zheng, Guangquan Zhang

We cast the weather forecasting problem as an end-to-end deep learning problem and solve it by proposing a novel negative log-likelihood error (NLE) loss function.

BIG-bench Machine Learning Uncertainty Quantification +1

Improving Aspect Term Extraction with Bidirectional Dependency Tree Representation

1 code implementation • 21 May 2018 • Huaishao Luo, Tianrui Li, Bing Liu, Bin Wang, Herwig Unger

The key idea is to explicitly incorporate both representations gained separately from the bottom-up and top-down propagation on the given dependency syntactic tree.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

