Search Results for author: Zihan Xu

Found 11 papers, 5 papers with code

Sinkhorn Distance Minimization for Knowledge Distillation

1 code implementation • 27 Feb 2024 • Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou, Houqiang Li

We propose the Sinkhorn Knowledge Distillation (SinKD) that exploits the Sinkhorn distance to ensure a nuanced and precise assessment of the disparity between teacher and student distributions.

Decoder Knowledge Distillation

Paper
Code

Towards Robust Text Retrieval with Progressive Learning

1 code implementation • 20 Nov 2023 • Tong Wu, Yulei Qin, Enwei Zhang, Zihan Xu, Yuting Gao, Ke Li, Xing Sun

However, existing embedding models for text retrieval usually have three non-negligible limitations.

Machine Reading Comprehension Question Answering +2

Paper
Code

Devil in the Number: Towards Robust Multi-modality Data Filter

no code implementations • 24 Sep 2023 • Yichen Xu, Zihan Xu, Wenhao Chai, Zhonghan Zhao, Enxin Song, Gaoang Wang

In order to appropriately filter multi-modality data sets on a web-scale, it becomes crucial to employ suitable filtering methods to boost performance and reduce training costs.

Paper
Add Code

SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger

no code implementations • 30 Mar 2023 • Yuting Gao, Jinfeng Liu, Zihan Xu, Tong Wu Enwei Zhang, Wei Liu, Jie Yang, Ke Li, Xing Sun

During the preceding biennium, vision-language pre-training has achieved noteworthy success on several downstream tasks.

Zero-Shot Learning

Paper
Add Code

PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining

no code implementations • 29 Apr 2022 • Yuting Gao, Jinfeng Liu, Zihan Xu, Jun Zhang, Ke Li, Rongrong Ji, Chunhua Shen

Large-scale vision-language pre-training has achieved promising results on downstream tasks.

Image Classification Language Modelling +3

Paper
Add Code

ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser

no code implementations • NAACL 2021 • Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu, Kai Yu

Given a database schema, Text-to-SQL aims to translate a natural language question into the corresponding SQL query.

Decoder Semantic Parsing +1

Paper
Add Code

ReCU: Reviving the Dead Weights in Binary Neural Networks

3 code implementations • ICCV 2021 • Zihan Xu, Mingbao Lin, Jianzhuang Liu, Jie Chen, Ling Shao, Yue Gao, Yonghong Tian, Rongrong Ji

We prove that reviving the "dead weights" by ReCU can result in a smaller quantization error.

Binarization Quantization

Paper
Code

SiMaN: Sign-to-Magnitude Network Binarization

2 code implementations • 16 Feb 2021 • Mingbao Lin, Rongrong Ji, Zihan Xu, Baochang Zhang, Fei Chao, Chia-Wen Lin, Ling Shao

In this paper, we show that our weight binarization provides an analytical solution by encoding high-magnitude weights into +1s, and 0s otherwise.

Binarization

Paper
Code

Answer-driven Deep Question Generation based on Reinforcement Learning

no code implementations • COLING 2020 • Liuyin Wang, Zihan Xu, Zibo Lin, Haitao Zheng, Ying Shen

First, we propose an answer-aware initialization module with a gated connection layer which introduces both document and answer information to the decoder, thus helping to guide the choice of answer-focused question words.

Decoder Question Generation +3

Paper
Add Code

Rotated Binary Neural Network

2 code implementations • NeurIPS 2020 • Mingbao Lin, Rongrong Ji, Zihan Xu, Baochang Zhang, Yan Wang, Yongjian Wu, Feiyue Huang, Chia-Wen Lin

In this paper, for the first time, we explore the influence of angular bias on the quantization error and then introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version.

Binarization Quantization

Paper
Code

CREDIT: Coarse-to-Fine Sequence Generation for Dialogue State Tracking

no code implementations • 22 Sep 2020 • Zhi Chen, Lu Chen, Zihan Xu, Yanbin Zhao, Su Zhu, Kai Yu

In dialogue systems, a dialogue state tracker aims to accurately find a compact representation of the current dialogue status, based on the entire dialogue history.

Dialogue State Tracking

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.