Search Results for author: Wancai Zhang

Found 3 papers, 3 papers with code

Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

2 code implementations • 14 Nov 2023 • Peng Jin, Ryuichi Takanobu, Wancai Zhang, Xiaochun Cao, Li Yuan

Large language models have demonstrated impressive universal capabilities across a wide range of open-ended tasks and have extended their utility to encompass multimodal conversations.

Ranked #1 on Image-based Generative Performance Benchmarking on ImageInstruct

Image-based Generative Performance Benchmarking Language Modelling +9

670

Paper
Code

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

4 code implementations • 3 Oct 2023 • Bin Zhu, Bin Lin, Munan Ning, Yang Yan, Jiaxi Cui, Hongfa Wang, Yatian Pang, Wenhao Jiang, Junwu Zhang, Zongwei Li, Wancai Zhang, Zhifeng Li, Wei Liu, Li Yuan

We thus propose VIDAL-10M with Video, Infrared, Depth, Audio and their corresponding Language, naming as VIDAL-10M.

Ranked #1 on Zero-shot Audio Classification on VGG-Sound (using extra training data)

Audio Classification Contrastive Learning +12

2,564

Paper
Code

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

7 code implementations • 14 Dec 2020 • Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, JianXin Li, Hui Xiong, Wancai Zhang

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning.

Ranked #1 on Time Series Forecasting on ETTh2 (336) Univariate

Decoder Multivariate Time Series Forecasting +2

5,026

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.