Search Results for author: Zhonghua Zhai

Found 5 papers, 0 papers with code

Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

no code implementations • 26 Apr 2024 • Zhengze Xu, Mengting Chen, Zhao Wang, Linyu Xing, Zhonghua Zhai, Nong Sang, Jinsong Lan, Shuai Xiao, Changxin Gao

To generate coherent motions, we first leverage the Kalman filter to construct smooth crops in the focus tunnel and inject the position embedding of the tunnel into attention layers to improve the continuity of the generated videos.

Virtual Try-on

Paper
Add Code

Cell Variational Information Bottleneck Network

no code implementations • 22 Mar 2024 • Zhonghua Zhai, Chen Ju, Jinsong Lan, Shuai Xiao

In this work, we propose Cell Variational Information Bottleneck Network (cellVIB), a convolutional neural network using information bottleneck mechanism, which can be combined with the latest feedforward network architecture in an end-to-end training method.

Face Recognition Representation Learning

Paper
Add Code

Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment

no code implementations • 19 Mar 2024 • Mengting Chen, Xi Chen, Zhonghua Zhai, Chen Ju, Xuewen Hong, Jinsong Lan, Shuai Xiao

This paper introduces a novel framework for virtual try-on, termed Wear-Any-Way.

Virtual Try-on

Paper
Add Code

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models

no code implementations • 12 Dec 2023 • Chen Ju, Haicheng Wang, Zeqian Li, Xu Chen, Zhonghua Zhai, Weilin Huang, Shuai Xiao

Vision-Language Large Models (VLMs) have become primary backbone of AI, due to the impressive performance.

Paper
Add Code

Image to Multi-Modal Retrieval for Industrial Scenarios

no code implementations • 6 May 2023 • Zida Cheng, Chen Ju, Xu Chen, Zhonghua Zhai, Shuai Xiao, Xiaoyi Zeng, Weilin Huang

We formally define a novel valuable information retrieval task: image-to-multi-modal-retrieval (IMMR), where the query is an image and the doc is an entity with both image and textual description.

Cross-Modal Retrieval Information Retrieval +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.