Search Results for author: Zhonghua Zhai

Found 5 papers, 0 papers with code

Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

no code implementations26 Apr 2024 Zhengze Xu, Mengting Chen, Zhao Wang, Linyu Xing, Zhonghua Zhai, Nong Sang, Jinsong Lan, Shuai Xiao, Changxin Gao

To generate coherent motions, we first leverage the Kalman filter to construct smooth crops in the focus tunnel and inject the position embedding of the tunnel into attention layers to improve the continuity of the generated videos.

Virtual Try-on

Cell Variational Information Bottleneck Network

no code implementations22 Mar 2024 Zhonghua Zhai, Chen Ju, Jinsong Lan, Shuai Xiao

In this work, we propose Cell Variational Information Bottleneck Network (cellVIB), a convolutional neural network using information bottleneck mechanism, which can be combined with the latest feedforward network architecture in an end-to-end training method.

Face Recognition Representation Learning

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models

no code implementations12 Dec 2023 Chen Ju, Haicheng Wang, Zeqian Li, Xu Chen, Zhonghua Zhai, Weilin Huang, Shuai Xiao

Vision-Language Large Models (VLMs) have become primary backbone of AI, due to the impressive performance.

Image to Multi-Modal Retrieval for Industrial Scenarios

no code implementations6 May 2023 Zida Cheng, Chen Ju, Xu Chen, Zhonghua Zhai, Shuai Xiao, Xiaoyi Zeng, Weilin Huang

We formally define a novel valuable information retrieval task: image-to-multi-modal-retrieval (IMMR), where the query is an image and the doc is an entity with both image and textual description.

Cross-Modal Retrieval Information Retrieval +2

Cannot find the paper you are looking for? You can Submit a new open access paper.