Search Results for author: Zixiao Wang

Found 9 papers, 1 papers with code

Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition

no code implementations • 9 May 2024 • Zuan Gao, Yuxin Wang, Yadong Qu, Boqiang Zhang, Zixiao Wang, Jianjun Xu, Hongtao Xie

At the pixel level, we reconstruct the original and inverted images to capture character shapes and texture-level linguistic context.

Paper
Add Code

ChatPattern: Layout Pattern Customization via Natural Language

no code implementations • 15 Mar 2024 • Zixiao Wang, Yunheng Shen, Xufeng Yao, Wenqian Zhao, Yang Bai, Farzan Farnia, Bei Yu

Existing works focus on fixed-size layout pattern generation, while the more practical free-size pattern generation receives limited attention.

Language Modelling Large Language Model

Paper
Add Code

Spectral Clustering for Discrete Distributions

no code implementations • 25 Jan 2024 • Zixiao Wang, Dong Qiao, Jicong Fan

Discrete distribution clustering (D2C) was often solved by Wasserstein barycenter methods.

Clustering Computational Efficiency

Paper
Add Code

Identification and Estimation for Nonignorable Missing Data: A Data Fusion Approach

no code implementations • 15 Nov 2023 • Zixiao Wang, AmirEmad Ghassami, Ilya Shpitser

We consider the task of identifying and estimating a parameter of interest in settings where data is missing not at random (MNAR).

Paper
Add Code

On the Evaluation of Generative Models in Distributed Learning Tasks

no code implementations • 18 Oct 2023 • Zixiao Wang, Farzan Farnia, Zhenghao Lin, Yunheng Shen, Bei Yu

First, we focus on the Fr\'echet inception distance (FID) and consider the following FID-based aggregate scores over the clients: 1) FID-avg as the mean of clients' individual FID scores, 2) FID-all as the FID distance of the trained model to the collective dataset containing all clients' data.

Avg Federated Learning

Paper
Add Code

Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition

1 code implementation • 8 Oct 2023 • Zixiao Wang, Hongtao Xie, Yuxin Wang, Jianjun Xu, Boqiang Zhang, Yongdong Zhang

In this paper, we explore the potential of the Contrastive Language-Image Pretraining (CLIP) model in scene text recognition (STR), and establish a novel Symmetrical Linguistic Feature Distillation framework (named CLIP-OCR) to leverage both visual and linguistic knowledge in CLIP.

Optical Character Recognition (OCR) Scene Text Recognition