Search Results for author: Xiaoguang Hu

Found 16 papers, 12 papers with code

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

no code implementations • 5 Jun 2023 • Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, MingYu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai

It is hoped that this competition will attract many researchers in the field of CV and NLP, and bring some new thoughts to the field of Document AI.

Document AI Entity Linking +1

Paper
Add Code

A bioinspired three-stage model for camouflaged object detection

no code implementations • 22 May 2023 • Tianyou Chen, Jin Xiao, Xiaoguang Hu, Guofeng Zhang, Shaojie Wang

Furthermore, considering the significance of multi-scale information, we have designed a multi-scale feature enhancement module that enlarges the receptive field while preserving detailed structural cues.

Object object-detection +1

Paper
Add Code

PP-YOLOE-R: An Efficient Anchor-Free Rotated Object Detector

2 code implementations • 4 Nov 2022 • Xinxin Wang, Guanzhong Wang, Qingqing Dang, Yi Liu, Xiaoguang Hu, dianhai yu

With multi-scale training and testing, PP-YOLOE-R-l and PP-YOLOE-R-x further improve the detection precision to 80. 02 and 80. 73 mAP.

Ranked #6 on Oriented Object Detection on DOTA 1.0

Object object-detection +3

21,695

Paper
Code

PP-StructureV2: A Stronger Document Analysis System

1 code implementation • 11 Oct 2022 • Chenxia Li, Ruoyu Guo, Jun Zhou, Mengtao An, Yuning Du, Lingfeng Zhu, Yi Liu, Xiaoguang Hu, dianhai yu

For Table Recognition model, we utilize PP-LCNet, CSP-PAN and SLAHead to optimize the backbone module, feature fusion module and decoding module, respectively, which improved the table structure accuracy by 6\% with comparable inference speed.

Ranked #1 on Network Pruning on CIFAR-100 (Inference Time (ms) metric)

Key Information Extraction Knowledge Distillation +3

39,416

Paper
Code

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System

1 code implementation • 7 Jun 2022 • Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, dianhai yu, Yanjun Ma

For text recognizer, the base model is replaced from CRNN to SVTR, and we introduce lightweight text recognition network SVTR LCNet, guided training of CTC by attention, data augmentation strategy TextConAug, better pre-trained model by self-supervised TextRotNet, UDML, and UIM to accelerate the model and improve the effect.

Data Augmentation Optical Character Recognition +2

39,416

Paper
Code

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

2 code implementations • NAACL (ACL) 2022 • HUI ZHANG, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, dianhai yu, Yanjun Ma, Liang Huang

PaddleSpeech is an open-source all-in-one speech toolkit.

Automatic Speech Recognition (ASR) Environmental Sound Classification +9

10,349

Paper
Code

PP-Matting: High-Accuracy Natural Image Matting

1 code implementation • 20 Apr 2022 • Guowei Chen, Yi Liu, Jian Wang, Juncai Peng, Yuying Hao, Lutao Chu, Shiyu Tang, Zewu Wu, Zeyu Chen, Zhiliang Yu, Yuning Du, Qingqing Dang, Xiaoguang Hu, dianhai yu

Also, we propose a semantic context branch (SCB) that adopts a semantic segmentation subtask.

Ranked #4 on Image Matting on Distinctions-646

Image Matting Semantic Segmentation +1

8,347

Paper
Code

PP-LiteSeg: A Superior Real-Time Semantic Segmentation Model

3 code implementations • 6 Apr 2022 • Juncai Peng, Yi Liu, Shiyu Tang, Yuying Hao, Lutao Chu, Guowei Chen, Zewu Wu, Zeyu Chen, Zhiliang Yu, Yuning Du, Qingqing Dang, Baohua Lai, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

Real-world applications have high demands for semantic segmentation methods.

Ranked #4 on Real-Time Semantic Segmentation on Cityscapes val

Decoder Real-Time Semantic Segmentation +1

8,347

Paper
Code

PP-PicoDet: A Better Real-Time Object Detector on Mobile Devices

4 code implementations • 1 Nov 2021 • Guanghua Yu, Qinyao Chang, Wenyu Lv, Chang Xu, Cheng Cui, Wei Ji, Qingqing Dang, Kaipeng Deng, Guanzhong Wang, Yuning Du, Baohua Lai, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

We investigate the applicability of the anchor-free strategy on lightweight object detection models.

Ranked #1 on Object Detection on MSCOCO

Object object-detection +1

12,231

Paper
Code

PP-ShiTu: A Practical Lightweight Image Recognition System

2 code implementations • 1 Nov 2021 • Shengyu Wei, Ruoyu Guo, Cheng Cui, Bin Lu, Shuilong Dong, Tingquan Gao, Yuning Du, Ying Zhou, Xueying Lyu, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

In recent years, image recognition applications have developed rapidly.

Face Recognition Knowledge Distillation +4

5,294

Paper
Code

PP-LCNet: A Lightweight CPU Convolutional Neural Network

8 code implementations • 17 Sep 2021 • Cheng Cui, Tingquan Gao, Shengyu Wei, Yuning Du, Ruoyu Guo, Shuilong Dong, Bin Lu, Ying Zhou, Xueying Lv, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

We propose a lightweight CPU network based on the MKLDNN acceleration strategy, named PP-LCNet, which improves the performance of lightweight models on multiple tasks.

Image Classification object-detection +2

39,416

Paper
Code

PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

3 code implementations • 7 Sep 2021 • Yuning Du, Chenxia Li, Ruoyu Guo, Cheng Cui, Weiwei Liu, Jun Zhou, Bin Lu, Yehua Yang, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

Optical Character Recognition (OCR) systems have been widely used in various of application scenarios.

Optical Character Recognition Optical Character Recognition (OCR)

39,416

Paper
Code

PP-YOLOv2: A Practical Object Detector

1 code implementation • 21 Apr 2021 • Xin Huang, Xinxin Wang, Wenyu Lv, Xiaying Bai, Xiang Long, Kaipeng Deng, Qingqing Dang, Shumin Han, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma, Osamu Yoshie

To meet these two concerns, we comprehensively evaluate a collection of existing refinements to improve the performance of PP-YOLO while almost keep the infer time unchanged.

Object Real-Time Object Detection