Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

29 Nov 2022  ·  Rui-Yang Ju, Yu-Shian Lin, Yanlin Jin, Chih-Chia Chen, Chun-Tse Chien, Jen-Shiun Chiang ·

The efficient segmentation of foreground text information from the background in degraded color document images is a critical challenge in the preservation of ancient manuscripts. The imperfect preservation of ancient manuscripts has led to various types of degradation over time, such as staining, yellowing, and ink seepage, significantly affecting image binarization results. This work proposes a three-stage method using generative adversarial networks (GANs) for the degraded color document images binarization. Stage-1 involves applying discrete wavelet transform (DWT) and retaining the low-low (LL) subband images for image enhancement. In Stage-2, the original input image is split into red, green, and blue (RGB) three single-channel images and one grayscale image, and each image is trained with independent adversarial networks to extract color foreground information. In Stage-3, the output image from Stage-2 and the resized input image are used to train independent adversarial networks for document binarization, enabling the integration of global and local features. The experimental results demonstrate that our proposed method outperforms other traditional and state-of-the-art (SOTA) methods on the Document Image Binarization Contest (DIBCO) datasets. We have released our implementation code at https://github.com/abcpp12383/ThreeStageBinarization.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Binarization DIBCO 2011 Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks PSNR 20.70 # 4
F-Measure 94.97 # 3
DRD 1.68 # 5
Pseudo-F-measure 98.25 # 1
Binarization DIBCO 2013 Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks F-Measure 96.05 # 2
Pseudo-F-measure 98.35 # 1
PSNR 22.48 # 3
DRD 1.58 # 3
Binarization DIBCO 2017 Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks F-Measure 92.04 # 4
DRD 2.82 # 3
PSNR 18.82 # 4
Pseudo-F-measure 94.79 # 3
Binarization H-DIBCO 2014 Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks F-Measure 97.45 # 3
Pseudo-F-measure 98.95 # 1
PSNR 22.49 # 3
DRD 0.94 # 3
Binarization H-DIBCO 2016 Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks F-Measure 92.49 # 3
PSNR 19.96 # 3
DRD 2.77 # 3
Pseudo-F-measure 96.82 # 1
Binarization H-DIBCO 2018 Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks PSNR 20.41 # 2
F-Measure 92.98 # 2
DRD 2.21 # 2
Pseudo-F-measure 97.06 # 1

Methods