Authors:
Veeru Dumpala 1; Sheela Raju Kurupathi 1; Syed Saqib Bukhari 2 and Andreas Dengel 3
Affiliations:
1 University of Kaiserslautern, Germany; 2 German Research Center for Artificial Intelligence (DFKI), Kaiserslautern, Germany; 3 University of Kaiserslautern, Germany, and German Research Center for Artificial Intelligence (DFKI), Kaiserslautern, Germany
Keyword(s):
Historical Documents, Degradations, Document Binarization, Conditional GANs.
Abstract:
One of the most crucial problems in the document analysis and OCR pipeline is document binarization. Over the past few decades, many traditional algorithms such as Sauvola, Niblack, and Otsu have been used for binarization, but they give insufficient results for historical texts with degradations. Recently, many attempts have been made to solve binarization using deep learning approaches such as autoencoders and FCNs. However, these models do not generalize well, qualitatively, to real-world historical document images. In this paper, we propose a model based on the conditional GAN, which is well known for high-resolution image synthesis. Here, the proposed model is used for an image manipulation task: removing different degradations in historical documents, such as stains, bleed-through, and non-uniform shading. The proposed model outperforms recent state-of-the-art models for document image binarization. We support our claims by benchmarking the proposed model on the publicly available PHIBC 2012, DIBCO (2009-2017), and Palm Leaf datasets. The main objective of this paper is to illuminate the advantages of generative modeling and adversarial training for document image binarization in a supervised setting, which shows good generalization capabilities on different inter/intra-class domain document images.