panel

Binarisation of photographed documents image quality and processing time assessment

Authors:

Rafael Dueire Lins,

Steven J. Simske,

Rodrigo Barros BernardinoAuthors Info & Claims

DocEng '21: Proceedings of the 21st ACM Symposium on Document Engineering

Article No.: 3, Pages 1 - 6

https://doi.org/10.1145/3469096.3470833

Published: 16 August 2021 Publication History

Abstract

Smartphones with cameras are omnipresent in today's world and are very often used to photograph documents. Document binarization is a key process in many document processing platforms. This competition on binarizing photographed documents assessed the quality and time performance of 13 new algorithms and 50 existing algorithms. The evaluation dataset is composed of offset, laser, and deskjet printed documents, photographed using four widely-used mobile devices with the strobe flash on and off, under two different angles and places of capture.

References

[1]

Y. Akbari et. al. 2019. Binarization of Degraded Document Images using Convolutional Neural Networks based on predicted Two-Channel Images. In ICDAR'19.

[2]

Reza Azad, M. Asadi-Aghbolaghi, M. Fathy, and S. Escalera. 2019. Bi-directional ConvLSTM U-net with densley connected convolutions. ICCVW 2019 (2019).

[3]

Bilal Bataineh, S. N. H. S. Abdullah, and K. Omar. 2011. An adaptive local binarization method for document images based on a novel thresholding method and dynamic windows. Pattern Recog. Letters 32, 14 (2011).

Digital Library

[4]

Suman K. Bera et al. 2021. A non-parametric binarization method based on ensemble of clustering algorithms. Multim. Tools and Applications 80, 5 (2021).

[5]

J Bernsen. 1986. Dynamic thresholding of gray-level images. In ICPR.

[6]

Derek Bradley and G. Roth. 2007. Adaptive Thresholding using the Integral Image. Journal of Graphics Tools 12, 2 (2007).

[7]

Jorge Calvo-Zaragoza and A. Gallego. 2019. A selectional auto-encoder approach for document image binarization. Pattern Recog. 86 (2019).

[8]

W. Doyle. 1962. Operations Useful for Similarity-Invariant Pattern Recognition. J. ACM 9, 2 (1962), 259--267.

Digital Library

[9]

R. Dueire Lins, S. J. Simske, and R. B. Bernardino. 2020. DocEng'20 Time-Quality Competition on Binarizing Photographed Documents. In DocEng'20. ACM.

[10]

A. Gattal, F. Abbas, and M. R. Laouar. 2018. Automatic Parameter Tuning of K-Means Algorithm for Document Binarization. In ICSENT.

[11]

C Glasbey. 1993. An Analysis of Histogram-Based Thresholding Algorithms. Graphical Models and Image Processing 55, 6 (1993), 532--537.

Digital Library

[12]

Zineb Hadjadj et al. 2016. ISauvola: Improved Sauvola's Algorithm for Document Image Binarization. Lecture Notes in CS, Vol. 3212. Springer Berlin Heidelberg.

[13]

Sheng He and L. Schomaker. 2019. DeepOtsu: Document Enhancement and Binarization using Iterative Deep Learning. Pattern Recognition 91 (2019).

[14]

Nicholas R. Howe. 2013. Document binarization with automatic parameter tuning. International Journal on Document Analysis and Recognition (IJDAR) 16, 3 (2013).

[15]

L. Kai Huang and M. J. J. Wang. 1995. Image thresholding by minimizing the measures of fuzziness. Pattern Recognition 28, 1 (1995), 41--51.

[16]

Fuxi Jia, C. Shi, K. He, C. Wang, and B. Xiao. 2018. Degraded document image binarization using structural symmetry of strokes. Pattern Recognition 74 (2018).

[17]

J Johannsen, G and Bille. 1982. A threshold selection method using information measures. In Int'l Conf. Pattern Recognition. 140--143.

[18]

J.N. Kapur et al. 1985. A new method for gray-level picture thresholding using the entropy of the histogram. Comp. Vision, Graphics, and Im. Proc. 29, 1 (1985).

[19]

E. Kavallieratou and S. Stathis. 2006. Adaptive binarization of historical document images. ICPR 3 (2006).

[20]

Khurram Khurshid, I. Siddiqi, C. Faure, and N. Vincent. 2009. Comparison of Niblack inspired binarization methods for ancient documents. In SPIE Proceedings, Kathrin Berkner and Laurence Likforman-Sulem (Eds.).

[21]

J. Kittler and J. Illingworth. 1986. Minimum error thresholding. Pattern Recognition 19, 1 (1986), 41--47.

Digital Library

[22]

Xiangmao Kong, G. Sun, Q. Wu, J. Liu, and F. Lin. 2018. Hybrid pyramid u-net model for brain tumor segmentation. In ICIIP. Springer.

[23]

C.H. Li and P.K.S. Tam. 1998. An iterative algorithm for minimum cross entropy thresholding. Pattern Recognition Letters 19, 8 (1998).

Digital Library

[24]

Rafael Dueire Lins, R. B. Bernardino, et al. 2021. DocEng'2021 Direct Binarization A Quality-and-Time Efficient Binarization Strategy. In DocEng 2021. ACM.

[25]

Rafael Dueire Lins, R. B. Bernardino, and et. al. 2017. Binarizing Document Images Acquired with Portable Cameras. In 2017 14th ICDAR. IEEE.

[26]

Rafael Dueire Lins, E. Kavallieratou, E. B. Smith, R. B. Bernardino, and D. M. de Jesus. 2019. ICDAR 2019 Time-Quality Binarization Competition. In 2019 15th ICDAR.

[27]

Shijian Lu, Bolan Su, and Chew Lim Tan. 2010. Document image binarization using background estimation and stroke edges. IJDAR 13, 4 (2010), 303--314.

Digital Library

[28]

Wu Lu, M. Songde, and H. Lu. 1998. An effective entropic thresholding for ultrasonic images. 14th ICPR (1998).

[29]

C A B Mello and Rafael Dueire Lins. 2000. Image segmentation of historical documents. Visual 2000 (2000).

[30]

W. A. Mustafa and A. M. Kader. 2018. Binarization of Document Image Using Optimum Threshold Modification. J. of Physics: Conference Series 1019, 1 (2018).

[31]

Wayne Niblack. 1985. An introduction to digital image processing. Strandberg Publishing Company.

[32]

Sofia A. Oliveira, B. Seguin, and F. Kaplan. 2018. dhSegment: A generic deep-learning approach for document segmentation. CoRR abs/1804.1 (2018).

[33]

Nobuyuki Otsu. 1979. A threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man, and Cybernetics 9, 1 (1979).

[34]

Judith M. S. Prewitt and M. L. Mendelsohn. 2006. The Analysis of Cell Images. Annals of the New York Academy of Sciences 128, 3 (dec 2006), 1035--1053.

[35]

T. Pun. 1981. Entropic thresholding, a new approach. Computer Graphics and Image Processing 16, 3 (1981), 210--239.

[36]

Khairun Saddami, P. Afrah, V. Mutiawani, et al. 2018. A New Adaptive Thresholding Technique for Binarizing Ancient Document. In 2018 INAPR. IEEE.

[37]

Khairun Saddami, K. Munadi, et al. 2017. Improved Thresholding Method for Enhancing Jawi Binarization Performance. In 2017 14th ICDAR, Vol. 1. IEEE.

[38]

Khairun Saddami, K. Munadi, Y. Away, et al. 2019. Combination Local and Global Thresholding Method for Binarizing Ancient Jawi Document. JTIIK (2019).

[39]

Prasanna Sahoo, C. Wilkins, and J. Yeager. 1997. Threshold selection using Renyi's entropy. Pattern Recognition 30, 1 (1997).

[40]

J. Sauvola and M. Pietikäinen. 2000. Adaptive document image binarization. Pattern Recognition 33, 2 (2000).

[41]

A.G. G Shanbhag. 1994. Utilization of Information Measure as a Means of Image Thresholding. CVGIP: Graphical Models and Image Processing 56, 5 (1994).

[42]

J. M. M. Silva, Rafael Dueire Lins, and V. C. Rocha. 2006. Binarizing and Filtering Historical Documents with Back-to-Front Interference. In ACM SIGAPP 2006.

[43]

T. Romen Singh, S. Roy, O. I. Singh, et al. 2011. A New Local Adaptive Thresholding Technique in Binarization. IJCSI 08, 6 (2011).

[44]

Elisa B. Smith, L. Likforman-Sulem, and J. Darbon. 2010. Effect of pre-processing on binarization. In Document Recognition and Retrieval XVII.

[45]

V. Sokratis, E. Kavallieratou, R. Paredes, and K. Sotiropoulos. 2011. A Hybrid Binarization Technique for Document Images. In Studies in Comp. Intelligence.

[46]

M. A. Souibgui and Y. Kessentini. 2021. DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement. IEEE T. P. A. M. Int. (2021).

[47]

Bolan Su, S. Lu, and C. L. Tan. 2010. Binarization of historical document images using the local maximum and minimum. In 8th IAPR DAS '10. ACM Press.

[48]

Wen-Hsiang Tsai. 1985. Moment-preserving thresolding: A new approach. Computer Vision, Graphics, and Image Processing 29, 3 (1985).

[49]

Flavio R. Velasco. 1979. Thresholding Using the Isodata Clustering Algorithm. Technical Report. OSD or Non-Service DoD Agency.

[50]

Christian Wolf et. al. 2003. Text localization, enhancement and binarization in multimedia documents, Vol. 2. IEEE Comput. Soc.

[51]

Jui Cheng Yen, F. J. Chang, and S. Chang. 1995. A New Criterion for Automatic Multilevel Thresholding. IEEE Transactions on Image Processing 4, 3 (1995).

[52]

G W Zack, W E Rogers, and S A Latt. 1977. Automatic measurement of sister chromatid exchange frequency. Journal of Histochem. and Cytochem. 25, 7 (1977).

[53]

Lichen Zhou et al. 2018. D-linknet: Linknet with pretrained encoder and dilated convolution for high resolution satellite imagery road extraction. In IEEE CS CCVPR Workshops.

Cited By

Bank HHerber D(2024)CatalogBank: A Structured and Interoperable Catalog Dataset with a Semi-Automatic Annotation Tool (DocumentLabeler) for Engineering System DesignProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685665(1-9)Online publication date: 20-Aug-2024
https://dl.acm.org/doi/10.1145/3685650.3685665
Bernardino RLins RBarboza R(2023)A Quality, Size and Time Assessment of the Binarization of Documents Photographed by SmartphonesJournal of Imaging10.3390/jimaging90200419:2(41)Online publication date: 13-Feb-2023
https://doi.org/10.3390/jimaging9020041
Bloechle JHennebert JGisler C(2023)YinYang, a Fast and Robust Adaptive Document Image Binarization for Optical Character RecognitionProceedings of the ACM Symposium on Document Engineering 202310.1145/3573128.3609354(1-4)Online publication date: 22-Aug-2023
https://dl.acm.org/doi/10.1145/3573128.3609354

Recommendations

Binarization of photographed documents image quality, processing time and size assessment
DocEng '22: Proceedings of the 22nd ACM Symposium on Document Engineering

Today, over eighty percent of the world's population owns a smart-phone with an in-built camera, and they are very often used to photograph documents. Document binarization is a key process in many document processing platforms. This competition on ...
DocEng'2020 Time-Quality Competition on Binarizing Photographed Documents
DocEng '20: Proceedings of the ACM Symposium on Document Engineering 2020

Document image binarization is a key process in many document processing platforms. The DocEng'2020 Time-Quality Competition on Binarizing Photographed Documents assessed the performance of eight new algorithms and also 41 other "classical" algorithms. ...
Quality, Space and Time Competition on Binarizing Photographed Document Images
DocEng '23: Proceedings of the ACM Symposium on Document Engineering 2023

Document image binarization is a fundamental step in many document processes. No binarization algorithm performs well on all types of document images, as the different kinds of digitalization devices and the physical noises present in the document and ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

DocEng '21: Proceedings of the 21st ACM Symposium on Document Engineering

August 2021

178 pages

ISBN:9781450385961

DOI:10.1145/3469096

General Chairs:
Patrick Healy
University of Limerick, Ireland
,
Mihai Bilauca
University of Limerick, Ireland
,
Program Chair:
Alexandra Bonnici
University of Malta, Malta

Copyright © 2021 Owner/Author.

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 August 2021

Check for updates

Author Tags

Qualifiers

Panel

Funding Sources

CNPq - Brazilian Goverment

Conference

DocEng '21

Sponsor:

SIGWEB

DocEng '21: ACM Symposium on Document Engineering 2021

August 24 - 27, 2021

Limerick, Ireland

Acceptance Rates

Overall Acceptance Rate 194 of 564 submissions, 34%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
56
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 19 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Bank HHerber D(2024)CatalogBank: A Structured and Interoperable Catalog Dataset with a Semi-Automatic Annotation Tool (DocumentLabeler) for Engineering System DesignProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685665(1-9)Online publication date: 20-Aug-2024
https://dl.acm.org/doi/10.1145/3685650.3685665
Bernardino RLins RBarboza R(2023)A Quality, Size and Time Assessment of the Binarization of Documents Photographed by SmartphonesJournal of Imaging10.3390/jimaging90200419:2(41)Online publication date: 13-Feb-2023
https://doi.org/10.3390/jimaging9020041
Bloechle JHennebert JGisler C(2023)YinYang, a Fast and Robust Adaptive Document Image Binarization for Optical Character RecognitionProceedings of the ACM Symposium on Document Engineering 202310.1145/3573128.3609354(1-4)Online publication date: 22-Aug-2023
https://dl.acm.org/doi/10.1145/3573128.3609354

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten