skip to main content
10.1145/3573128.3604903acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
panel

Quality, Space and Time Competition on Binarizing Photographed Document Images

Published: 22 August 2023 Publication History

Abstract

Document image binarization is a fundamental step in many document processes. No binarization algorithm performs well on all types of document images, as the different kinds of digitalization devices and the physical noises present in the document and acquired in the digitalization process alter their performance. Besides that, the processing time is also an important factor that may restrict its applicability. This competition on binarizing photographed documents assessed the quality, time, space, and performance of five new algorithms and sixty-four "classical" and alternative algorithms. The evaluation dataset is composed of laser and deskjet printed documents, photographed using six widely-used mobile devices with the strobe flash on and off, under two different angles and places of capture.

References

[1]
Younes Akbari et al. 2019. Binarization of Degraded Document Images using Convolutional Neural Networks based on predicted Two-Channel Images. In ICDAR.
[2]
Elisa H. Barney Smith, Laurence Likforman-Sulem, and Jérôme Darbon. 2010. Effect of Pre-processing on Binarization. In Document Recognition and Retrieval XVII. 75340H.
[3]
Bilal Bataineh et al. 2011. An adaptive local bin. method for doc. images based on a novel thresh. method and dynamic windows. Ptrn. Recog. Letters 32, 14 (2011).
[4]
Suman Kumar Bera et al. 2021. A non-parametric binarization method based on ensemble of clustering algorithms. Multimedia Tools and Applications 80, 5 (2021), 7653--7673.
[5]
J Bernsen. 1986. Dynamic thresholding of gray-level images. In International Conference on Pattern Recognition. 1251--1255.
[6]
Showmik Bhowmik, Ram Sarkar, Bishwadeep Das, and David Doermann. 2018. GiB: a Game theory Inspired Binarization technique for degraded document images. IEEE Transactions on Image Processing 28, 3 (2018), 1443--1455.
[7]
Bolan Su, Shijian Lu, and Chew Lim Tan. 2013. Robust Document Image Binarization Technique for Degraded Document Images. T. on I. Processing 22, 4 (2013), 1408--1417.
[8]
Derek Bradley and Gerhard Roth. 2007. Adaptive Thresholding using the Integral Image. Journal of Graphics Tools 12, 2 (2007), 13--21.
[9]
Jorge Calvo-Zaragoza and Antonio-Javier Gallego. 2019. A selectional auto-encoder approach for document image binarization. Pattern Recognition 86 (2019), 37--47.
[10]
W. Doyle. 1962. Operations Useful for Similarity-Invariant Pattern Recognition. J. ACM 9, 2 (1962), 259--267.
[11]
Rafael Dueire Lins, Rodrigo Bernardino, and Darlisson Marinho Jesus. 2019. A Quality and Time Assessment of Binarization Algorithms. In 2019 International Conference on Document Analysis and Recognition (ICDAR). 1444--1450. https://doi.org/10.1109/ICDAR.2019.00232
[12]
Gabriel Pereira e Silva and Rafael Dueire Lins. 2007. PhotoDoc: A Toolbox for Processing Document Images Acquired Using Portable Digital Cameras. In Camera-Based Document Analysis and Recognition (CBDAR 2007), Vol. 1. 107--115.
[13]
Abdeljalil Gattal, Faycel Abbas, and Mohamed Ridda Laouar. 2018. Automatic Parameter Tuning of K-Means Algorithm for Document Binarization. In 7th ICSENT. ACM Press, 1--4.
[14]
C Glasbey. 1993. An Analysis of Histogram-Based Thresholding Algorithms. Graphical Models and Image Processing 55, 6 (1993), 532--537.
[15]
Zineb Hadjadj, Abdelkrim Meziane, Yazid Cherfa, Mohamed Cheriet, and Insaf Setitra. [n.d.]. ISauvola: Improved Sauvola's Algorithm for Document Image Binarization. 737--745.
[16]
Sheng He and Lambert Schomaker. 2019. DeepOtsu: Document Enhancement and Binarization using Iterative Deep Learning. Pattern Recognition 91 (jan 2019), 379--390.
[17]
Nicholas R. Howe. 2013. Doc. binarization with automatic parameter tuning. IJDAR 16 (2013).
[18]
Liang Kai Huang and Mao Jiun J. Wang. 1995. Image thresholding by minimizing the measures of fuzziness. Pattern Recognition 28, 1 (1995), 41--51.
[19]
Fuxi Jia, Cunzhao Shi, Kun He, Chunheng Wang, and Baihua Xiao. 2018. Degraded document image binarization using structural symmetry of strokes. Pattern Recognition 74 (2018), 225--240.
[20]
Hiuyi Cheng Fengjun Guo Kai Ding Lianwen Jin Jiaxin Zhang, Bangdong Chen. 2012. DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures. Computerworld 2 (2012). https://doi.org/10.48550/arXiv.2306.05749
[21]
J Johannsen, G and Bille. 1982. A threshold selection method using information measures. In Int'l Conf. Pattern Recognition. 140--143.
[22]
J.N. Kapur, P.K. Sahoo, and A.K.C. Wong. [n.d.]. A new method for gray-level picture thresholding using the entropy of the histogram. C. Vision, Graphics, I. Processing 29, 1 ([n.d.]), 140.
[23]
Ergina Kavallieratou. 2005. A binarization algorithm specialized on document images and photos. ICDAR 2005, 1 (2005), 463--467.
[24]
Ergina Kavallieratou and Stamatatos Stathis. 2006. Adaptive binarization of historical document images. Proceedings - International Conference on Pattern Recognition 3 (2006), 742--745.
[25]
Khurram Khurshid, Imran Siddiqi, Claudie Faure, and Nicole Vincent. 2009. Comparison of Niblack inspired binarization methods for ancient documents. In SPIE. 72470U.
[26]
J. Kittler et al. 1986. Minimum error thresholding. Pattrn. Recog. 19, 1 (1986).
[27]
C.H. Li and P.K.S. Tam. 1998. An iterative algorithm for minimum cross entropy thresholding. Pattern Recognition Letters 19, 8 (1998), 771--776.
[28]
R. Lins, G. Pereira e Silva, and Andre Ricardson Gomes e Silva. 2007. Assessing and Improving the Quality of Document Images Acquired with Portable Digital Cameras. In Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), Vol. 2. 569--573. https://doi.org/10.1109/ICDAR.2007.4376979
[29]
Rafael Dueire Lins. 2009. A Taxonomy for Noise in Images of Paper Documents - The Physical Noises. In Lecture Notes in Computer Science, Vol. 5627 LNCS. 844--854. https://doi.org/10.1007/978-3-642-02611-9_83
[30]
Rafael Dueire Lins, R. B. Bernardino, et al. 2021. DocEng'2021 Direct Binarization A Quality-and-Time Efficient Binarization Strategy. In DocEng 2021. ACM.
[31]
Rafael Dueire Lins, Rodrigo Barros Bernardino, Ricardo Barboza, and Steven J. Simske. 2022. DocEng'2022 Quality, Space, and Time Competition on Binarizing Photographed Documents. In DocEng'22. ACM, 1--4.
[32]
Rafael Dueire Lins, Rodrigo Barros Bernardino, Elisa Barney Smith, and Ergina Kavallieratou. [n.d.]. ICDAR 2021 Competition on Time-Quality Document Image Binarization. In ICDAR 2021. 1539--1546. https://doi.org/10.1007/978-3-030-86337-1_47
[33]
Rafael Dueire Lins, R. B. Bernardino, and et.al. 2017. Binarizing Document Images Acquired with Portable Cameras. In 2017 14th ICDAR. IEEE.
[34]
Rafael Dueire Lins, Rodrigo Barros Bernardino, and Steven J. Simske. 2021. DocEng'2021 Time-Quality Competition on Binarizing Photographed Documents. In DocEng'2021. ACM, 1--4. https://doi.org/10.1145/3395027.3419578
[35]
Rafael Dueire Lins, Ergina Kavallieratou, Elisa Barney Smith, Rodrigo Barros Bernardino, and Darlisson Marinho de Jesus. [n.d.]. ICDAR 2019 Time-Quality Binarization Competition. In ICDAR. 1539--1546. https://doi.org/10.1109/ICDAR.2019.00248
[36]
Rafael Dueire Lins and Domingos Machado. 2004. Comparative study of file formats for image storage and transmission. Journal of Electronic Imaging 13, 1 (2004), 175--181. https://doi.org/10.1117/1.1634591
[37]
Rafael Dueire Lins, Steven J. Simske, and Rodrigo Barros Bernardino. 2020. DocEng'2020 Time-Quality Competition on Binarizing Photographed Documents. In DocEng '20: ACM Symposium on Document Engineering 2020, Virtual Event, CA, USA, September 29 - October 1, 2020. ACM. https://doi.org/10.1145/3395027.3419578
[38]
Shijian Lu, Bolan Su, and Chew Lim Tan. 2010. Document image binarization using background estimation and stroke edges. IJDAR 13, 4 (2010), 303--314.
[39]
Wu Lu, Ma Songde, and Hanqing Lu. 1998. An effective entropic thresholding for ultrasonic images. 14th ICPR (1998), 1552--1554, vol. 2.
[40]
Carlos A.B. Mello and Rafael Dueire Lins. 2002. Generation of images of historical documents by composition. In DocEng '02: Proceedings of the 2002 ACM symposium on Document engineering. 127--133. https://doi.org/10.1145/585058.585082
[41]
Carlos A. B. Mello and Rafael Dueire Lins. 2000. Image segmentation of historical documents. Visual 2000 (2000).
[42]
Hubert Michalak and Krzysztof Okarma. 2019. Adaptive image binarization based on multi-layered stack of regions. In International Conference on Computer Analysis of Images and Patterns. Springer, 281--293.
[43]
Hubert Michalak and Krzysztof Okarma. 2019. Fast Binarization of Unevenly Illuminated Document Images Based on Background Estimation for Optical Character Recognition Purposes. J. Univers. Comput. Sci. 25, 6 (2019), 627--646.
[44]
Hubert Michalak and Krzysztof Okarma. 2019. Improvement of image binarization methods using image preprocessing with local entropy filtering for alphanumerical character recognition purposes. entropy 21, 6 (2019), 562.
[45]
Wan Azani Mustafa and Mohamed Mydin M. Abdul Kader. 2018. Binarization of Document Image Using Optimum Threshold Modification. J. Physics: C. Series 1019, 1 (2018), 012022.
[46]
Wayne Niblack. 1985. An introduction to digital image processing. Strandberg.
[47]
Nobuyuki Otsu. 1979. A threshold selection method from gray-level histograms. IEEE T. on Systems, Man, and Cybernetics 9, 1 (1979), 62--66.
[48]
Ioannis Pratikakis et al. 2019. ICDAR 2019 Competition on Document Image Binarization. In ICDAR.
[49]
Judith M. S. Prewitt and Mortimer L. Mendelsohn. 2006. The Analysis of Cell Images. Annals of the New York Academy of Sciences 128, 3 (2006), 1035--1053.
[50]
T. Pun. 1981. Entropic thresholding, a new approach. Computer Graphics and Image Processing 16, 3 (1981), 210--239.
[51]
A.H. Robinson and C. Cherry. 1967. Results of a prototype television bandwidth compression scheme. Proc. IEEE 55, 3 (1967), 356--364. https://doi.org/10.1109/PROC.1967.5493
[52]
Khairun Saddami, Putri Afrah, Viska Mutiawani, and Fitri Arnia. 2018. A New Adaptive Thresholding Technique for Binarizing Ancient Document. In INAPR. IEEE, 57--61.
[53]
Khairun Saddami, Khairul Munadi, Yuwaldi Away, and Fitri Arnia. 2019. Effective and fast binarization method for combined degradation on ancient documents. Heliyon (2019).
[54]
Khairun Saddami, Khairul Munadi, Sayed Muchallil, and Fitri Arnia. 2017. Improved Thresholding Method for Enhancing Jawi Binarization Performance. In ICDAR, Vol. 1. IEEE.
[55]
Prasanna Sahoo, Carrye Wilkins, and Jerry Yeager. 1997. Threshold selection using Renyi's entropy. Pattern Recognition 30, 1 (1997), 71--84.
[56]
Jaakko Sauvola, Tapio Seppanen, Sami Haapakoski, and Matti Pietikainen. 1997. Adaptive document binarization. In ICDAR, Vol. 1. IEEE Comput. Soc, 147--152.
[57]
A.G. G Shanbhag. 1994. Utilization of Information Measure as a Means of Image Thresholding. CVGIP: Graphical Models and Image Processing 56, 5 (1994), 414--419.
[58]
João Marcelo M. Silva, Rafael Dueire Lins, and Valdemar C Rocha. 2006. Binarizing and Filtering Historical Documents with Back-to-Front Interference. In ACM SAC 2006. 853--858. https://doi.org/10.1145/1141277.1141471
[59]
Steven J. Simske. 2013. Meta-Algorithmics: Patterns for Robust, Low Cost, High Quality Systems. Wiley-IEEE Press.
[60]
T. Romen Singh, Sudipta Roy, O. Imocha Singh, Tejmani Sinam, and Kh. Manglem Singh. 2011. A New Local Adaptive Thresholding Technique in Binarization. IJCSI 08, 6 (2011), 271--277.
[61]
Mohamed Ali Souibgui, Sanket Biswas, Sana Khamekhem Jemni, Yousri Kessentini, Alicia Fornés, Josep Lladós, and Umapada Pal. 2022. DocEnTr: An end-to-end document image enhancement transformer. In 2022 26th International Conference on Pattern Recognition (ICPR).
[62]
Mohamed Ali Souibgui and Yousri Kessentini. 2021. DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement. Ptrn. Analysis and Machine Intellig. (2021).
[63]
Mohamed Ali Souibgui and Yousri Kessentini. 2022. DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 3 (2022), 1180--1191. https://doi.org/10.1109/TPAMI.2020.3022406
[64]
Bolan Su, Shijian Lu, and Chew Lim Tan. 2010. Binarization of historical document images using the local maximum and minimum. In 8th IAPR DAS. ACM Press, 159--166.
[65]
Richard Trenholm. 2021. History of digital cameras: From '70s prototypes to iPhone and Galaxy's everyday wonders, Vol. 2. https://doi.org/tech/computing/history-of-digital-cameras-from-70s-prototypes-to-iphone-and-galaxys-everyday-wonders/
[66]
Wen-Hsiang Tsai. 1985. Moment-preserving thresolding: A new approach. Computer Vision, Graphics, and Image Processing 29, 3 (1985), 377--393.
[67]
Flavio R. Velasco. 1979. Thresholding Using the Isodata Clustering Algorithm. Technical Report. OSD or Non-Service DoD Agency. 14 pages.
[68]
Christian Wolf and David Doermann. 2002. Binarization of low quality text using a Markov random field model. In Object recognition supported by user interaction for service robots, Vol. 3. IEEE Comput. Soc, 160--163.
[69]
Serdar Yegulalp. 2012. Camera phones: A look back and forward Is it a camera that also has a phone, or a phone that also has a camera? The short -- and continuing -- history of camera phones. Computerworld 2 (2012). https://doi.org/web/20191009064125/https://www.computerworld.com/article/2473084/camera-phones--a-look-back-and-forward.html
[70]
F J.; Chang S Yen J. C.; Chang, Jui Cheng Yen, Fu Juay Chang, and Shyang Chang. 1995. A New Criterion for Automatic Multilevel Thresholding. T. on Image Processing 4, 3 (1995), 370--378.
[71]
G W Zack, W E Rogers, and S A Latt. 1977. Automatic measurement of sister chromatid exchange frequency. J. Histochemistry and Cytochemistry 25, 7 (1977), 741--753.
[72]
Lichen Zhou et al. 2018. D-linknet: Linknet with pretrained encoder and dilated convolution for satellite imagery road extraction. In Comp. Vision and Ptrn. Recog.

Cited By

View all
  • (2024)Competition on Binarizing Photographed Document Images 2024 Quality, Time and Space ReportProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3686793(1-12)Online publication date: 20-Aug-2024
  • (2024)CatalogBank: A Structured and Interoperable Catalog Dataset with a Semi-Automatic Annotation Tool (DocumentLabeler) for Engineering System DesignProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685665(1-9)Online publication date: 20-Aug-2024
  • (2024)Handheld Video Document Scanning: A Robust On-Device Model for Multi-Page Document ScanningProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685662(1-9)Online publication date: 20-Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DocEng '23: Proceedings of the ACM Symposium on Document Engineering 2023
August 2023
187 pages
ISBN:9798400700279
DOI:10.1145/3573128
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 August 2023

Check for updates

Author Tags

  1. Binarization
  2. Binarization algorithms
  3. Document Engineering
  4. Photographed Documents
  5. Quality evaluation
  6. Scanned Documents
  7. Space evaluation
  8. Time evaluation

Qualifiers

  • Panel
  • Research
  • Refereed limited

Conference

DocEng '23
Sponsor:
DocEng '23: ACM Symposium on Document Engineering 2023
August 22 - 25, 2023
Limerick, Ireland

Acceptance Rates

DocEng '23 Paper Acceptance Rate 9 of 27 submissions, 33%;
Overall Acceptance Rate 194 of 564 submissions, 34%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)2
Reflects downloads up to 19 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Competition on Binarizing Photographed Document Images 2024 Quality, Time and Space ReportProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3686793(1-12)Online publication date: 20-Aug-2024
  • (2024)CatalogBank: A Structured and Interoperable Catalog Dataset with a Semi-Automatic Annotation Tool (DocumentLabeler) for Engineering System DesignProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685665(1-9)Online publication date: 20-Aug-2024
  • (2024)Handheld Video Document Scanning: A Robust On-Device Model for Multi-Page Document ScanningProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685662(1-9)Online publication date: 20-Aug-2024
  • (2024)How to Choose a Binarization Algorithm for a Document Image?2024 37th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)10.1109/SIBGRAPI62404.2024.10716338(1-6)Online publication date: 30-Sep-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media