skip to main content
10.1145/3395027.3419578acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
tutorial

DocEng'2020 Time-Quality Competition on Binarizing Photographed Documents

Published: 29 September 2020 Publication History

Abstract

Document image binarization is a key process in many document processing platforms. The DocEng'2020 Time-Quality Competition on Binarizing Photographed Documents assessed the performance of eight new algorithms and also 41 other "classical" algorithms. Besides the quality of the binary image, the execution time of the algorithms was assessed. The evaluation dataset is composed of 32 documents photographed using four widely-used mobile devices with the strobe flash on and off, under several different angles of capture.

References

[1]
Bilal Bataineh, Siti Norul Huda Sheikh Abdullah, and Khairuddin Omar. 2011. An adaptive local binarization method for document images based on a novel thresholding method and dynamic windows. Pattern Recognition Letters 32, 14 (oct 2011), 1805--1813.
[2]
J Bernsen. 1986. Dynamic thresholding of gray-level images. In International Conference on Pattern Recognition. 1251--1255.
[3]
Derek Bradley and Gerhard Roth. 2007. Adaptive Thresholding using the Integral Image. Journal of Graphics Tools 12, 2 (jan 2007), 13--21.
[4]
Jorge Calvo-Zaragoza and Antonio-Javier Gallego. 2019. A selectional auto-encoder approach for document image binarization. Pattern Recognition 86 (feb 2019), 37--47.
[5]
Abdeljalil Gattal, Faycel Abbas, and Mohamed Ridda Laouar. 2018. Automatic Parameter Tuning of K-Means Algorithm for Document Binarization. In Proceedings of the 7th International Conference on Software Engineering and New Technologies - ICSENT 2018. ACM Press, New York, New York, USA, 1--4.
[6]
Zineb Hadjadj, Abdelkrim Meziane, Yazid Cherfa, Mohamed Cheriet, and Insaf Setitra. 2016. ISauvola: Improved Sauvola's Algorithm for Document Image Binarization. Lecture Notes in Computer Science, Vol. 3212. Springer Berlin Heidelberg, Berlin, Heidelberg, 737--745.
[7]
Sheng He and Lambert Schomaker. 2019. DeepOtsu: Document Enhancement and Binarization using Iterative Deep Learning. Pattern Recognition 91 (jan 2019), 379--390.
[8]
Fuxi Jia, Cunzhao Shi, Kun He, Chunheng Wang, and Baihua Xiao. 2018. Degraded document image binarization using structural symmetry of strokes. Pattern Recognition 74 (feb 2018), 225--240.
[9]
Khurram Khurshid, Imran Siddiqi, Claudie Faure, and Nicole Vincent. 2009. Comparison of Niblack inspired binarization methods for ancient documents. In SPIE Proceedings, Kathrin Berkner and Laurence Likforman-Sulem (Eds.). 72470U.
[10]
C.H. Li and P.K.S. Tam. 1998. An iterative algorithm for minimum cross entropy thresholding. Pattern Recognition Letters 19, 8 (1998), 771--776.
[11]
Rafael Dueire Lins, Rodrigo Barros Bernardino, Darlisson Marinho de Jesus, and Jose Mario Oliveira. 2017. Binarizing Document Images Acquired with Portable Cameras. In 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). IEEE, 45--50.
[12]
Rafael Dueire Lins, Ergina Kavallieratou, Elisa Barney Smith, Rodrigo Barros Bernardino, and Darlisson Marinho de Jesus. 2019. ICDAR 2019 Time-Quality Binarization Competition. In 2019 15th IAPR International Conference on Document Analysis and Recognition (ICDAR). 1539--1546.
[13]
Di Lu, Xin Huang, and LiXue Xue Sui. 2018. Binarization of degraded document images based on contrast enhancement. International Journal on Document Analysis and Recognition 21, 1-2 (jun 2018), 123--135.
[14]
Shijian Lu, Bolan Su, and Chew Lim Tan. 2010. Document image binarization using background estimation and stroke edges. International Journal on Document Analysis and Recognition (IJDAR) 13, 4 (dec 2010), 303--314.
[15]
Wan Azani Mustafa and Mohamed Mydin M. Abdul Kader. 2018. Binarization of Document Image Using Optimum Threshold Modification. Journal of Physics: Conference Series 1019, 1 (jun 2018), 012022.
[16]
Nobuyuki Otsu. 1979. A threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man, and Cybernetics 9, 1 (1979), 62--66.
[17]
Judith M. S. Prewitt and Mortimer L. Mendelsohn. 2006. THE ANALYSIS OF CELL IMAGES. Annals of the New York Academy of Sciences 128, 3 (dec 2006), 1035--1053.
[18]
Khairun Saddami, Putri Afrah, Viska Mutiawani, and Fitri Arnia. 2018. A New Adaptive Thresholding Technique for Binarizing Ancient Document. In 2018 Indonesian Association for Pattern Recognition International Conference (INAPR). IEEE, 57--61.
[19]
Khairun Saddami, Khairul Munadi, Yuwaldi Away, and Fitri Arnia. 2019. Effective and fast binarization method for combined degradation on ancient documents. Heliyon (2019).
[20]
Khairun Saddami, Khairul Munadi, Sayed Muchallil, and Fitri Arnia. 2017. Improved Thresholding Method for Enhancing Jawi Binarization Performance. In 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Vol. 1. IEEE, 1108--1113.
[21]
J. Sauvola, M. Pietikäinen, and M Pietikainem. 2000. Adaptive document image binarization. Pattern Recognition 33, 2 (2000), 225--236.
[22]
T. Romen Singh, Sudipta Roy, O. Imocha Singh, Tejmani Sinam, and Kh. Manglem Singh. 2011. A New Local Adaptive Thresholding Technique in Binarization. IJCSI International Journal of Computer Science Issues 08, 6 (dec 2011), 271--277.
[23]
Vavilis Sokratis, Ergina Kavallieratou, Roberto Paredes, and Kostas Sotiropoulos. 2011. A Hybrid Binarization Technique for Document Images. In Studies in Computational Intelligence. 165--179.
[24]
Bolan Su, Shijian Lu, and Chew Lim Tan. 2010. Binarization of historical document images using the local maximum and minimum. In Proceedings of the 8th IAPR International Workshop on Document Analysis Systems - DAS '10. ACM Press, New York, New York, USA, 159--166.
[25]
Wen-Hsiang Tsai. 1985. Moment-preserving thresolding: A new approach. Computer Vision, Graphics, and Image Processing 29, 3 (1985), 377--393.
[26]
Flavio R. Velasco. 1979. Thresholding Using the Isodata Clustering Algorithm. Technical Report. OSD or Non-Service DoD Agency. 14 pages.
[27]
Christian Wolf, Jean-Michel Jolion, and Françoise Chassaing. 2003. Text localization, enhancement and binarization in multimedia documents. In Object recognition supported by user interaction for service robots, Vol. 2. IEEE Comput. Soc, 1037--1040.
[28]
Lichen Zhou, Chuang Zhang, and Ming Wu. 2018. D-linknet: Linknet with pretrained encoder and dilated convolution for high resolution satellite imagery road extraction. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

Cited By

View all
  • (2024)Competition on Binarizing Photographed Document Images 2024 Quality, Time and Space ReportProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3686793(1-12)Online publication date: 20-Aug-2024
  • (2024)Texture-based Document BinarizationProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685663(1-10)Online publication date: 20-Aug-2024
  • (2024)How to Choose a Binarization Algorithm for a Document Image?2024 37th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)10.1109/SIBGRAPI62404.2024.10716338(1-6)Online publication date: 30-Sep-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DocEng '20: Proceedings of the ACM Symposium on Document Engineering 2020
September 2020
130 pages
ISBN:9781450380003
DOI:10.1145/3395027
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 September 2020

Check for updates

Author Tags

  1. Binarization
  2. Documents
  3. Performance evaluation
  4. Quality evaluation

Qualifiers

  • Tutorial
  • Research
  • Refereed limited

Conference

DocEng '20
Sponsor:
DocEng '20: ACM Symposium on Document Engineering 2020
September 29 - October 1, 2020
CA, Virtual Event, USA

Acceptance Rates

Overall Acceptance Rate 194 of 564 submissions, 34%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 19 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Competition on Binarizing Photographed Document Images 2024 Quality, Time and Space ReportProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3686793(1-12)Online publication date: 20-Aug-2024
  • (2024)Texture-based Document BinarizationProceedings of the ACM Symposium on Document Engineering 202410.1145/3685650.3685663(1-10)Online publication date: 20-Aug-2024
  • (2024)How to Choose a Binarization Algorithm for a Document Image?2024 37th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)10.1109/SIBGRAPI62404.2024.10716338(1-6)Online publication date: 30-Sep-2024
  • (2023)A Quality, Size and Time Assessment of the Binarization of Documents Photographed by SmartphonesJournal of Imaging10.3390/jimaging90200419:2(41)Online publication date: 13-Feb-2023
  • (2023)Quality, Space and Time Competition on Binarizing Photographed Document ImagesProceedings of the ACM Symposium on Document Engineering 202310.1145/3573128.3604903(1-10)Online publication date: 22-Aug-2023
  • (2022)Using Paper Texture for Choosing a Suitable Algorithm for Scanned Document Image BinarizationJournal of Imaging10.3390/jimaging81002728:10(272)Online publication date: 5-Oct-2022
  • (2022)Binarization of photographed documents image quality, processing time and size assessmentProceedings of the 22nd ACM Symposium on Document Engineering10.1145/3558100.3564159(1-10)Online publication date: 20-Sep-2022
  • (2022)The Winner Takes It All: Choosing the “best” Binarization Algorithm for Photographed DocumentsDocument Analysis Systems10.1007/978-3-031-06555-2_4(48-64)Online publication date: 18-May-2022
  • (2021)Direct binarization a quality-and-time efficient binarization strategyProceedings of the 21st ACM Symposium on Document Engineering10.1145/3469096.3474932(1-4)Online publication date: 16-Aug-2021
  • (2021)Assessing the Relationship Between Binarization and OCR in the Context of Deep Learning-Based ID Document AnalysisProgress in Artificial Intelligence and Pattern Recognition10.1007/978-3-030-89691-1_14(134-144)Online publication date: 4-Nov-2021
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media