skip to main content
10.1145/1815330.1815351acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

Binarization of historical document images using the local maximum and minimum

Published: 09 June 2010 Publication History

Abstract

This paper presents a new document image binarization technique that segments the text from badly degraded historical document images. The proposed technique makes use of the image contrast that is defined by the local image maximum and minimum. Compared with the image gradient, the image contrast evaluated by the local maximum and minimum has a nice property that it is more tolerant to the uneven illumination and other types of document degradation such as smear. Given a historical document image, the proposed technique first constructs a contrast image and then detects the high contrast image pixels which usually lie around the text stroke boundary. The document text is then segmented by using local thresholds that are estimated from the detected high contrast pixels within a local neighborhood window. The proposed technique has been tested over the dataset that is used in the recent Document Image Binarization Contest (DIBCO) 2009. Experiments show its superior performance.

References

[1]
J. Bernsen. Dynamic thresholding of gray-level images. International Conference on Pattern Recognition, pages 1251--1255, October 1986.
[2]
A. Brink. Thresholding of digital images using two-dimensional entropies. Pattern Recognition, 25(8):803--808, 1992.
[3]
J. Canny. A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 8(6):679--698, January 1986.
[4]
Y. Chen and G. Leedham. Decompose algorithm for thresholding degraded historical document images. IEEE Proceedings on Vision, Image and Signal Processing, 152(6):702--714, December 2005.
[5]
A. Dawoud. Iterative cross section sequence graph for handwritten character segmentation. IEEE Transactions on Image Processing, 16(8):2150--2154, August 2007.
[6]
L. Eikvil, T. Taxt, and K. Moen. A fast adaptive method for binarization of document images. International Conference on Document Analysis and Recognition, pages 435--443, September 1991.
[7]
B. Gatos, K. Ntirogiannis, and I. Pratikakis. ICDAR 2009 document image binarization contest (DIBCO 2009). In International Conference on Document Analysis and Recognition, pages 1375--1382, July 2009.
[8]
B. Gatos, I. Pratikakis, and S. Perantonis. Adaptive degraded document image binarization. Pattern Recognition, 39(3):317--327, 2006.
[9]
I.-K. Kim, D.-W. Jung, and R.-H. Park. Document image binarization based on topographic analysis using a water flow model. Pattern Recognition, 35(1):265--277, 2002.
[10]
J. Kittler and J. Illingworth. On threshold selection using clustering criteria. IEEE transactions on systems, man, and cybernetics, 15:652--655, 1985.
[11]
G. Leedham, C. Yan, K. Takru, J. H. N. Tan, and L. Mian. Comparison of some thresholding algorithms for text/background segmentation in difficult document images. International Conference on Document Analysis and Recognition, pages 859--864, 2003.
[12]
Y. Liu and S. Srihari. Document image binarization based on texture features. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(5):540--544, May 1997.
[13]
S. J. Lu and C. L. Tan. Binarization of badly illuminated document images through shading estimation and compensation. International Conference on Document Analysis and Recognition, pages 312--316, 2007.
[14]
W. Niblack. An Introduction to Digital Image Processing. Prentice-Hall, Englewood Cliffs, New Jersey, 1986.
[15]
N. Otsu. A threshold selection method from gray level histogram. IEEE Transactions on System, Man, Cybernetics, 19(1):62--66, January 1978.
[16]
J. Parker, C. Jennings, and A. Salkauskas. Thresholding using an illumination model. International Conference on Document Analysis and Recognition, pages 270--273, October 1993.
[17]
J. Sauvola and M. Pietikainen. Adaptive document image binarization. Pattern Recognition, 33(2):225--236, 2000.
[18]
O. Trier and T. Taxt. Evaluation of binarization methods for document images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(3):312--315, March 1995.
[19]
M. van Herk. A fast algorithm for local minimum and maximum filters on rectangular and octagonal kernels. Pattern Recognition Letters, 13(7):517--521, 1992.
[20]
J.-D. Yang, Y.-S. Chen, and W.-H. Hsu. Adaptive thresholding algorithm and its hardware implementation. Pattern Recognition Letters, 15(2):141--150, 1994.

Cited By

View all
  • (2025)BA-GAN: A Boundary-Aware Generative Adversarial Network for Document RestorationIntelligent Systems and Pattern Recognition10.1007/978-3-031-82153-0_26(344-358)Online publication date: 5-Mar-2025
  • (2024)A Review of Document Binarization: Main Techniques, New Challenges, and TrendsElectronics10.3390/electronics1307139413:7(1394)Online publication date: 7-Apr-2024
  • (2024)Comparative study: enhancing legibility of ancient Indian script images from diverse stone background structures using 34 different pre-processing methodsHeritage Science10.1186/s40494-024-01169-612:1Online publication date: 20-Feb-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
June 2010
490 pages
ISBN:9781605587738
DOI:10.1145/1815330
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. document image analysis
  2. document image binarization
  3. image contrast
  4. image pixel classification

Qualifiers

  • Research-article

Conference

DAS '10

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)34
  • Downloads (Last 6 weeks)1
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)BA-GAN: A Boundary-Aware Generative Adversarial Network for Document RestorationIntelligent Systems and Pattern Recognition10.1007/978-3-031-82153-0_26(344-358)Online publication date: 5-Mar-2025
  • (2024)A Review of Document Binarization: Main Techniques, New Challenges, and TrendsElectronics10.3390/electronics1307139413:7(1394)Online publication date: 7-Apr-2024
  • (2024)Comparative study: enhancing legibility of ancient Indian script images from diverse stone background structures using 34 different pre-processing methodsHeritage Science10.1186/s40494-024-01169-612:1Online publication date: 20-Feb-2024
  • (2024)Performance of Binarization Algorithms on Tamizhi Inscription Images: An AnalysisACM Transactions on Asian and Low-Resource Language Information Processing10.1145/365658323:5(1-29)Online publication date: 10-May-2024
  • (2024)Applicability of OCR Engines for Text Recognition in Vehicle Number Plates, Receipts and HandwritingJournal of Circuits, Systems and Computers10.1142/S021812662350321832:18Online publication date: 2-Feb-2024
  • (2024)Binarization via Local Gradient-like Differences in four Directions2024 International Conference Automatics and Informatics (ICAI)10.1109/ICAI63388.2024.10851623(414-419)Online publication date: 10-Oct-2024
  • (2024)WAS: Dataset and Methods for Artistic Text SegmentationComputer Vision – ECCV 202410.1007/978-3-031-73001-6_14(237-254)Online publication date: 27-Nov-2024
  • (2024)EAFormer: Scene Text Segmentation with Edge-Aware TransformersComputer Vision – ECCV 202410.1007/978-3-031-72698-9_24(410-427)Online publication date: 26-Oct-2024
  • (2024)SAGHOG: Self-supervised Autoencoder for Generating HOG Features for Writer RetrievalDocument Analysis and Recognition - ICDAR 202410.1007/978-3-031-70536-6_8(121-138)Online publication date: 3-Sep-2024
  • (2023)TransDocUNet: A Transformer-based UNet Architecture for Degraded Document Image BinarizationProceedings of the Fourteenth Indian Conference on Computer Vision, Graphics and Image Processing10.1145/3627631.3627639(1-9)Online publication date: 15-Dec-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media