Skip to main content
Log in

Document image binarization using background estimation and stroke edges

  • Original Paper
  • Published:
International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Abstract

Document images often suffer from different types of degradation that renders the document image binarization a challenging task. This paper presents a document image binarization technique that segments the text from badly degraded document images accurately. The proposed technique is based on the observations that the text documents usually have a document background of the uniform color and texture and the document text within it has a different intensity level compared with the surrounding document background. Given a document image, the proposed technique first estimates a document background surface through an iterative polynomial smoothing procedure. Different types of document degradation are then compensated by using the estimated document background surface. The text stroke edge is further detected from the compensated document image by using L1-norm image gradient. Finally, the document text is segmented by a local threshold that is estimated based on the detected text stroke edges. The proposed technique was submitted to the recent document image binarization contest (DIBCO) held under the framework of ICDAR 2009 and has achieved the top performance among 43 algorithms that are submitted from 35 international research groups.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Trier O., Taxt T.: Evaluation of binarization methods for document images. IEEE Trans. Pattern Anal. Mach. Intell 17, 312–315 (1995)

    Article  Google Scholar 

  2. Leedham G., Yan C., Takru K., Tan J.H.N., Mian L.: Comparison of some thresholding algorithms for text/background segmentation in difficult document images. Int Conf Doc Anal. Recogn. 2, 859–864 (2003)

    Google Scholar 

  3. Otsu N.: A threshold selection method from gray level histogram. IEEE Trans. Syst. Man Cybern. 19, 62–66 (1978)

    Google Scholar 

  4. Brink A.: Thresholding of digital images using two-dimensional entropies. Pattern Recogn. 25(8), 803–808 (1992)

    Article  Google Scholar 

  5. Kittler J., Illingworth J.: On threshold selection using clustering criteria. IEEE Trans. syst. Man Cybern. 15, 652–655 (1985)

    Google Scholar 

  6. Solihin Y., Leedham C.: Integral ratio: a new class of global thresholding techniques for handwriting images. IEEE Trans. Pattern Anal. Mach. Intell. 21, 761–768 (1999)

    Article  Google Scholar 

  7. Kim I.-K., Jung D.-W., Park R.-H.: Document image binarization based on topographic analysis using a water flow model. Pattern Recogn. 35, 141–150 (2002)

    Google Scholar 

  8. Yang J., Chen Y., Hsu W.: Adaptive thresholding algorithm and its hardware implementation. Pattern Recogn. Lett. 15(2), 141–150 (1994)

    Article  MATH  Google Scholar 

  9. Parker, J., Jennings, C., Salkauskas, A.: Thresholding using an illumination model. International Conference on Document Analysis and Recognition, pp. 270–273. September 1993

  10. Eikvil, L., Taxt, T., Moen, K.: A fast adaptive method for binarization of document images. International Conference on Document Analysis and Recognition, pp. 435–443, September 1991

  11. Niblack W.: An Introduction to Digital Image Processing. Prentice-Hall, Englewood Cliffs (1986)

    Google Scholar 

  12. Sauvola J., Pietikainen M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2000)

    Article  Google Scholar 

  13. Gatos B., Pratikakis I., Perantonis S.: Adaptive degraded document image binarization. Pattern Recogn. 39, 317–327 (2006)

    Article  MATH  Google Scholar 

  14. Moghaddam R.F., Cheriet M.: Rsldi: restoration of single-sided low-quality document images. Pattern Recogn. 42, 3355–3364 (2009)

    Article  MATH  Google Scholar 

  15. Moghaddam, R.F., Cheriet, M.: Application of multi-level classifiers and clustering for automatic word-spotting in historical document images. International Conference on Document Analysis and Recognition, pp. 511–515. July 2009

  16. Chen Q., Sun Q., Heng P.A., Xia D.: A double-threshold image binarization method based on edge detector. Pattern Recogn. 41, 1254–1267 (2008)

    Article  Google Scholar 

  17. Su, B., Lu, S., Tan, C.L.: Binarization of historical handwritten document images using local maximum and minimum filter. International Workshop on Document Analysis Systems, pp. 159–165. June 2010

  18. Dawoud A.: Iterative cross section sequence graph for handwritten character segmentation. IEEE Trans. Image Process. 16, 2150–2154 (2007)

    Article  MathSciNet  Google Scholar 

  19. Gatos, B., Ntirogiannis, K., Pratikakis, I.: Icdar 2009 document image binarization contest (dibco 2009). International Conference on Document Analysis and Recognition, pp. 1375–1382. July 2009

  20. Lu S., Tan C.L.: Binarization of badly illuminated document images through shading estimation and compensation. Int. Conf. Doc. Anal. Recogn. 1, 312–316 (2007)

    Google Scholar 

  21. Hamming R.W.: Digital Filter. Prentice-Hall, Englewood Cliffs (1983)

    Google Scholar 

  22. Krzysztof, M.P.M., Axel, M.: Dynamic threshold using polynomial surface regression with application to the binarization of fingerprints. Proceedings of the SPIE, vol. 5779

  23. Seeger, M., Dance, C.: Binarising camera images for ocr. Proceedings of International Conference on Document Analysis and Recognition, pp. 54–58 (2001)

  24. Lu, S., Tan, C.L.: Thresholding of badly illuminated document images through photometric correction. ACM symposium on Document engineering pp. 3–8. 2007

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shijian Lu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lu, S., Su, B. & Tan, C.L. Document image binarization using background estimation and stroke edges. IJDAR 13, 303–314 (2010). https://doi.org/10.1007/s10032-010-0130-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10032-010-0130-8

Keywords

Navigation