Skip to main content

KFBin: Kalman Filter-Based Approach for Document Image Binarization

  • Conference paper
  • First Online:
Image Analysis and Recognition (ICIAR 2019)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11662))

Included in the following conference series:

  • 1173 Accesses

Abstract

In this paper, we propose a novel two-step approach, called KFBin, for the binarization of document images based on the Kalman filtering (KF) technique. In the first step, a state space model is developed as a new document image representation, and then the Kalman filter is applied to track the positions of the foreground and background information and generate two corresponding outputs, which allows the enhancement of the foreground content leading to better legibility of text. Standard thresholding algorithms were used in the second step to generate binary images from the enhanced foreground components. The performance of the proposed approach is validated on a well-known dataset and evaluated using common image binarization quality metrics. Outstanding improvement of the binarization performances of several state-of-the-art binarization methods has been achieved by using the proposed approach. Experimental results point that the poor binarization results of egraded document images can be greatly improved by enhancing their quality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Azimi-Sadjadi, M.R., Bannour, S.: Two-dimensional adaptive block Kalman filtering of SAR imagery. IEEE Trans. Geosci. Remote Sens. 29(5), 742–753 (1991)

    Article  Google Scholar 

  2. Calvo-Zaragoza, J., Vigliensoni, G., Fujinaga, I.: Pixel-wise binarization of musical documents with convolutional neural networks. In: 2017 Fifteenth IAPR International Conference on Machine Vision Applications (MVA), pp. 362–365. IEEE (2017)

    Google Scholar 

  3. Chang, C.-I.: Discrete-time Kalman filtering for hyperspectral processing. In: Real-Time Recursive Hyperspectral Sample and Band Processing, pp. 49–71. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-45171-8_3

    Chapter  Google Scholar 

  4. Cheriet, M., Moghaddam, R.F., Hedjam, R.: A learning framework for the optimization and automation of document binarization methods. Comput. Vis. Image Underst. 117(3), 269–280 (2013)

    Article  Google Scholar 

  5. Cuevas, E.V., Zaldivar, D., Rojas, R.: Kalman filter for vision tracking (2005)

    Google Scholar 

  6. Gatos, B., Ntirogiannis, K., Pratikakis, I.: ICDAR 2009 document image binarization contest (DIBCO 2009). In: 10th International Conference on Document Analysis and Recognition, ICDAR 2009, pp. 1375–1382. IEEE (2009)

    Google Scholar 

  7. He, S., Schomaker, L.: DeepOtsu: document enhancement and binarization using iterative deep learning. Pattern Recognit. 91, 379–390 (2019)

    Article  Google Scholar 

  8. Howe, N.R.: Document binarization with automatic parameter tuning. Int. J. Doc. Anal. Recognit. (IJDAR) 16(3), 247–258 (2013)

    Article  Google Scholar 

  9. Jia, F., Shi, C., He, K., Wang, C., Xiao, B.: Degraded document image binarization using structural symmetry of strokes. Pattern Recognit. 74, 225–240 (2018)

    Article  Google Scholar 

  10. Kalman, R.E.: A new approach to linear filtering and prediction problems. J. Basic Eng. 82(1), 35–45 (1960)

    Article  Google Scholar 

  11. Lu, G., Ouyang, W., Xu, D., Zhang, X., Gao, Z., Sun, M.-T.: Deep Kalman filtering network for video compression artifact reduction. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 591–608. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9_35

    Chapter  Google Scholar 

  12. Lu, H., Kot, A.C., Shi, Y.Q.: Distance-reciprocal distortion measure for binary document images. IEEE Signal Process. Lett. 11(2), 228–231 (2004)

    Article  Google Scholar 

  13. Moghaddam, R.F., Cheriet, M.: A multi-scale framework for adaptive binarization of degraded document images. Pattern Recognit. 43(6), 2186–2198 (2010)

    Article  Google Scholar 

  14. Moghaddam, R.F., Cheriet, M.: Adotsu: An adaptive and parameterless generalization of Otsu’s method for document image binarization. Pattern Recognit. 45(6), 2419–2431 (2012)

    Article  Google Scholar 

  15. Nafchi, H.Z., Moghaddam, R.F., Cheriet, M.: Application of phase-based features and denoising in postprocessing and binarization of historical document images. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 220–224. IEEE (2013)

    Google Scholar 

  16. Niblack, W.: An introduction to digital image processing (1986)

    Google Scholar 

  17. Ohta, Y.I., Kanade, T., Sakai, T.: Color information for region segmentation. Comput. Graph. Image Process. 13(3), 222–241 (1980)

    Article  Google Scholar 

  18. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)

    Article  Google Scholar 

  19. Pan, J., Yang, X., Cai, H., Mu, B.: Image noise smoothing using a modified Kalman filter. Neurocomputing 173, 1625–1629 (2016)

    Article  Google Scholar 

  20. Pratikakis, I., Zagoris, K., Barlas, G., Gatos, B.: ICDAR 2017 competition on document image binarization (DIBCO 2017). In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 01, pp. 1395–1403, November 2017

    Google Scholar 

  21. Pratikakis, I., Gatos, B., Ntirogiannis, K.: ICDAR 2013 document image binarization contest (DIBCO 2013). In: 2013 12th International Conference on Document Analysis and Recognition, pp. 1471–1476. IEEE (2013)

    Google Scholar 

  22. Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recognit. 33(2), 225–236 (2000)

    Article  Google Scholar 

  23. Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Inf. Process. Manage. 45(4), 427–437 (2009)

    Article  Google Scholar 

  24. Su, B., Lu, S., Tan, C.L.: Binarization of historical document images using the local maximum and minimum. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp. 159–166. ACM (2010)

    Google Scholar 

  25. Su, B., Lu, S., Tan, C.L.: Combination of document image binarization techniques. In: 2011 International Conference on Document Analysis and Recognition, pp. 22–26. IEEE (2011)

    Google Scholar 

  26. Tensmeyer, C., Martinez, T.: Document image binarization with fully convolutional neural networks. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 99–104. IEEE (2017)

    Google Scholar 

  27. Tonazzini, A.: Color space transformations for analysis and enhancement of ancient degraded manuscripts. Pattern Recognit. Image Anal. 20(3), 404–417 (2010)

    Article  Google Scholar 

  28. Vo, Q.N., Kim, S.H., Yang, H.J., Lee, G.: Binarization of degraded document images based on hierarchical deep supervised network. Pattern Recognit. 74, 568–586 (2018)

    Article  Google Scholar 

  29. Young, D.P., Ferryman, J.M.: Pets metrics: on-line performance evaluation service. In: 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pp. 317–324. IEEE (2005)

    Google Scholar 

  30. Zhang, L., Cichocki, A.: Blind deconvolution of dynamical systems: a state space approach. J. Signal Process. 4(2), 111–130 (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abderrahmane Rahiche .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rahiche, A., Cheriet, M. (2019). KFBin: Kalman Filter-Based Approach for Document Image Binarization. In: Karray, F., Campilho, A., Yu, A. (eds) Image Analysis and Recognition. ICIAR 2019. Lecture Notes in Computer Science(), vol 11662. Springer, Cham. https://doi.org/10.1007/978-3-030-27202-9_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-27202-9_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-27201-2

  • Online ISBN: 978-3-030-27202-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics