Abstract
In this paper, we propose a novel two-step approach, called KFBin, for the binarization of document images based on the Kalman filtering (KF) technique. In the first step, a state space model is developed as a new document image representation, and then the Kalman filter is applied to track the positions of the foreground and background information and generate two corresponding outputs, which allows the enhancement of the foreground content leading to better legibility of text. Standard thresholding algorithms were used in the second step to generate binary images from the enhanced foreground components. The performance of the proposed approach is validated on a well-known dataset and evaluated using common image binarization quality metrics. Outstanding improvement of the binarization performances of several state-of-the-art binarization methods has been achieved by using the proposed approach. Experimental results point that the poor binarization results of egraded document images can be greatly improved by enhancing their quality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Azimi-Sadjadi, M.R., Bannour, S.: Two-dimensional adaptive block Kalman filtering of SAR imagery. IEEE Trans. Geosci. Remote Sens. 29(5), 742–753 (1991)
Calvo-Zaragoza, J., Vigliensoni, G., Fujinaga, I.: Pixel-wise binarization of musical documents with convolutional neural networks. In: 2017 Fifteenth IAPR International Conference on Machine Vision Applications (MVA), pp. 362–365. IEEE (2017)
Chang, C.-I.: Discrete-time Kalman filtering for hyperspectral processing. In: Real-Time Recursive Hyperspectral Sample and Band Processing, pp. 49–71. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-45171-8_3
Cheriet, M., Moghaddam, R.F., Hedjam, R.: A learning framework for the optimization and automation of document binarization methods. Comput. Vis. Image Underst. 117(3), 269–280 (2013)
Cuevas, E.V., Zaldivar, D., Rojas, R.: Kalman filter for vision tracking (2005)
Gatos, B., Ntirogiannis, K., Pratikakis, I.: ICDAR 2009 document image binarization contest (DIBCO 2009). In: 10th International Conference on Document Analysis and Recognition, ICDAR 2009, pp. 1375–1382. IEEE (2009)
He, S., Schomaker, L.: DeepOtsu: document enhancement and binarization using iterative deep learning. Pattern Recognit. 91, 379–390 (2019)
Howe, N.R.: Document binarization with automatic parameter tuning. Int. J. Doc. Anal. Recognit. (IJDAR) 16(3), 247–258 (2013)
Jia, F., Shi, C., He, K., Wang, C., Xiao, B.: Degraded document image binarization using structural symmetry of strokes. Pattern Recognit. 74, 225–240 (2018)
Kalman, R.E.: A new approach to linear filtering and prediction problems. J. Basic Eng. 82(1), 35–45 (1960)
Lu, G., Ouyang, W., Xu, D., Zhang, X., Gao, Z., Sun, M.-T.: Deep Kalman filtering network for video compression artifact reduction. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 591–608. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9_35
Lu, H., Kot, A.C., Shi, Y.Q.: Distance-reciprocal distortion measure for binary document images. IEEE Signal Process. Lett. 11(2), 228–231 (2004)
Moghaddam, R.F., Cheriet, M.: A multi-scale framework for adaptive binarization of degraded document images. Pattern Recognit. 43(6), 2186–2198 (2010)
Moghaddam, R.F., Cheriet, M.: Adotsu: An adaptive and parameterless generalization of Otsu’s method for document image binarization. Pattern Recognit. 45(6), 2419–2431 (2012)
Nafchi, H.Z., Moghaddam, R.F., Cheriet, M.: Application of phase-based features and denoising in postprocessing and binarization of historical document images. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 220–224. IEEE (2013)
Niblack, W.: An introduction to digital image processing (1986)
Ohta, Y.I., Kanade, T., Sakai, T.: Color information for region segmentation. Comput. Graph. Image Process. 13(3), 222–241 (1980)
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)
Pan, J., Yang, X., Cai, H., Mu, B.: Image noise smoothing using a modified Kalman filter. Neurocomputing 173, 1625–1629 (2016)
Pratikakis, I., Zagoris, K., Barlas, G., Gatos, B.: ICDAR 2017 competition on document image binarization (DIBCO 2017). In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 01, pp. 1395–1403, November 2017
Pratikakis, I., Gatos, B., Ntirogiannis, K.: ICDAR 2013 document image binarization contest (DIBCO 2013). In: 2013 12th International Conference on Document Analysis and Recognition, pp. 1471–1476. IEEE (2013)
Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recognit. 33(2), 225–236 (2000)
Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Inf. Process. Manage. 45(4), 427–437 (2009)
Su, B., Lu, S., Tan, C.L.: Binarization of historical document images using the local maximum and minimum. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp. 159–166. ACM (2010)
Su, B., Lu, S., Tan, C.L.: Combination of document image binarization techniques. In: 2011 International Conference on Document Analysis and Recognition, pp. 22–26. IEEE (2011)
Tensmeyer, C., Martinez, T.: Document image binarization with fully convolutional neural networks. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 99–104. IEEE (2017)
Tonazzini, A.: Color space transformations for analysis and enhancement of ancient degraded manuscripts. Pattern Recognit. Image Anal. 20(3), 404–417 (2010)
Vo, Q.N., Kim, S.H., Yang, H.J., Lee, G.: Binarization of degraded document images based on hierarchical deep supervised network. Pattern Recognit. 74, 568–586 (2018)
Young, D.P., Ferryman, J.M.: Pets metrics: on-line performance evaluation service. In: 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pp. 317–324. IEEE (2005)
Zhang, L., Cichocki, A.: Blind deconvolution of dynamical systems: a state space approach. J. Signal Process. 4(2), 111–130 (2000)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Rahiche, A., Cheriet, M. (2019). KFBin: Kalman Filter-Based Approach for Document Image Binarization. In: Karray, F., Campilho, A., Yu, A. (eds) Image Analysis and Recognition. ICIAR 2019. Lecture Notes in Computer Science(), vol 11662. Springer, Cham. https://doi.org/10.1007/978-3-030-27202-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-27202-9_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-27201-2
Online ISBN: 978-3-030-27202-9
eBook Packages: Computer ScienceComputer Science (R0)