Abstract
In this paper we introduce the use of integral histograms (IH) for document analysis. IH take advantage of the great increase of the memory size available on computers over time. By storing selected histogram features into each pixel position, several image filters can be calculated within constant complexity. In other words, time complexity is remarkably reduced by using more memory. While IH received much attention in the computer vision field, they have not been intensively investigated for document analysis so far. As a first step into this direction, we analyze IH for the toy problem of image binarization which is a prerequisite for many graphics and text recognition systems. The results of our participation in the HDIBCO2010 competition as well as our experiments with all DIBCO datasets show the capabilities of this novel method for Document Image analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
As made available http://liris.cnrs.fr/christian.wolf/software/binarize/
- 2.
A binary version of our benchmarking program http://nicolaou.homouniversalis.org/demos/2013/05/15/Binarization-Demo.html
References
Porikli, F.: Integral histogram: a fast way to extract histograms in cartesian spaces. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 1, pp. 829–836. IEEE (2005)
Garz, A., Fischer, A., Sablatnig, R., Bunke, H.: Binarization-free text line segmentation for historical documents based on interest point clustering. In: 2012 10th IAPR International Workshop on Document Analysis Systems (DAS), pp. 95–99. IEEE (2012)
Lopresti, D., Nagy, G.: When is a problem solved? In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 32–36. IEEE (2011)
Pratikakis, I., Gatos, B., Ntirogiannis, K.: Icdar 2011 document image binarization contest (dibco 2011). In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 1506–1510. IEEE (2011)
Pratikakis, I., Gatos, B., Ntirogiannis, K.: Icfhr 2012 competition on handwritten document image binarization (h-dibco 2012). In: 2012 International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 817–822. IEEE (2012)
Crow, F.C.: Summed-area tables for texture mapping. In: ACM SIGGRAPH Computer Graphics, vol. 18, pp. 207–212. ACM (1984)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, vol. 1, pp. I-511–I-518. IEEE (2001)
Shafait, F., Keysers, D., Breuel, T.M.: Efficient implementation of local adaptive thresholding techniques using integral images. Doc. Recogn. Retrieval XV 6815(1), 681510 (2008)
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006)
Porikli, F.: Constant time o (1) bilateral filtering. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008)
Kass, M., Solomon, J.: Smoothed local histogram filters. ACM Trans. Graph. (TOG) 29(4), 100 (2010)
Zhang, K., Lafruit, G., Lauwereins, R., Van Gool, L.: Joint integral histograms and its application in stereo matching. In: 2010 17th IEEE International Conference on Image Processing (ICIP), pp. 817–820. IEEE (2010)
Konya, I., Seibert, C., Eickeler, S., Glahn, S.: Constant-time locally optimal adaptive binarization. In: 10th International Conference on Document Analysis and Recognition, ICDAR’09, pp. 738–742. IEEE (2009)
Niblack, W.: An Introduction to Digital Image Processing. Prentice Hall, Englewood Cliffs (1986)
Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33(2), 225–236 (2000)
Ntirogiannis, K., Gatos, B., Pratikakis, I.: A combined approach for the binarization of handwritten document images. Pattern Recognit. Lett. 33(12), 1601–1613 (2012)
Su, B., Lu, S., Tan, C.L.: Combination of document image binarization techniques. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 22–26. IEEE (2011)
Badekas, E., Papamarkos, N.: Optimal combination of document binarization techniques using a self-organizing map neural network. Eng. Appl. Artif. Intell. 20(1), 11–24 (2007)
Otsu, N.: A threshold selection method from gray-level histograms. Automatica 11(285–296), 23–27 (1975)
Gatos, B., Ntirogiannis, K., Pratikakis, I.: Icdar 2009 document image binarization contest (dibco 2009). In: 10th International Conference on Document Analysis and Recognition, ICDAR’09, pp. 1375–1382. IEEE (2009)
Pratikakis, I., Gatos, B., Ntirogiannis, K.: H-dibco 2010-handwritten document image binarization competition. In: 2010 International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 727–732. IEEE (2010)
Wolf, C., Jolion, J.M., Chassaing, F.: Text localization, enhancement and binarization in multimedia documents. In: Proceedings of the International Conference on Pattern Recognition, vol. 2, pp. 1037–1040 (2002)
Acknowledgment
This work has been supported by the Swiss National Science Foundation with the HisDoc 2.0 project 205120 150173.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nicolaou, A., Ingold, R., Liwicki, M. (2014). Binarization with the Local Otsu Filter. In: Lamiroy, B., Ogier, JM. (eds) Graphics Recognition. Current Trends and Challenges. GREC 2013. Lecture Notes in Computer Science(), vol 8746. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44854-0_14
Download citation
DOI: https://doi.org/10.1007/978-3-662-44854-0_14
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44853-3
Online ISBN: 978-3-662-44854-0
eBook Packages: Computer ScienceComputer Science (R0)