Methods of Natural Image Preprocessing Supporting the Automatic Text Recognition Using the OCR Algorithms

Lech, Piotr; Okarma, Krzysztof

doi:10.1007/978-3-319-23814-2_17

Piotr Lech^15,16 &
Krzysztof Okarma^15,16

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 389))

797 Accesses

Abstract

Reading text from natural images is much more difficult than from scanned text documents since the text may appear in all colors, different sizes and types, often with distorted geometry or textures applied. The paper presents the idea of high-speed image preprocessing algorithms utilizing the quasi-local histogram based methods such as binarization, ROI filtering, line and corners detection, etc. which can be helpful for this task. Their low computational cost is provided by a reduction of the amount of processed information carried out by means of a simple random sampling. The approach presented in the paper allows to minimize some problems with the implementation of the OCR algorithms operating on natural images on devices with low computing power (e.g. mobile or embedded). Due to relatively small computational effort it is possible to test multiple hypotheses e.g. related to the possible location of the text in the image. Their verification can be based on the analysis of images in various color spaces. An additional advantage of the discussed algorithms is their construction allowing an efficient parallel implementation further reducing the computation time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Automated Text Detection and Character Recognition in Natural Scenes Based on Local Image Features and Contour Processing Techniques

Optimized Scene Text Detector and Paddle Optical Character Recognizer Techniques to Extract Text from Images

An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents

References

International Telecommunication Union recommendation BT.709-5—parameter values for the HDTV standards for production and international programme exchange (2001)
Google Scholar
International Telecommunication Union recommendation BT.601-7—studio encoding parameters of digital television for standard 4:3 and wide-screen 16:9 aspect ratios (2011)
Google Scholar
Bissacco, A., Cummins, M., Netzer, Y., Neven, H.: PhotoOCR: Reading text in uncontrolled conditions. In: Proceedings of IEEE International Conference on Computer Vision (ICCV), pp. 785–792 (2013)
Google Scholar
de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the International Conference on Computer Vision Theory and Applications (2009)
Google Scholar
Chen, H., Tsai, S., Schroth, G., Chen, D., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions. In: Proceedings of the 18th IEEE International Conference on Image Processing (ICIP), pp. 2609–2612 (2011)
Google Scholar
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2963–2970 (2010)
Google Scholar
Forczmański, P., Frejlichowski, D.: Robust stamps detection and classification by means of general shape analysis. In: Bolc, L., Tadeusiewicz, R., Chmielewski, L., Wojciechowski, K. (eds.) Computer Vision and Graphics. Lecture Notes in Computer Science, vol. 6374, pp. 360–367. Springer, Berlin (2010)
Google Scholar
Gooch, A.A., Olsen, S.C., Tumblin, J., Gooch, B.: Color2Gray: salience-preserving color removal. ACM Trans. Graph. 24(3), 634–639 (2005)
Article Google Scholar
Grundland, M., Dodgson, N.A.: Decolorize: fast, contrast enhancing, color to grayscale conversion. Pattern Recogn. 40(11), 2891–2896 (2007)
Article Google Scholar
Ikica, A., Peer, P.: Swt voting-based color reduction for text detection in natural scene images. EURASIP J. Adv. Sig. Process. 2013(1), Article ID 95 (2013)
Google Scholar
Kapur, J., Sahoo, P., Wong, A.: A new method for gray-level picture thresholding using the entropy of the histogram. Comput. Vis. Graph. Image Process. 29(3), 273–285 (1985)
Article Google Scholar
Milyaev, S., Barinova, O., Novikova, T., Kohli, P., Lempitsky, V.: Image binarization for end-to-end text understanding in natural images. In: Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 128–132 (2013)
Google Scholar
Nagy, R., Dicker, A., Meyer-Wegener, K.: NEOCR: A configurable dataset for natural image text recognition. In: Iwamura, M., Shafait, F. (eds.) Camera-Based Document Analysis and Recognition. Lecture Notes in Computer Science, vol. 7139, pp. 150–163. Springer, Berlin (2012)
Chapter Google Scholar
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)
Article Google Scholar
Roubtsova, N.S., Wijnhoven, R.G.J., de With, P.H.N.: Integrated text detection and recognition in natural images. In: Image Processing: Algorithms and Systems X and Parallel Processing for Imaging Applications II. Proceedings of SPIE, vol. 8295, pp. 829507–829521 (2012)
Google Scholar
Smith, R.: An overview of the Tesseract OCR engine. In: Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR), vol. 2, pp. 629–633 (2007)
Google Scholar
Su, B., Lu, S., Tian, S., Lim, J.H., Tan, C.L.: Character recognition in natural scenes using convolutional co-occurrence HOG. In: Proceedings of 22nd International Conference on Pattern Recognition (ICPR), pp. 2926–2931 (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

West Pomeranian University of Technology, Szczecin, Poland
Piotr Lech & Krzysztof Okarma
Faculty of Electrical Engineering, Department of Signal Processing and Multimedia Engineering, 26. Kwietnia 10, 71-126, Szczecin, Poland
Piotr Lech & Krzysztof Okarma

Authors

Piotr Lech
View author publications
You can also search for this author in PubMed Google Scholar
Krzysztof Okarma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Piotr Lech .

Editor information

Editors and Affiliations

Bydgoszcz, Poland
Ryszard S. Choraś

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lech, P., Okarma, K. (2016). Methods of Natural Image Preprocessing Supporting the Automatic Text Recognition Using the OCR Algorithms. In: Choraś, R. (eds) Image Processing and Communications Challenges 7. Advances in Intelligent Systems and Computing, vol 389. Springer, Cham. https://doi.org/10.1007/978-3-319-23814-2_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-23814-2_17
Published: 15 October 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23813-5
Online ISBN: 978-3-319-23814-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics