Skip to main content

Text Particles Multi-band Fusion for Robust Text Detection

  • Conference paper
Image Analysis and Recognition (ICIAR 2008)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5112))

Included in the following conference series:

Abstract

Texts in images and videos usually carry important information for visual content understanding and retrieval. Two main restrictions exist in the state-of-the-art text detection algorithms: weak contrast and text-background variance. This paper presents a robust text detection method based on text particles (TP) multi-band fusion to solve there problems. Firstly, text particles are generated by their local binary pattern of pyramid Haar wavelet coefficients in YUV color space. It preserves and uniforms text-background contrasts while extracting multi-band information. Secondly, the candidate text regions are generated via density-based text particle multi-band fusion, and the LHBP histogram analysis is utilized to remove non-text regions. Our TP-based detection framework can robustly locate text regions regardless of diversity sizes, colors, rotations, illuminations and text-background contrasts. Experiment results on ICDAR 03 over the existing methods demonstrate the robustness and effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. El Rube, I., Ahmed, M., Kamel, M.: Wavelet approximation-based affine invariant shape representation functions. IEEE Transactions on Pattern Analysis and Machine Intelligence 28(2), 323–327 (2006)

    Article  Google Scholar 

  2. Chen, D.T., Bourland, H., Thiran, J.P.: Text identification in complex background using SVM. In: International Conference on Computer Vision and Pattern Recognition, pp. 621–626 (2001)

    Google Scholar 

  3. Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Transactions on Circuits and Systems for Video Technology 12, 256–268 (2002)

    Article  Google Scholar 

  4. Ezaki, N., Bulacu, M., Schomaker, L.: Text Detection from Natural Scene Images: Towards a System for Visually Impaired Persons. In: International Conference on Pattern Recognition, vol. 2, pp. 683–686 (2004)

    Google Scholar 

  5. Yi, J., Peng, Y., Xiao, J.: Color-based Clustering for Text Detection and Extraction in Image. In: ACM Conference on Multimedia, pp. 847–850 (2007)

    Google Scholar 

  6. Gllavata, J., Ewerth, R., Freisleben, B.: Text detection in images based on unsupervised classification of high frequency wavelet coefficients. In: International Conference on Pattern Recognition, pp. 425–428 (2004)

    Google Scholar 

  7. Ye, Q.X., Huang, Q.M.: A New Text Detection Algorithm in Image/Video Frames. In: Advances in Multimedia Information Processing 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30-December 3, 2004, pp. 858–865 (2004)

    Google Scholar 

  8. Ji, R.R., Xu, P.F., Yao, H.X., Sun, X.S., Liu, T.Q.: Directional Correlation Analysis of Local Haar Binary Pattern for Text Detection. In: IEEE International Conference on Multimedia & Expo (accept, 2008)

    Google Scholar 

  9. Xi, D., Kamel, M.: Extraction of filled in strokes from cheque image using pseudo 2D wavelet with adjustable support. In: IEEE International Conference on Image Processing, vol. 2, pp. 11–14 (2005)

    Google Scholar 

  10. Ojala, T., Pietikäinen, M., Harwood, D.: A Comparative Study of Texture Measures with Classification Based on Feature Distributions. Pattern Recognition 29(1), 51–59 (1996)

    Article  Google Scholar 

  11. Ojala, T., Pietikäinen, M., Mäenpäa, T.: Multi-resolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(7), 971–987 (2002)

    Article  Google Scholar 

  12. Li, S., Chu, R., Liao, S., Zhang, L.: Illumination Invariant Face Recognition Using Near-Infrared Images. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(4), 627–639 (2007)

    Article  Google Scholar 

  13. Zhao, G., Pietikäinen, M.: Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(6), 915–928 (2007)

    Article  Google Scholar 

  14. Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 682–687 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Aurélio Campilho Mohamed Kamel

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xu, P., Ji, R., Yao, H., Sun, X., Liu, T., Liu, X. (2008). Text Particles Multi-band Fusion for Robust Text Detection. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2008. Lecture Notes in Computer Science, vol 5112. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69812-8_58

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-69812-8_58

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-69811-1

  • Online ISBN: 978-3-540-69812-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics