Abstract
Texts in images and videos usually carry important information for visual content understanding and retrieval. Two main restrictions exist in the state-of-the-art text detection algorithms: weak contrast and text-background variance. This paper presents a robust text detection method based on text particles (TP) multi-band fusion to solve there problems. Firstly, text particles are generated by their local binary pattern of pyramid Haar wavelet coefficients in YUV color space. It preserves and uniforms text-background contrasts while extracting multi-band information. Secondly, the candidate text regions are generated via density-based text particle multi-band fusion, and the LHBP histogram analysis is utilized to remove non-text regions. Our TP-based detection framework can robustly locate text regions regardless of diversity sizes, colors, rotations, illuminations and text-background contrasts. Experiment results on ICDAR 03 over the existing methods demonstrate the robustness and effectiveness of the proposed method.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
El Rube, I., Ahmed, M., Kamel, M.: Wavelet approximation-based affine invariant shape representation functions. IEEE Transactions on Pattern Analysis and Machine Intelligence 28(2), 323–327 (2006)
Chen, D.T., Bourland, H., Thiran, J.P.: Text identification in complex background using SVM. In: International Conference on Computer Vision and Pattern Recognition, pp. 621–626 (2001)
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Transactions on Circuits and Systems for Video Technology 12, 256–268 (2002)
Ezaki, N., Bulacu, M., Schomaker, L.: Text Detection from Natural Scene Images: Towards a System for Visually Impaired Persons. In: International Conference on Pattern Recognition, vol. 2, pp. 683–686 (2004)
Yi, J., Peng, Y., Xiao, J.: Color-based Clustering for Text Detection and Extraction in Image. In: ACM Conference on Multimedia, pp. 847–850 (2007)
Gllavata, J., Ewerth, R., Freisleben, B.: Text detection in images based on unsupervised classification of high frequency wavelet coefficients. In: International Conference on Pattern Recognition, pp. 425–428 (2004)
Ye, Q.X., Huang, Q.M.: A New Text Detection Algorithm in Image/Video Frames. In: Advances in Multimedia Information Processing 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30-December 3, 2004, pp. 858–865 (2004)
Ji, R.R., Xu, P.F., Yao, H.X., Sun, X.S., Liu, T.Q.: Directional Correlation Analysis of Local Haar Binary Pattern for Text Detection. In: IEEE International Conference on Multimedia & Expo (accept, 2008)
Xi, D., Kamel, M.: Extraction of filled in strokes from cheque image using pseudo 2D wavelet with adjustable support. In: IEEE International Conference on Image Processing, vol. 2, pp. 11–14 (2005)
Ojala, T., Pietikäinen, M., Harwood, D.: A Comparative Study of Texture Measures with Classification Based on Feature Distributions. Pattern Recognition 29(1), 51–59 (1996)
Ojala, T., Pietikäinen, M., Mäenpäa, T.: Multi-resolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(7), 971–987 (2002)
Li, S., Chu, R., Liao, S., Zhang, L.: Illumination Invariant Face Recognition Using Near-Infrared Images. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(4), 627–639 (2007)
Zhao, G., Pietikäinen, M.: Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(6), 915–928 (2007)
Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 682–687 (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, P., Ji, R., Yao, H., Sun, X., Liu, T., Liu, X. (2008). Text Particles Multi-band Fusion for Robust Text Detection. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2008. Lecture Notes in Computer Science, vol 5112. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69812-8_58
Download citation
DOI: https://doi.org/10.1007/978-3-540-69812-8_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69811-1
Online ISBN: 978-3-540-69812-8
eBook Packages: Computer ScienceComputer Science (R0)