Abstract
In this paper we propose a robust text segmentation method in complex background. The proposed method first utilizes the K-means algorithm to decompose a detected text block into different binary image layers. Then an effective post-processing is followed to eliminate background residues in each layer. In this step we develop a group of robust constraints to characterize general text regions based on color, edge and stroke thickness. We also propose the components relation constraint (CRC) designed specifically for Chinese characters. Finally the text image layer is identified based on the periodical and symmetrical layout of text lines. The experimental results show that our method can effectively eliminate a wide range of background residues, and has a better performance than the K-means method, as well as a high speed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhong, Y., Karu, K., Jain, A.K.: Locating text in complex color images. Pattern Recognition 28(10), 1523–1535 (1995)
Wu, V., Manmatha, R., Riseman, E.: Finding text in images. In: Proceedings of ACM International Conference on Digital Libraries, Philadelphia, pp. 1–10 (1997)
Jain, A.K., Yu, B.: Automatic text location in images and video frames. Pattern Recognition 31(12), 2055–2076 (1998)
Chen, D., Odobez, J.-M., Bourlard, H.: Text detection and recognition in images and video frames. Pattern Recognition 3(37), 595–608 (2004)
Ye, Q., Gao, W., Wang, W., Zeng, W.: A robust text detection algorithm in images and video frames. In: 4th IEEE Pacific-Rim Conference on Multimedia, Singapore (2003)
Li, H., Doermann, D., Kia, O.: Automatic text detection and tracking in digital video. IEEE Trans. on Image Processing 9(1), 147–156 (2000)
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Trans. on Circuits and Systems for Video Technology 12(4) (April 2002)
Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recognition 37(5), 977–997 (2004)
Trier, O.D., Jain, A.K.: Goal-directed evaluation of binarization methods. IEEE Trans. on Pattern Recognition and Machine Intelligence. 12 (December 1995)
Tsai, C.M., Lee, H.J.: Binarization of color document images via luminance and saturation color features. IEEE Transactions on Image Processing 11(4) (2002)
Lienhart, R.: Video OCR: a survey and practitioner’s guide. In: Video Mining, October, pp. 155–184. Kluwer Academic Publisher, Dordrecht (2003)
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. on System, Man and Cybernetics 9(1), 62–66 (1979)
Gao, J., Yang, J.: An adaptive algorithm for text detection from natural scenes. In: Computer Vision and Pattern Recognition, December 2001, vol. 2 (2001)
Ye, Q., Gao, W., Huang, Q.: Automatic text segmentation from complex background. In: IEEE International Conference on Image Processing, Singapore (October 2004)
Sato, T., Kanade, T., Hughes, E., Smith, M.: Video OCR for digital news archives. In: IEEE Workshop on Content-based Access of Image and Video Databases, Bombay, India, January, pp. 52–60 (1998)
Tang, X., Gao, X., Liu, J., Zhang, H.: A spatial-temporal approach for video caption detection and recognition. IEEE Trans. on Neural Networks 13(4) (July 2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fu, L., Wang, W., Zhan, Y. (2005). A Robust Text Segmentation Approach in Complex Background Based on Multiple Constraints. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_52
Download citation
DOI: https://doi.org/10.1007/11581772_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30027-4
Online ISBN: 978-3-540-32130-9
eBook Packages: Computer ScienceComputer Science (R0)