A Robust Text Segmentation Approach in Complex Background Based on Multiple Constraints

Fu, Libo; Wang, Weiqiang; Zhan, Yaowen

doi:10.1007/11581772_52

Libo Fu^18,19,
Weiqiang Wang^18,19 &
Yaowen Zhan^18,19

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3767))

Included in the following conference series:

Pacific-Rim Conference on Multimedia

1230 Accesses
6 Citations

Abstract

In this paper we propose a robust text segmentation method in complex background. The proposed method first utilizes the K-means algorithm to decompose a detected text block into different binary image layers. Then an effective post-processing is followed to eliminate background residues in each layer. In this step we develop a group of robust constraints to characterize general text regions based on color, edge and stroke thickness. We also propose the components relation constraint (CRC) designed specifically for Chinese characters. Finally the text image layer is identified based on the periodical and symmetrical layout of text lines. The experimental results show that our method can effectively eliminate a wide range of background residues, and has a better performance than the K-means method, as well as a high speed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Zhong, Y., Karu, K., Jain, A.K.: Locating text in complex color images. Pattern Recognition 28(10), 1523–1535 (1995)
Article Google Scholar
Wu, V., Manmatha, R., Riseman, E.: Finding text in images. In: Proceedings of ACM International Conference on Digital Libraries, Philadelphia, pp. 1–10 (1997)
Google Scholar
Jain, A.K., Yu, B.: Automatic text location in images and video frames. Pattern Recognition 31(12), 2055–2076 (1998)
Article Google Scholar
Chen, D., Odobez, J.-M., Bourlard, H.: Text detection and recognition in images and video frames. Pattern Recognition 3(37), 595–608 (2004)
Article Google Scholar
Ye, Q., Gao, W., Wang, W., Zeng, W.: A robust text detection algorithm in images and video frames. In: 4th IEEE Pacific-Rim Conference on Multimedia, Singapore (2003)
Google Scholar
Li, H., Doermann, D., Kia, O.: Automatic text detection and tracking in digital video. IEEE Trans. on Image Processing 9(1), 147–156 (2000)
Article Google Scholar
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Trans. on Circuits and Systems for Video Technology 12(4) (April 2002)
Google Scholar
Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recognition 37(5), 977–997 (2004)
Article Google Scholar
Trier, O.D., Jain, A.K.: Goal-directed evaluation of binarization methods. IEEE Trans. on Pattern Recognition and Machine Intelligence. 12 (December 1995)
Google Scholar
Tsai, C.M., Lee, H.J.: Binarization of color document images via luminance and saturation color features. IEEE Transactions on Image Processing 11(4) (2002)
Google Scholar
Lienhart, R.: Video OCR: a survey and practitioner’s guide. In: Video Mining, October, pp. 155–184. Kluwer Academic Publisher, Dordrecht (2003)
Google Scholar
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. on System, Man and Cybernetics 9(1), 62–66 (1979)
Article MathSciNet Google Scholar
Gao, J., Yang, J.: An adaptive algorithm for text detection from natural scenes. In: Computer Vision and Pattern Recognition, December 2001, vol. 2 (2001)
Google Scholar
Ye, Q., Gao, W., Huang, Q.: Automatic text segmentation from complex background. In: IEEE International Conference on Image Processing, Singapore (October 2004)
Google Scholar
Sato, T., Kanade, T., Hughes, E., Smith, M.: Video OCR for digital news archives. In: IEEE Workshop on Content-based Access of Image and Video Databases, Bombay, India, January, pp. 52–60 (1998)
Google Scholar
Tang, X., Gao, X., Liu, J., Zhang, H.: A spatial-temporal approach for video caption detection and recognition. IEEE Trans. on Neural Networks 13(4) (July 2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100080, China
Libo Fu, Weiqiang Wang & Yaowen Zhan
Graduate School of Chinese Academy of Sciences, Beijing, 100039, China
Libo Fu, Weiqiang Wang & Yaowen Zhan

Authors

Libo Fu
View author publications
You can also search for this author in PubMed Google Scholar
Weiqiang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yaowen Zhan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Gwangju Institute of Science and Technology (GIST), 1 Oryong-dong Buk-gu, 500-712, Gwangju, Korea
Yo-Sung Ho
Multimedia Security Lab, Korea University, Science Campus, 136-701, Seoul, Korea
Hyoung Joong Kim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fu, L., Wang, W., Zhan, Y. (2005). A Robust Text Segmentation Approach in Complex Background Based on Multiple Constraints. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_52

Download citation

DOI: https://doi.org/10.1007/11581772_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30027-4
Online ISBN: 978-3-540-32130-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics