Skip to main content

A Robust Text Segmentation Approach in Complex Background Based on Multiple Constraints

  • Conference paper
Advances in Multimedia Information Processing - PCM 2005 (PCM 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3767))

Included in the following conference series:

Abstract

In this paper we propose a robust text segmentation method in complex background. The proposed method first utilizes the K-means algorithm to decompose a detected text block into different binary image layers. Then an effective post-processing is followed to eliminate background residues in each layer. In this step we develop a group of robust constraints to characterize general text regions based on color, edge and stroke thickness. We also propose the components relation constraint (CRC) designed specifically for Chinese characters. Finally the text image layer is identified based on the periodical and symmetrical layout of text lines. The experimental results show that our method can effectively eliminate a wide range of background residues, and has a better performance than the K-means method, as well as a high speed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhong, Y., Karu, K., Jain, A.K.: Locating text in complex color images. Pattern Recognition 28(10), 1523–1535 (1995)

    Article  Google Scholar 

  2. Wu, V., Manmatha, R., Riseman, E.: Finding text in images. In: Proceedings of ACM International Conference on Digital Libraries, Philadelphia, pp. 1–10 (1997)

    Google Scholar 

  3. Jain, A.K., Yu, B.: Automatic text location in images and video frames. Pattern Recognition 31(12), 2055–2076 (1998)

    Article  Google Scholar 

  4. Chen, D., Odobez, J.-M., Bourlard, H.: Text detection and recognition in images and video frames. Pattern Recognition 3(37), 595–608 (2004)

    Article  Google Scholar 

  5. Ye, Q., Gao, W., Wang, W., Zeng, W.: A robust text detection algorithm in images and video frames. In: 4th IEEE Pacific-Rim Conference on Multimedia, Singapore (2003)

    Google Scholar 

  6. Li, H., Doermann, D., Kia, O.: Automatic text detection and tracking in digital video. IEEE Trans. on Image Processing 9(1), 147–156 (2000)

    Article  Google Scholar 

  7. Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Trans. on Circuits and Systems for Video Technology 12(4) (April 2002)

    Google Scholar 

  8. Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recognition 37(5), 977–997 (2004)

    Article  Google Scholar 

  9. Trier, O.D., Jain, A.K.: Goal-directed evaluation of binarization methods. IEEE Trans. on Pattern Recognition and Machine Intelligence. 12 (December 1995)

    Google Scholar 

  10. Tsai, C.M., Lee, H.J.: Binarization of color document images via luminance and saturation color features. IEEE Transactions on Image Processing 11(4) (2002)

    Google Scholar 

  11. Lienhart, R.: Video OCR: a survey and practitioner’s guide. In: Video Mining, October, pp. 155–184. Kluwer Academic Publisher, Dordrecht (2003)

    Google Scholar 

  12. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. on System, Man and Cybernetics 9(1), 62–66 (1979)

    Article  MathSciNet  Google Scholar 

  13. Gao, J., Yang, J.: An adaptive algorithm for text detection from natural scenes. In: Computer Vision and Pattern Recognition, December 2001, vol. 2 (2001)

    Google Scholar 

  14. Ye, Q., Gao, W., Huang, Q.: Automatic text segmentation from complex background. In: IEEE International Conference on Image Processing, Singapore (October 2004)

    Google Scholar 

  15. Sato, T., Kanade, T., Hughes, E., Smith, M.: Video OCR for digital news archives. In: IEEE Workshop on Content-based Access of Image and Video Databases, Bombay, India, January, pp. 52–60 (1998)

    Google Scholar 

  16. Tang, X., Gao, X., Liu, J., Zhang, H.: A spatial-temporal approach for video caption detection and recognition. IEEE Trans. on Neural Networks 13(4) (July 2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fu, L., Wang, W., Zhan, Y. (2005). A Robust Text Segmentation Approach in Complex Background Based on Multiple Constraints. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_52

Download citation

  • DOI: https://doi.org/10.1007/11581772_52

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-30027-4

  • Online ISBN: 978-3-540-32130-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics