Skip to main content

A Novel Video Text Detection and Localization Approach

  • Conference paper
Advances in Multimedia Information Processing - PCM 2008 (PCM 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5353))

Included in the following conference series:

Abstract

Text detection and localization in videos are often used for video information indexing and retrieval, as text can retrieve the semantic information of videos. In this paper, we propose a novel approach to detect and localize texts by means of integrating the multiple video frames and multiple video frame motion features. For text detection, first, the motion feature detection is employed to fulfill the multiple frame verification. Second, the synthesized motion feature image, which is produced by motion vector on consecutive frames, is used to detect the text region under a synthesized image, which is produced by multiframe integration. Third, corner points are employed to locate the candidate text pixels position and a region growing algorithm is developed to connect these pixels into text blocks. In text localization, we use the corner points to accurately locate the text region. Experimental results show satisfying performance of the proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lyu, M.R., Song, J., Cai, M.: A Comprehensive Method for Multilingual Video Text Detection, Localization, and Extraction. IEEE Trans. on Circuits and Systems for Video Technology 15(2), 243–255 (2005)

    Article  Google Scholar 

  2. Sato, T., Kanade, T.: Video OCR: Indexing digital news libraries by recognition of superimposed caption. In: ICCV Workshop on Image and Video Retrieval (1998)

    Google Scholar 

  3. Hua, X., Yin, P., Zhang, H.J.: Efficient video text recognition using Multiple Frame Integration. In: IEEE Int. Conf. on Image Processing (ICIP) (September 2002)

    Google Scholar 

  4. Hu, J., Xi, J., Wu, L.: Automatic Detection and Verification of Text Regions in News Video Frames. Int. Journal of Pattern Recognition and Artificial Intelligence 16(2), 257–271 (2002)

    Article  Google Scholar 

  5. Ye, Q., Huang, Q.: A New Text Detection Algorithm in Images/Video Frames. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds.) PCM 2004. LNCS, vol. 3332, pp. 858–865. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  6. Lienhart, R., Wernicke, A.: Localizing and segmenting text in images, videos and web pages. IEEE Trans. Circuits Syst. Video Technol. 12(4), 256–268 (2002)

    Article  Google Scholar 

  7. Boreczky, J.S., Wilcox, L.D.: A Hidden Markov Model Framework for Video Segmentation using Audio and Image Features. In: ICASSP 1998, Seattle, WA, pp. 3741–3744 (May 1998)

    Google Scholar 

  8. Hua, X., Yin, P., Zhang, H.J.: Efficient video text recognition using Multiple Frame Integration. In: IEEE Int. Conf. on Image Processing (ICIP) (September 2002)

    Google Scholar 

  9. Sato, T., Kanade, T., Hughes, E.K., Smith, M.A.: Video OCR: Indexing digital news libraries by recognition of superimposed caption. ACM Multimedia Systems: Special Issue on Video Libraries 7, 385–395 (1999)

    Article  Google Scholar 

  10. Harris, C., Stephens, M.: A Combined Corner and Edge Detector. In: Fourth Alvey Vision Conference, pp. 147–151 (1988)

    Google Scholar 

  11. Wang, R., Jin, W., Wu, L.: A Novel Video Caption Detection Approach Using Multi-Frame Integration. In: Proceedings of the 17th International Conference on Pattern Recognition (ICPR 2004) (2004)

    Google Scholar 

  12. Hua, X.-S., Chert, X.-R., Wenyin, L., Zhang, H.-J.: Automatic location of text in video frames. In: Proceedings of the 2001 ACM workshops on Multimedia (September 2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Huang, X., Ma, H., Yuan, H. (2008). A Novel Video Text Detection and Localization Approach. In: Huang, YM.R., et al. Advances in Multimedia Information Processing - PCM 2008. PCM 2008. Lecture Notes in Computer Science, vol 5353. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89796-5_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-89796-5_54

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-89795-8

  • Online ISBN: 978-3-540-89796-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics