A Novel Video Text Detection and Localization Approach

Huang, Xiaodong; Ma, Huadong; Yuan, Haidong

doi:10.1007/978-3-540-89796-5_54

Xiaodong Huang⁸,
Huadong Ma⁸ &
Haidong Yuan⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5353))

Included in the following conference series:

Pacific-Rim Conference on Multimedia

1473 Accesses

Abstract

Text detection and localization in videos are often used for video information indexing and retrieval, as text can retrieve the semantic information of videos. In this paper, we propose a novel approach to detect and localize texts by means of integrating the multiple video frames and multiple video frame motion features. For text detection, first, the motion feature detection is employed to fulfill the multiple frame verification. Second, the synthesized motion feature image, which is produced by motion vector on consecutive frames, is used to detect the text region under a synthesized image, which is produced by multiframe integration. Third, corner points are employed to locate the candidate text pixels position and a region growing algorithm is developed to connect these pixels into text blocks. In text localization, we use the corner points to accurately locate the text region. Experimental results show satisfying performance of the proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Robust Video Text Detection with Morphological Filtering Enhanced MSER

Article 13 March 2015

Multi-oriented Text Detection from Video Using Sub-pixel Mapping

Automatic video superimposed text detection based on Nonsubsampled Contourlet Transform

Article 25 March 2017

References

Lyu, M.R., Song, J., Cai, M.: A Comprehensive Method for Multilingual Video Text Detection, Localization, and Extraction. IEEE Trans. on Circuits and Systems for Video Technology 15(2), 243–255 (2005)
Article Google Scholar
Sato, T., Kanade, T.: Video OCR: Indexing digital news libraries by recognition of superimposed caption. In: ICCV Workshop on Image and Video Retrieval (1998)
Google Scholar
Hua, X., Yin, P., Zhang, H.J.: Efficient video text recognition using Multiple Frame Integration. In: IEEE Int. Conf. on Image Processing (ICIP) (September 2002)
Google Scholar
Hu, J., Xi, J., Wu, L.: Automatic Detection and Verification of Text Regions in News Video Frames. Int. Journal of Pattern Recognition and Artificial Intelligence 16(2), 257–271 (2002)
Article Google Scholar
Ye, Q., Huang, Q.: A New Text Detection Algorithm in Images/Video Frames. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds.) PCM 2004. LNCS, vol. 3332, pp. 858–865. Springer, Heidelberg (2004)
Chapter Google Scholar
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images, videos and web pages. IEEE Trans. Circuits Syst. Video Technol. 12(4), 256–268 (2002)
Article Google Scholar
Boreczky, J.S., Wilcox, L.D.: A Hidden Markov Model Framework for Video Segmentation using Audio and Image Features. In: ICASSP 1998, Seattle, WA, pp. 3741–3744 (May 1998)
Google Scholar
Hua, X., Yin, P., Zhang, H.J.: Efficient video text recognition using Multiple Frame Integration. In: IEEE Int. Conf. on Image Processing (ICIP) (September 2002)
Google Scholar
Sato, T., Kanade, T., Hughes, E.K., Smith, M.A.: Video OCR: Indexing digital news libraries by recognition of superimposed caption. ACM Multimedia Systems: Special Issue on Video Libraries 7, 385–395 (1999)
Article Google Scholar
Harris, C., Stephens, M.: A Combined Corner and Edge Detector. In: Fourth Alvey Vision Conference, pp. 147–151 (1988)
Google Scholar
Wang, R., Jin, W., Wu, L.: A Novel Video Caption Detection Approach Using Multi-Frame Integration. In: Proceedings of the 17th International Conference on Pattern Recognition (ICPR 2004) (2004)
Google Scholar
Hua, X.-S., Chert, X.-R., Wenyin, L., Zhang, H.-J.: Automatic location of text in video frames. In: Proceedings of the 2001 ACM workshops on Multimedia (September 2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing, 100876, China
Xiaodong Huang, Huadong Ma & Haidong Yuan

Authors

Xiaodong Huang
View author publications
You can also search for this author in PubMed Google Scholar
Huadong Ma
View author publications
You can also search for this author in PubMed Google Scholar
Haidong Yuan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Engineering Science, National Cheng Kung University, No.1, University Road, 701, Tainan City, Taiwan
Yueh-Min Ray Huang
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, 95, Zhongguancun East Road, 100190, Beijing, China
Changsheng Xu
Institute of Biomedical Engineering, National Cheng Kung University, No. 1, University Road, 701, Tainan City, Taiwan
Kuo-Sheng Cheng
Department of Electrical Engineering, National Cheng Kung University, No. 1, University Road, 701, Tainan City, Taiwan
Jar-Ferr Kevin Yang
Department of Electrical and Computer Engineering, Concordia University, S-EV005.139, 1515 St. Catherine West, Montreal, H4G 2W1, Quebec, Canada
M. N. S. Swamy
Microsoft Research Asia, 5/F, Beijing Sigma Center, No. 49, Zhichun Road, Hai Dian District, 100080, Beijing, China
Shipeng Li
Department of Information Management, National Kaohsiung University of Applied Sciences, No. 415, Jiangong Road, Sanmin District, 80778, Kaohsiung, Taiwan
Jen-Wen Ding

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Huang, X., Ma, H., Yuan, H. (2008). A Novel Video Text Detection and Localization Approach. In: Huang, YM.R., et al. Advances in Multimedia Information Processing - PCM 2008. PCM 2008. Lecture Notes in Computer Science, vol 5353. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89796-5_54

Download citation

DOI: https://doi.org/10.1007/978-3-540-89796-5_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89795-8
Online ISBN: 978-3-540-89796-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics