Abstract
Text detection and localization in videos are often used for video information indexing and retrieval, as text can retrieve the semantic information of videos. In this paper, we propose a novel approach to detect and localize texts by means of integrating the multiple video frames and multiple video frame motion features. For text detection, first, the motion feature detection is employed to fulfill the multiple frame verification. Second, the synthesized motion feature image, which is produced by motion vector on consecutive frames, is used to detect the text region under a synthesized image, which is produced by multiframe integration. Third, corner points are employed to locate the candidate text pixels position and a region growing algorithm is developed to connect these pixels into text blocks. In text localization, we use the corner points to accurately locate the text region. Experimental results show satisfying performance of the proposed algorithm.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lyu, M.R., Song, J., Cai, M.: A Comprehensive Method for Multilingual Video Text Detection, Localization, and Extraction. IEEE Trans. on Circuits and Systems for Video Technology 15(2), 243–255 (2005)
Sato, T., Kanade, T.: Video OCR: Indexing digital news libraries by recognition of superimposed caption. In: ICCV Workshop on Image and Video Retrieval (1998)
Hua, X., Yin, P., Zhang, H.J.: Efficient video text recognition using Multiple Frame Integration. In: IEEE Int. Conf. on Image Processing (ICIP) (September 2002)
Hu, J., Xi, J., Wu, L.: Automatic Detection and Verification of Text Regions in News Video Frames. Int. Journal of Pattern Recognition and Artificial Intelligence 16(2), 257–271 (2002)
Ye, Q., Huang, Q.: A New Text Detection Algorithm in Images/Video Frames. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds.) PCM 2004. LNCS, vol. 3332, pp. 858–865. Springer, Heidelberg (2004)
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images, videos and web pages. IEEE Trans. Circuits Syst. Video Technol. 12(4), 256–268 (2002)
Boreczky, J.S., Wilcox, L.D.: A Hidden Markov Model Framework for Video Segmentation using Audio and Image Features. In: ICASSP 1998, Seattle, WA, pp. 3741–3744 (May 1998)
Hua, X., Yin, P., Zhang, H.J.: Efficient video text recognition using Multiple Frame Integration. In: IEEE Int. Conf. on Image Processing (ICIP) (September 2002)
Sato, T., Kanade, T., Hughes, E.K., Smith, M.A.: Video OCR: Indexing digital news libraries by recognition of superimposed caption. ACM Multimedia Systems: Special Issue on Video Libraries 7, 385–395 (1999)
Harris, C., Stephens, M.: A Combined Corner and Edge Detector. In: Fourth Alvey Vision Conference, pp. 147–151 (1988)
Wang, R., Jin, W., Wu, L.: A Novel Video Caption Detection Approach Using Multi-Frame Integration. In: Proceedings of the 17th International Conference on Pattern Recognition (ICPR 2004) (2004)
Hua, X.-S., Chert, X.-R., Wenyin, L., Zhang, H.-J.: Automatic location of text in video frames. In: Proceedings of the 2001 ACM workshops on Multimedia (September 2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huang, X., Ma, H., Yuan, H. (2008). A Novel Video Text Detection and Localization Approach. In: Huang, YM.R., et al. Advances in Multimedia Information Processing - PCM 2008. PCM 2008. Lecture Notes in Computer Science, vol 5353. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89796-5_54
Download citation
DOI: https://doi.org/10.1007/978-3-540-89796-5_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89795-8
Online ISBN: 978-3-540-89796-5
eBook Packages: Computer ScienceComputer Science (R0)