Abstract
The amount of digital video data is increasing over the world. It highlights the need for efficient algorithms that can retrieve this data by content. The full use of this media is currently limited by the opaque nature of the video which prevents content-based access. To facilitate video indexing and browsing, it is essential to allow non-linear access, especially for long programs. This can be achieved by identifying semantic description captured automatically from video story structure. Among these descriptions, text within video frames is considered as rich features that enable a good way for browsing video. In this paper we propose a fast Hough transformation based approach for automatic video frames text localization. Experimental results, that we drove on a large sample of video images issued from portions of news broadcasts, sports program, advertisements and movies, shows that our method is very efficient, capable to locate text regions with different character sizes, directions and styles even in case of complex image background.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bouaziz, B., Mahdi, W., Ardabilian, M., Ben Hamadou, A.: A new approach for texture features extraction : application for text localization in video images. In: Proceedings of the IEEE International Conference on Multimedia and Exposition, ICME 2006, Ontario, Canada (July 2006)
Aigrain, P., Jolyet, P., Longueville, V.: Representation based user interfaces for the Audiovisual Library of Year 2000. In: Proceedings of IS&T/SPIE Conference on Multimedia Computing and Networking, pp. 35–45 (1995)
Ardebilian, M., Tu, X.W., Chen, L.: Improvement of Shot Detection Methods Based-on Dynamic Threshold Selection. In: Proc. SPIE: Multimedia Strage and Archiving Systems II, Dallas, USA (1997)
Jung, K., Han, H.: Hybrid approach to efficient text extraction in complex color images. Pattern Recognition Letters 25(6), 679–699 (2004)
Swain, M., Ballard, D.: Color Indexing. International Journal of Computer Vision 7(1), 11–32 (1991)
Mahdi, W., Ardebilian, M., Chen, L.: Automatic Video Content Parsing Based on Exterior and Interior Shots Classification. In: The seventh Iternational conference on Advanced Computer Systems, ACS 2000, Szczecin, Poland, October 23-25, pp. 571–578 (2000), ISBN 83-87352-24-7
Mahdi, W., Ardebilian, M., Chen, L.: Automatic Video Scene Segmentation based on Spatial-temporal Clues and Rhythm. International Journal of Networking and Information Sytsems 3(1), 27–51 (2000), http://www.hermes-journals.com
Lyu, Song, J., Cai, M.: A comprehensive method for multilingual video text detection, localization, and extraction. IEEE Transactions on Circuits and Systems for Video Technology 15(2), 243–255 (2005)
Marc, D.: Media stream: Representing Video for Retrieval and Repurposing. In: ACM Multimedia proceeding, Sans Fransisco, CA, USA, October 15-20, pp. 478–479 (1994)
Jain, A.K., Yu, B.: Automatic Text Location In Images and Videos Frames. Pattern Recognation 31(12), 2055–2076 (1998)
Zhong, Y., Karu, K., Jain, A.K.: Locating text in complex color images. Pattern Recognation 28(10), 1523–1535 (1995)
Li, H., Doermann, D., Kia, O.: Automatic Text Detection and Tracking in Digital Video. IEEE Trans, Image Processing 9(1) (January 2000)
Zhong, Y., Zhang, H., Jain, A.K.: Automatic Caption Extraction of Digital Video. In: Proc. ICIP 1999, Kobe (1999)
Sato, T., Kanade, T.: Video OCR: Indexing digital news livrairies by recognation of superimposed caption. In: ICCV Workshop on Image and Video retrieval (1998)
Wu, V., Manmatha, R., Riseman, E.M.: Finding text in images. In: Proc. of the 2nd Intl. Conf. on Digital Libraries, Philadalphia, PA, pp. 1–10 (July 1997)
Sobottka, K., Bunke, H., Kronenberg, H.: Identification of Text on Colored Book and Journal Covers. In: Proc. of the 5th Intl. Conf. on Document Analysis and Recoginzation, pp. 57–62 (1999)
Sato, T., Kanade, T., Hughes, E., Smith, M.: Video OCR for digital News Archives. In: IEEE International Workshop on Content-Based Access of Images and Video Databases, pp. 52–60 (January 1998)
Qi, W., et al.: Integrating Visual, Audio and Text Analysis for news Video. In: 7th IEEE International Conference on Image Processing (ICIP 2000), Vancouver, British Columbia, Canada, September 10-13 (2000)
Hao, Y., Zhang, Y., Zeng-guang, H., Min, T.: Automatic Text Detection In Video Frames Based on Bootstrap Artificial Neural Network And CED. Journal of WSCG 11(1) (2003), ISSN 1214-6972, WSG 2003, Plzen, Czech Republic. Copyright UNION Agency – Science Press (February 3-7, 2003)
Wolf, C., Jolion, J.M., Chassaing, F.: Text Localization, Enhancement and Binarization in Multimedia Documents. In: Proceedings of the International Conference on Pattern Recognition (ICPR) 2002, Quebec City, Canada, August 11–15, vol. 4, pp. 1037–1040. IEEE Computer Society, Los Alamitos (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bouaziz, B., Mahdi, W., Ben Hamadou, A. (2006). A New Video Images Text Localization Approach Based on a Fast Hough Transform. In: Campilho, A., Kamel, M.S. (eds) Image Analysis and Recognition. ICIAR 2006. Lecture Notes in Computer Science, vol 4141. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11867586_39
Download citation
DOI: https://doi.org/10.1007/11867586_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44891-4
Online ISBN: 978-3-540-44893-8
eBook Packages: Computer ScienceComputer Science (R0)