skip to main content
10.1145/1815330.1815365acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

A skeleton-based method for multi-oriented video text detection

Published: 09 June 2010 Publication History

Abstract

In this paper, we propose a method based on the skeletonization operation for multi-oriented video text detection. The first step uses our existing Laplacian-based method to identify candidate text regions. In the second step, each region is classified as either a simple connected component (a single text string) or a complex connected component (multiple text strings that are connected to each other) depending on the number of intersection points in its skeleton. Complex connected components are then segmented into constituent parts based on the skeleton segments in order to separate the text strings from each other. Finally, text string straightness and edge density are used for false positive elimination. Experimental results show that the proposed method is able to detect multi-oriented graphics text and scene text.

References

[1]
Antonacopoulos, A. and Karatzas, D. 2000. An Anthropocentric Approach to Text Extraction from WWW Images. Proceedings of the 4th IAPR Workshop on Document Analysis Systems, December 2000, pp. 515--526.
[2]
Cai, M., Song, J. and Lyu, M. R. 2002. A New Approach for Video Text Detection. IEEE International Conference on Image Processing, 2002, pp. 117--120.
[3]
Crandall, D., Antani S. and Kasturi R. 2003. Extraction of Special Effects Caption Text Events from Digital Video. Int J Doc Anal Recog 5(2--3):138--157, 2003.
[4]
Jung, K. 2001. Neural network-based text location in color images. Pattern Recognition Letters 22, 2001, pp 1503--1515.
[5]
Jung, K. and Han, J. H. 2004. Hybrid Approach to Efficient Text Extraction in Complex Color Images. Pattern Recognition Letters 25 (2004) 679--699.
[6]
Lee, C. W., Jung, K. and Kim, H. J. 2003. Automatic text detection and removal in video sequences. Pattern Recognition Letters 24, 2003, pp 2607--2623.
[7]
Liu, C., Wang, C. and Dai, R. 2005. Text Detection in Images Based on Unsupervised Classification of Edge-based Features. ICDAR 2005, pp. 610--614.
[8]
Mariano, V. Y. and Kasturi, R. 2000. Locating Uniform-Colored Text in Video Frames. 15th ICPR, Volume 4, 2000, pp 539--542.
[9]
Phan, T. Q., Shivakumara, P. and Tan, C. L. 2009. A Laplacian Method for Video Text Detection. 10th International Conference on Document Analysis and Recognition, Spain, July 26--29, 2009, pp. 66--70.
[10]
Sobottka, K., Bunke, H. and Kronenberg, H. 1999. Identification of Text on Colored Book and Journal Covers. Proceedings of the Fifth International Conference on Document Analysis and Recognition, p. 57, September 1999.
[11]
Wong, E. K. and Chen, M. 2003. A new robust algorithm for video text extraction. Pattern Recognition 36, 2003, pp. 1397--1406.
[12]
Ye, Q., Huang, Q., Gao, W. and Zhao, D. 2005. Fast and robust text detection in images and video frames. Image and Vision Computing 23, 2005, pp. 565--576.
[13]
Zang, J. and Kasturi, R. 2008. Extraction of Text Objects in Video Documents: Recent Progress. The Eighth IAPR Workshop on Document Analysis Systems, Nara, Japan, September 2008, pp 5--17.
[14]
Zhong, Y., Karu, K. and Jain, A. K. 1995. Locating Text in Complex Color Images. Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1), p. 146, August 14--15, 1995.
[15]
Zhou, J. and Lopresti, D. 1997. Extracting text from WWW images. Proceedings of the Fourth International Conference on Document Analysis and Recognition, pp. 248--252.
[16]
http://algoval.essex.ac.uk/icdar/TextLocating.html

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
June 2010
490 pages
ISBN:9781605587738
DOI:10.1145/1815330
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. connected component analysis
  2. multi-oriented video text detection
  3. skeleton

Qualifiers

  • Research-article

Funding Sources

Conference

DAS '10

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Decade research on text detection in images/videos: a reviewEvolutionary Intelligence10.1007/s12065-019-00248-zOnline publication date: 6-Jun-2019
  • (2014)Video Caption DetectionVideo Text Detection10.1007/978-1-4471-6515-6_3(49-80)Online publication date: 30-Jun-2014
  • (2013)Unsupervised text extraction from G-maps2013 International Conference on Human Computer Interactions (ICHCI)10.1109/ICHCI-IEEE.2013.6887782(1-4)Online publication date: Aug-2013
  • (2012)Recent Advances in Video Based Document ProcessingProceedings of the 2012 10th IAPR International Workshop on Document Analysis Systems10.1109/DAS.2012.72(63-68)Online publication date: 27-Mar-2012

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media