A skeleton-based method for multi-oriented video text detection

Authors:

Trung Quy Phan,

Palaiahnakote Shivakumara,

Chew Lim Tan

DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems

Pages 271 - 278

https://doi.org/10.1145/1815330.1815365

Published: 09 June 2010

Abstract

In this paper, we propose a method based on the skeletonization operation for multi-oriented video text detection. The first step uses our existing Laplacian-based method to identify candidate text regions. In the second step, each region is classified as either a simple connected component (a single text string) or a complex connected component (multiple text strings that are connected to each other) depending on the number of intersection points in its skeleton. Complex connected components are then segmented into constituent parts based on the skeleton segments in order to separate the text strings from each other. Finally, text string straightness and edge density are used for false positive elimination. Experimental results show that the proposed method is able to detect multi-oriented graphics text and scene text.
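The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the skeleton is already available as a binary grid (one pixel wide, 8-connected), it detects intersection points with the common crossing-number heuristic (three or more branches leaving a pixel), and it treats any component with more than `max_intersections` such points as complex. The function names and the default threshold of 0 are hypothetical choices made for illustration.

```python
from typing import List, Tuple

Grid = List[List[int]]  # binary skeleton image: 1 = skeleton pixel, 0 = background

# 8-neighbourhood offsets in clockwise order, starting from north.
RING = [(-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1), (-1, -1)]


def crossing_number(skel: Grid, r: int, c: int) -> int:
    """Count 0 -> 1 transitions while walking once around the 8 neighbours of (r, c)."""
    h, w = len(skel), len(skel[0])
    vals = []
    for dr, dc in RING:
        rr, cc = r + dr, c + dc
        # Pixels outside the image are treated as background.
        vals.append(skel[rr][cc] if 0 <= rr < h and 0 <= cc < w else 0)
    return sum(1 for i in range(8) if vals[i] == 0 and vals[(i + 1) % 8] == 1)


def intersection_points(skel: Grid) -> List[Tuple[int, int]]:
    """Return skeleton pixels from which three or more branches leave."""
    return [
        (r, c)
        for r, row in enumerate(skel)
        for c, v in enumerate(row)
        if v and crossing_number(skel, r, c) >= 3
    ]


def classify_component(skel: Grid, max_intersections: int = 0) -> str:
    """Label a component 'simple' or 'complex' (threshold is an assumption)."""
    n = len(intersection_points(skel))
    return "complex" if n > max_intersections else "simple"


# A straight stroke (one text string) and a plus-shaped skeleton (two crossing strings).
line = [
    [0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0],
]
plus = [
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
]
print(classify_component(line))  # simple
print(classify_component(plus))  # complex
```

In a full pipeline, a complex component would then be split at its intersection points into skeleton segments, each of which is a candidate text string; how the paper groups or filters those segments is not specified in the abstract and is not modelled here.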

Index Terms

A skeleton-based method for multi-oriented video text detection
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Interest point and salient region detections


Published In

DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems

June 2010

490 pages

ISBN:9781605587738

DOI:10.1145/1815330

General Chairs:
David Doermann (University of Maryland, College Park),
Venu Govindaraju (University at Buffalo, SUNY),
Daniel Lopresti (Lehigh University),
Prem Natarajan (Raytheon BBN Technologies)

Copyright © 2010 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

DAS '10

DAS '10: The 9th IAPR International Workshop on Document Analysis Systems

June 9 - 11, 2010

Boston, Massachusetts, USA
