skip to main content
10.1145/1815330.1815366acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

A new wavelet-median-moment based method for multi-oriented video text detection

Published:09 June 2010Publication History

ABSTRACT

In this paper, we present a new method based on wavelet-median-moments and a novel idea of angle projection for detecting multi-oriented text in video. The proposed method uses wavelet decomposition first to obtain three high frequency sub-bands (LH, HL and HH) and then median moments are computed on the average sub-bands of the three high frequency sub-bands to brighten the text pixels. K-means clustering (K=2) is used for obtaining text pixels from the wavelet-median-moments features (WMMF). Text candidates are obtained by mapping the output of K-means on Sobel edge map of the original input frame. To deal with multi-oriented text, we introduce a new idea of Angle Projection (AP) based on boundary growing and nearest neighbor concepts from the text candidates instead of conventional projection profiles. The proposed method is experimented on horizontal text data, non-horizontal text data, temporal data, non-text data and camera based images (scene text data of ICDAR 2003 competition) to show that the proposed method is superior to existing methods.

References

  1. D. Crandall and R. Kasturi, Robust Detection of Stylized Text Events in Digital Video, ICDAR 2001, pp 865--869. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. K. Jung, "Neural network-based text location in color images", Pattern Recognition Letters 22, 2001, pp 1503--1515. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Zang and R. Kasturi. Extraction of Text Objects in Video Documents: Recent Progress, DAS 2008, pp 5--17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. K. Jung, K. I. Kim and A. K. Jain. Text information extraction in images and video: a survey. Pattern Recognition, 37, 2004, pp. 977--997.Google ScholarGoogle ScholarCross RefCross Ref
  5. V. Y. Marinano and R. Kasturi, "Locating Uniform-Colored Text in Video Frames", 15th ICPR, Volume 4, 2000, pp 539--542.Google ScholarGoogle Scholar
  6. Q. Ye, Q. Huang, W. Gao and D. Zhao. Fast and robust text detection in images and video frames. Image and Vision Computing 23, 2005, pp 565--576. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. K. Jain and B. Yu. Automatic Text Location in Images and Video Frames. Pattern Recognition, 31, 1998, pp 2055--2076.Google ScholarGoogle ScholarCross RefCross Ref
  8. C. Liu, C. Wang and R. Dai. Text Detection in Images Based on Unsupervised Classification of Edge-based Features. ICDAR 2005, pp 610--614. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. P. Shivakumara, W. Huang and C. L. Tan. An Efficient Edge based Technique for Text Detection in Video Frames, DAS 2008, pp 307--314. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Cai, J. Song and M. R. Lyu, "A New Approach for Video Text Detection" ICIP, 2002, pp 117--120.Google ScholarGoogle Scholar
  11. E. K. Wong and M. Chen. A new robust algorithm for video text extraction. Pattern Recognition 36, 2003, pp 1397--1406.Google ScholarGoogle ScholarCross RefCross Ref
  12. T. Q. Phan, P. Shivakumara and C. L Tan, "A Laplacian Method for Video Text Detection", ICDAR, 2009, pp 66--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Li, D. Doermann and O. Kia. Automatic Text Detection and Tracking in Digital Video. IEEE Transactions on Image Processing, Vol. 9, No. 1, January 2000, pp 147--156. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. P. Shivakumara, T. Q. Phan and C. L Tan, "A Robust Wavelet Transform Based Technique for Video Text Detection", ICDAR, 2009, pp 1285--1289. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. X. Chen and A. L. Yuille, "Detecting and Reading Text in Natural Scenes", CVPR, 2004, pp 366--373. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Zhang, D. Goldgof and R. Kasturi, "A New Edge-Based Text Verification Approach for Video", ICPR, 2008.Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
    June 2010
    490 pages
    ISBN:9781605587738
    DOI:10.1145/1815330

    Copyright © 2010 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 9 June 2010

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader