poster

Detecting text in the real world

Authors:
Trung Quy Phan, National University of Singapore, Singapore
Palaiahnakote Shivakumara, National University of Singapore, Singapore
Chew Lim Tan, National University of Singapore, Singapore

MM '12: Proceedings of the 20th ACM international conference on Multimedia, October 2012, Pages 765–768. https://doi.org/10.1145/2393347.2396307

Published: 29 October 2012

ABSTRACT

The problem of text detection in natural scene images is challenging because of the unconstrained sizes, colors, backgrounds and alignments of the characters. This paper proposes novel symmetry features for this task. Within a text line, the intra-character symmetry captures the correspondence between the inner contour and the outer contour of a character while the inter-character symmetry helps to extract information from the gap region between two consecutive characters. A formulation based on Gradient Vector Flow is used to detect both types of symmetry points. These points are then grouped into text lines using the consistency in sizes, colors, and stroke and gap thickness. Therefore, unlike most existing methods which use only character features, our method exploits both the text features and the gap features to improve the detection result. Experimentally, our method compares well to the state-of-the-art on public datasets for natural scenes and street-level images, an emerging category of image data. The proposed technique can be used in a wide range of multimedia applications such as content-based image/video retrieval, mobile visual search and sign translation.
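As a rough illustration of the Gradient Vector Flow (GVF) field that the abstract builds on, the sketch below runs the standard iterative GVF diffusion of Xu and Prince on an edge map. It is not the authors' formulation: the abstract does not say how symmetry points are derived from the field, so the sketch stops at the field itself. The function names and the parameters mu, iterations and dt are assumptions chosen for illustration.

```python
import numpy as np


def laplacian(a):
    """5-point Laplacian with replicated (edge) borders."""
    p = np.pad(a, 1, mode="edge")
    return p[:-2, 1:-1] + p[2:, 1:-1] + p[1:-1, :-2] + p[1:-1, 2:] - 4.0 * a


def gradient_vector_flow(edge_map, mu=0.2, iterations=80, dt=0.5):
    """Return the GVF field (u, v) of a 2-D edge map.

    u is the horizontal (column) component and v the vertical (row) one.
    mu, iterations and dt are illustrative defaults, not values from the paper.
    """
    f = edge_map.astype(float)
    span = f.max() - f.min()
    if span > 0:
        f = (f - f.min()) / span  # normalise the edge map to [0, 1]

    fy, fx = np.gradient(f)       # np.gradient returns d/drow, then d/dcol
    mag2 = fx ** 2 + fy ** 2      # squared gradient magnitude (data weight)

    u, v = fx.copy(), fy.copy()   # start from the plain gradient field
    for _ in range(iterations):
        # Diffuse the field where the gradient is weak, and keep it close
        # to the original gradient where the gradient is strong.
        u += dt * (mu * laplacian(u) - mag2 * (u - fx))
        v += dt * (mu * laplacian(v) - mag2 * (v - fy))
    return u, v


if __name__ == "__main__":
    # Synthetic edge map: a single vertical edge in column 10.
    edge = np.zeros((21, 21))
    edge[:, 10] = 1.0
    u, v = gradient_vector_flow(edge)
    print("u left of the edge:", u[10, 5], "u right of the edge:", u[10, 15])
```

On this synthetic input the horizontal component points toward the edge from both sides, which is the property that lets a GVF field carry edge information into flat regions such as stroke interiors and the gaps between characters.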

Supplemental Material

msp018.zip (1.1 MB)

The ZIP file contains a PDF file which shows additional full-image results of the proposed text detection method on natural scenes and street images.

Index Terms

Detecting text in the real world
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Shape inference
      2. Computer vision representations
        Shape representations
  2. Computer graphics
    1. Image manipulation
      1. Texturing
    2. Shape modeling
2. Information systems
  1. Information retrieval
    1. Document representation
    2. Search engine architectures and scalability
      1. Search engine indexing

Published in
MM '12: Proceedings of the 20th ACM international conference on Multimedia
October 2012, 1584 pages
ISBN: 9781450310895
DOI: 10.1145/2393347
General Chairs: Noboru Babaguchi (Osaka University, Japan), Kiyoharu Aizawa (The University of Tokyo, Japan), John Smith (IBM, USA)
Program Chairs: Shin'ichi Satoh (National Institute of Informatics, Japan), Thomas Plagemann (University of Oslo, Norway), Xian-Sheng Hua (Microsoft, USA), Rong Yan (Facebook, USA)
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.
Publisher: Association for Computing Machinery, New York, NY, United States
Author Tags
gradient vector flow
natural scene text
scene text detection
street view images
symmetry detection
texture analysis
Qualifiers
- poster
Acceptance Rates
Overall Acceptance Rate: 995 of 4,171 submissions, 24%