skip to main content
10.1145/2393347.2396307acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
poster

Detecting text in the real world

Published:29 October 2012Publication History

ABSTRACT

The problem of text detection in natural scene images is challenging because of the unconstrained sizes, colors, backgrounds and alignments of the characters. This paper proposes novel symmetry features for this task. Within a text line, the intra-character symmetry captures the correspondence between the inner contour and the outer contour of a character while the inter-character symmetry helps to extract information from the gap region between two consecutive characters. A formulation based on Gradient Vector Flow is used to detect both types of symmetry points. These points are then grouped into text lines using the consistency in sizes, colors, and stroke and gap thickness. Therefore, unlike most existing methods which use only character features, our method exploits both the text features and the gap features to improve the detection result. Experimentally, our method compares well to the state-of-the-art on public datasets for natural scenes and street-level images, an emerging category of image data. The proposed technique can be used in a wide range of multimedia applications such as content-based image/video retrieval, mobile visual search and sign translation.

Skip Supplemental Material Section

Supplemental Material

References

  1. Canny, J. 1986. A Computational Approach To Edge Detection. IEEE TPAMI, 8:679--714, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Chen, D., Odobez, J.-M., and Bourlard, H. 2004. Text detection and recognition in images and video frames. Pattern Recognition, 37(3), 2004, pp. 595--608.Google ScholarGoogle ScholarCross RefCross Ref
  3. Chen, H., Tsai, S. S., Schroth, G., Chen, D. M., Grzeszczuk, R., and Girod, B. Robust text detection in natural images with edge-enhanced maximally stable extremal regions. ICIP 2011.Google ScholarGoogle ScholarCross RefCross Ref
  4. Chen, X., and Yuille, A. L. 2004. Detecting and Reading Text in Natural Scenes. CVPR 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dalal, N., and Triggs, B. 2005. Histograms of Oriented Gradients for Human Detection. CVPR 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Epshtein, B., Ofek, E., and Wexler, Y. 2010. Detecting Text in Natural Scenes with Stroke Width Transform. CVPR 2010.Google ScholarGoogle Scholar
  7. Liang, J., Doermann D., and Li, H. 2005. Camera-based Analysis of Text and Documents: A Survey. IJDAR, 7(2), 2005, pp. 84--104.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Liu, C., Wang, C., and Dai, R. 2005. Text Detection in Images Based on Unsupervised Classification of Edge-based Features. ICDAR 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Lucas, S. M. 2005. ICDAR 2005 Text Locating Competition Results. ICDAR 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Neumann, L., and Matas, J. 2011. Text Localization in Real-World Images Using Efficiently Pruned Exhaustive Search. ICDAR 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Tsai, S. S., Chen, D., Chen, H., Hsu, C.-H., Kim, K.-H., Singh, J. P., and Girod, B. 2011. Combining Image and Text Features: A Hybrid Approach to Mobile Book Spine Recognition. ACM MM 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Wang, K., Babenko, B., and Belongie, S. 2011. End-to-End Scene Text Recognition. ICCV 2011.Google ScholarGoogle Scholar
  13. Xu, C., and Prince, J. L. 1998. Snakes, Shapes, and Gradient Vector Flow. IEEE TIP, 7(3), 1998, pp. 359--369. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. ICDAR 2003 Dataset. http://algoval.essex.ac.uk/icdar/Datasets.htmlGoogle ScholarGoogle Scholar

Index Terms

  1. Detecting text in the real world

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                MM '12: Proceedings of the 20th ACM international conference on Multimedia
                October 2012
                1584 pages
                ISBN:9781450310895
                DOI:10.1145/2393347

                Copyright © 2012 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 29 October 2012

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • poster

                Acceptance Rates

                Overall Acceptance Rate995of4,171submissions,24%

                Upcoming Conference

                MM '24
                MM '24: The 32nd ACM International Conference on Multimedia
                October 28 - November 1, 2024
                Melbourne , VIC , Australia

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader