Skip to main content

Text Detection of Two Major Indian Scripts in Natural Scene Images

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7139))

Abstract

In this article, we present a robust scheme for detection of Devanagari or Bangla texts in scene images. These are the two most popular scripts in India. The proposed scheme is primarily based on two major characteristics of such texts - (i) variations in stroke thickness for text components of a script are low compared to their non-text counterparts and (ii) presence of a headline along with a few vertical downward strokes originating from this headline. We use the Euclidean distance transform to verify the general characteristics of texts in (i). Also, we apply the probabilistic Hough line transform to detect the characteristic headline of Devanagari and Bangla texts. Further, similarity and adjacency measures are applied to identify text regions, which do not satisfy the verification in (ii). The proposed approach has been simulated on a repository of 120 images taken from Indian roads and the results are encouraging. Also, we have discussed the applicability of the proposed scheme for detection of English texts. Towards this end, we have considered the training and test samples from the image database of ICDAR 2003 Robust Reading Competition.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Liang, J., Doermann, D., Li, H.: Camera Based Analysis of Text and Documents: A Survey. Int. Journ. on Doc. Anal. and Recog. 7, 84–104 (2005)

    Article  Google Scholar 

  2. Jung, K., Kim, K.I., Jain, A.K.: Text Information Extraction in Images and Video: a Survey. Pattern Recognition 37, 977–997 (2004)

    Article  Google Scholar 

  3. Li, H., Doermann, D., Kia, O.: Automatic Text Detection and Tracking in Digital Video. IEEE Trans. Image Processing 9, 147–167 (2000)

    Article  Google Scholar 

  4. Gllavata, J., Ewerth, R., Freisleben, B.: Text Detection in Images Based on Unsupervised Classification of High Frequency Wavelet Coefficients. In: Proc. of 17th Int. Conf. on Patt. Recog., vol. 1, pp. 425–428 (2004)

    Google Scholar 

  5. Saoi, T., Goto, H., Kobayashi, H.: Text Detection in Color Scene Images Based on Unsupervised Clustering of Multihannel Wavelet Features. In: Proc. of 8th Int. Conf. on Doc. Anal. and Recog., pp. 690–694 (2005)

    Google Scholar 

  6. Ezaki, N., Bulacu, M., Schomaker, L.: Text Detection From Natural Scene Images: Towards a System for Visually Impaired Persons. In: Proc. of 17th Int. Conf. on Patt. Recog., vol. II, pp. 683–686 (2004)

    Google Scholar 

  7. Ye, Q., Huang, Q., Gao, W., Zhao, D.: Fast and Robust Text Detection in Images and Video Frames. Image and Vis. Comp. 23, 565–576 (2005)

    Article  Google Scholar 

  8. Subramanian, K., Natarajan, P., Decerbo, M., Castan̈on, D.: Character-Stroke Detection for Text-Localization and Extraction. In: Proc. of Int. Conf. on Doc. Anal. and Recog., pp. 33–37 (2005)

    Google Scholar 

  9. Epshtein, B., Ofek, E., Wexler, Y.: Detecting Text in Natural Scenes with Stroke Width Transform. In: Proc. of IEEE Conf. on Comp. Vis. and Patt. Recog., pp. 2963–2970 (2010)

    Google Scholar 

  10. Bhattacharya, U., Parui, S.K., Mondal, S.: Devanagari and Bangla Text Extraction from Natural Scene Images. In: 10th Int. Conf. on Doc. Anal. and Recog., pp. 171–175 (2009)

    Google Scholar 

  11. Kumar, S., Perrault, A.: Text Detection on Nokia N900 Using Stroke Width Transform, http://www.cs.cornell.edu/courses/cs4670/2010fa/projects/final/results/group_of_arp86_sk2357/Writeup.pdf (last accessed on October 31, 2011)

  12. Canny, J.: A Computational Approach to Edge Detection. IEEE Trans. Patt. Anal. and Mach. Intell. 8, 679–714 (1986)

    Article  Google Scholar 

  13. Borgefors, G.: Distance Transformations in Digital Images. Comp. Vis., Graph. and Image Proc. 34, 344–371 (1986)

    Article  Google Scholar 

  14. Matas, J., Galambos, C., Kittler, J.: Progressive Probabilistic Hough Transform. In: Proc. of BMVC 1998, vol. 1, pp. 256–265 (1998)

    Google Scholar 

  15. Bradski, G., Kaehler, A.: Learning OpenCV. O’Reilly Media, Inc. (2008)

    Google Scholar 

  16. Lucas, S.M., et al.: ICDAR 2003 Robust Reading Competitions. In: Proc. of 7th Int. Conf. on Doc. Anal. and Recog., pp. 682–668 (2003)

    Google Scholar 

  17. Zhou, L., Lu, Y., Tan, C.L.: Bangla/English Script Identification Based on Analysis of Connected Component Profiles. In: Proc. Doc. Anal. Syst., pp. 243–254 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Roy Chowdhury, A., Bhattacharya, U., Parui, S.K. (2012). Text Detection of Two Major Indian Scripts in Natural Scene Images. In: Iwamura, M., Shafait, F. (eds) Camera-Based Document Analysis and Recognition. CBDAR 2011. Lecture Notes in Computer Science, vol 7139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29364-1_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-29364-1_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-29363-4

  • Online ISBN: 978-3-642-29364-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics