Skip to main content

An Application of K-Means Clustering for Improving Video Text Detection

  • Conference paper
Intelligent Informatics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 182))

Abstract

In the present work, we explore an extensive applications of Gabor filter and K-means clustering algorithm in detection of text in an unconstrained complex background and regular images. The system is a comprehensive of four stages: In the first stage, combination of wavelet transforms and Gabor filter is applied to extract sharpened edges and textural features of a given input image. In the second stage, the resultant Gabor output image is grouped into three clusters to classify the background, foreground and the true text pixels using K-means clustering algorithm. In the third stage of the system, morphological operations are performed to obtain connected components, then after a concept of linked list approach is in turn used to build a true text line sequence. In the final stage, wavelet entropy is imposed on an each connected component sequence, in order to determine the true text region of an input image. Experiments are conducted on 101 video images and on standard ICDAR 2003 database. The proposed method is evaluated by testing the 101 video images as well with the ICDAR 2003 database. Experimental results show that the proposed method is able to detect a text of different size, complex background and contrast. Withal, the system performance outreaches the existing method in terms of detection accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chowdhury, S.P., Dhar, S., Das, A.K., Chanda, B., McMenemy, K.: Robust Extraction of Text from Camera Images. In: The Proceedings of 10th International Conference on Document Analysis and Recognition, pp. 1280–1284 (2009)

    Google Scholar 

  2. Manjunath Aradhya, V.N., Pavithra, M.S., Naveena, C.: A Robust Multilingual Text Detection Approach Based on Transforms and Wavelet Entropy. In: The Proceedings of Procedia Technology (Elsevier) 2nd International Conference on Computer, Communication, Control and Information Technology (C3IT-2012), Hooghly, West Bengal, India (in press, 2012)

    Google Scholar 

  3. Kaushik, N., Sarathi, D., Mittal, A.: Robust Text Detection in images using morphological operations and Gabor wavelet. In: The Proceedings of EAIT, pp. 153–156 (2006)

    Google Scholar 

  4. Phan, T., Shivakumara, P., Tan, C.: A Laplacian method for video text detection. In: The Proceedings of 10th International Conference on Document Analysis and Recognition, pp. 66–70 (2009)

    Google Scholar 

  5. Shivakumara, P., Phan, T., Tan, C.: A Robust Wavelet Transform Based Technique for Video Text Detection. In: The Proceedings of 10th International Conference on Document Analysis and Recognition, pp. 1285–1289 (2009)

    Google Scholar 

  6. Shivakumara, P., Phan, T., Tan, C.: A Laplacian Approach to Multi-Oriented Text Detection in Video. IEEE Transactions on Pattern Analysis and Machine Intelligence 33, 412–419 (2011)

    Article  Google Scholar 

  7. Naveena, C., Manjunath Aradhya, V.N.: A linked List Approach for Handwritten Textline Segmentation. Journal of Intelligent Systems (in press, 2012)

    Google Scholar 

  8. Liu, C., Wang, C., Dai, R.: Text Detection in Images Based on Unsupervised Classification of Edge-based Features. In: ICDAR, pp. 610–614 (2005)

    Google Scholar 

  9. Wong, E.K., Chen, M.: A new robust algorithm for video text extraction. In: The Proceedings of First Asian Conference on Pattern Recognition, ACPR, vol. 36, pp. 1397–1406 (2003)

    Google Scholar 

  10. Mariano, V.Y., Kasturi, R.: Locating Uniform Colored Text in Video Frames. In: 15th ICPR, vol. 4, pp. 539–542 (2000)

    Google Scholar 

  11. Teknomo, K.: Kardi Teknomo’s Tutorials (2006); Available via Kardi Teknomo, http://people.revoledu.com/kardi/tutorial/kMean/index.html

  12. Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 Robust Reading competitions. In: ICDAR 2003, pp. 682–687 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to V. N. Manjunath Aradhya .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Manjunath Aradhya, V.N., Pavithra, M.S. (2013). An Application of K-Means Clustering for Improving Video Text Detection. In: Abraham, A., Thampi, S. (eds) Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32063-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32063-7_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32062-0

  • Online ISBN: 978-3-642-32063-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics