Skip to main content
Log in

Robust detection of video text using an efficient hybrid method via key frame extraction and text localization

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Video text detection is a challenging problem, since the background of the video image is generally complex and its subtitles often have colour bleeding problems, blurred boundaries and low contrast due to video loss compression and low resolution. Text detection is an important method for many image processing tasks that are focused on text. In this paper, we put forward a robust detection method for extracting video text using hybrid method of MSER via morphological filtering for solving these problems. This can also solve the problems of bleeding in colour and floured boundaries. In this we added 2-D DWT (discrete wavelet transforms) is developed to remove background noise and improve sound and text contrast. SO that components are extracted with MSER from origin and processed images. In this work, the proposed method develops an efficient method of extracting and recognizing text, using the principle of morphological operations using MATLAB. Current text extraction methods–edge dependent and connected components when implemented separately yield better results. But using these approaches sometimes cannot get better results as well as its time taken. Therefore it is suggested that combine both methods, the outcome shows that the approach suggested produces better results than the other two approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  1. Abualigah LMQ (2019) Feature selection and enhanced krill herd algorithm for text document clustering. Springer, Berlin, pp 1–165

    Book  Google Scholar 

  2. Abualigah LMQ, Khader AT, Hanandeh ES (2018) A new feature selection method to improve the document clustering using particle swarm optimization algorithm. Journal of Computational Science 25:456–466

  3. Chen X, Yuille A (2004) Detecting and reading text in natural scenes. Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit 2004:366–373

    Google Scholar 

  4. Chen D, Odobez JM, Bourlard H (2004) Text detection and recognition in images and video frames. Pattern Recogn 37(3):595–608

    Article  Google Scholar 

  5. Chen H, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B (2011) Robust Text detection in natural images with edge-enhanced maximally stable extremal regions. In: Proceedings of the 18th IEEE International Conference on Image Processing (ICIP), 2011, Brussels, Belgium, September 11–14, 2011, pp 2609–2612

  6. Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: Proceedings of the IEEE Conference on CVPR, vol 2010. San Francisco, CA, USA, pp 2963–2970

  7. Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. Proc IEEE Conf Comput Vis Pattern Recognit 1:2963–2970

    Google Scholar 

  8. González A, Bergasa LM (2013) A text reading, algorithm for natural images. Image Vis Comput 31(1):255–274

    Article  Google Scholar 

  9. Huo Y, Wei G, Zhang Y, Wu L (2010) An adaptive threshold for the Canny Operator of edge detection. In: Proceedings of the IEEE International Conference on Image Analysis and Signal Processing (IASP), vol 2010. Zhejiang, China, pp 371–374

  10. Karatzas D, Shafait F, Uchida S, Iwamura M, Gomez L, Robles S, Mas J, Fernandez D, Almazan J, Heras LP l (2013) Robust reading competition. In: Proceedings of the 12th International Conference of Document Analysis and Recognition (ICDAR), 2013, Washington, DC, United States Aug 25–28, pp 1115–1124

  11. Kim H (1996) Efficient automatic text location method and content-based indexing and structuring of video database. J Vis Commun Image Represent 7(4):336–344

    Article  Google Scholar 

  12. Koo H, Kim DH (2013) Scene text detection via connected component clustering and non-text filtering. IEEE Trans Image Process 22(6):2296–2305

    Article  MathSciNet  Google Scholar 

  13. Liang C-W, Chen P-Y (2004) DWT based text localization. Int J Appl Sci Eng 2(1):105–116

    Google Scholar 

  14. Liu W, Liu H, Tao D, Wang Y, Lu K (2015) Multi view Hessian regularized logistic regression for action recognition. Signal Process 110(2015):101–107

  15. Matas J, Chum O, Urban M, Pajdla T (2002) Robust wide baseline stereo from maximally stable extremal regions. Proc Br Mach Vis Conf 2002:384–393

    Google Scholar 

  16. Mosleh A, Bouguila N, Hamza AB (2012) Image text detection using a bandlet based edge detector and stroke width transform. Proc Br Mach Vis Conf 2012:1–2

    Google Scholar 

  17. Neumann L, Matas J (2013) On combining multiple segmentations in scene text recognition. Proc IEEE Intl Conf Doc Anal Recognit 2013:523–527

    Google Scholar 

  18. Pan YF, Hou X, Liu CL (2011) A hybrid approach to detect and localize texts in natural scene images. IEEE Trans Image Process 20(3):800–813

    Article  MathSciNet  Google Scholar 

  19. Shekar B, Kumari MS, Holla R (2011) An efficient and accurate shot boundary detection technique based on colour moments. Int J Artif Intell Knowl Disc 1(2011):77–80

    Google Scholar 

  20. Shi C, Wang C, Xiao B, Zhang Y, Gao S (2013) Scene text detection using graph model built up on maximally stable extremal regions. Pattern Recogn Lett 34(2):107–116

    Article  Google Scholar 

  21. Shi C, Wang C, Xiao B, Zhang Y, Gao S, Zhang Z (2013) Scene text recognition using part based tree structured character detections. Proc IEEE Intl Conf Comput Vis Pattern Recognit 2013:2961–2968

    Google Scholar 

  22. Shivakumara P, Phan TQ, Tan CL (2011) A Laplacian Approach to multioriented text detection in video. IEEE Trans Pattern Anal Mach Intell (TPAMI): IEEE Transactions on Software Engineering 33(2):1–8

  23. Shivakumara P, Basavaraju HT, Guru DS, Tan CL (2013) Detection of curved text in video: quad tree based method. 12th International Conference on Document Analysis and Recognition (ICDAR), IEEE (2013), Washington, DC, USA, pp 594–598

  24. Singh M, Kaur A (2015) An efficient hybrid scheme for key frame extraction and text localization in video: international conference on advances in computing. Commun Inf 2015:1250–1254

    Google Scholar 

  25. Wei YC, Lin CH (2012) A robust video text detection approach using SVM. Expert Syst Appl 39(12):10832–10840

    Article  Google Scholar 

  26. Weifeng L, Hongli L, Dapeng T, Wang Y, Ke L (2015) Multi view Hessian regularized logistic regression for action recognition. Signal Process 110(2015):101–107

    Google Scholar 

  27. Xu C, Tao D, Xu C (2015) Multi view intact space learning. IEEE Trans Pattern Anal Mach Intell 37(12):2531–2544

  28. Ye Q, Doermann D (2014) Text detection and recognition in imagery : a survey. IEEE Trans Pattern Anal Mach Intell 99:1–20

    Google Scholar 

  29. Ye Q, Doermann D (2014) Robust scene text detection using integrated feature discrimination. In: Proceedings of the IEEE International Conference on Image Processing (ICIP), vol 2014. France, Paris, pp 1678–1682

  30. Yi C, Tian Y (2011) Text string detection from natural scenes by structure based partition and grouping. IEEE Trans Image Process 20(9):2594–2605

    Article  MathSciNet  Google Scholar 

  31. Yin X, Yin X, Hao H, Iqbal K (2012) Effective Text Localization in Natural Scene Images with MSER, Geometry-based Grouping and Ada Boost: In: Proceedings of the 21st International IAPR Conference on Pattern Recognition (ICPR'12), 2012, November 11-15, 2012. Tsukuba, Japan, pp 725–728

  32. Yin X , Yin X, Huang K, Hao H (2013) Accurate and robust text detection: a step in for text retrieval in natural scene images. In: Proceedings of the 36th International ACMSIGIR Conference on Research and Development in Information Retrieval (SIGIR'13), 2013

  33. Yin X, Yin X, Huang K, Hao H (2014) Robust text detection in natural scene images. IEEE Trans Pattern Anal Mach Intell 36(5):970–982

    Article  Google Scholar 

  34. Zhang J, Kasturi R (2010) Character energy and link energy-based text extraction in scene images: In: Proc the 10th Asian Conference on Computer Vision, Springer. Berlin, Heidelberg, pp 308–320

Download references

Acknowledgments

Authors of the study did not acknowledge to any funding agency. Because, authors has done this work by own. Authors acknowledge the Satya institutes of technology and managements to provide good lab facilities to carry out this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Meesala Krishna Murthy.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sravani, M., Maheswararao, A. & Murthy, M.K. Robust detection of video text using an efficient hybrid method via key frame extraction and text localization. Multimed Tools Appl 80, 9671–9686 (2021). https://doi.org/10.1007/s11042-020-10113-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-020-10113-2

Keywords

Navigation