Robust detection of video text using an efficient hybrid method via key frame extraction and text localization

Sravani, Meesala; Maheswararao, Aggala; Murthy, Meesala Krishna

doi:10.1007/s11042-020-10113-2

Published: 13 November 2020

Volume 80, pages 9671–9686, (2021)
Multimedia Tools and Applications

Meesala Sravani¹, Aggala Maheswararao² & Meesala Krishna Murthy³ (ORCID: orcid.org/0000-0002-3634-0053)

275 Accesses
7 Citations

Abstract

Video text detection is a challenging problem: the background of a video frame is generally complex, and subtitles often suffer from colour bleeding, blurred boundaries and low contrast caused by lossy video compression and low resolution. Text detection is an important step in many text-centred image processing tasks. In this paper, we put forward a robust method for extracting video text that addresses these problems, including colour bleeding and blurred boundaries, with a hybrid of MSER and morphological filtering. A 2-D DWT (discrete wavelet transform) is applied to suppress background noise and improve text contrast. Components are then extracted with MSER from both the original and the processed images. The proposed method extracts and recognizes text efficiently using morphological operations and is implemented in MATLAB. Existing text extraction methods, edge-based and connected-component-based, can yield good results when applied separately, but they sometimes fail to do so and can be time-consuming. We therefore propose combining the two, and the results show that the combined approach outperforms either method on its own.
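
As a rough illustration of the wavelet step mentioned in the abstract, the following is a minimal sketch of a single-level 2-D Haar DWT in plain Python. The abstract does not say which wavelet the authors used, and their implementation is in MATLAB, so the Haar filter, the function names `haar_dwt2` and `detail_energy`, and the list-of-lists image format are all assumptions made for this example, not the authors' method.

```python
def haar_dwt2(image):
    """Single-level 2-D Haar DWT (orthonormal) of a grayscale image.

    `image` is a list of equal-length rows with even height and width.
    Returns four half-size sub-bands: (approx, row_diff, col_diff, diag).
    Sub-band naming is hypothetical; LH/HL conventions vary between libraries.
    """
    rows, cols = len(image), len(image[0])
    if rows % 2 or cols % 2:
        raise ValueError("image dimensions must be even")
    approx, row_diff, col_diff, diag = [], [], [], []
    for r in range(0, rows, 2):
        a_row, r_row, c_row, d_row = [], [], [], []
        for c in range(0, cols, 2):
            # The 2x2 block:  a b
            #                 d e
            a, b = image[r][c], image[r][c + 1]
            d, e = image[r + 1][c], image[r + 1][c + 1]
            a_row.append((a + b + d + e) / 2.0)  # smooth approximation
            r_row.append((a + b - d - e) / 2.0)  # top row minus bottom row
            c_row.append((a - b + d - e) / 2.0)  # left column minus right column
            d_row.append((a - b - d + e) / 2.0)  # diagonal difference
        approx.append(a_row)
        row_diff.append(r_row)
        col_diff.append(c_row)
        diag.append(d_row)
    return approx, row_diff, col_diff, diag


def detail_energy(bands):
    """Sum of absolute detail coefficients per position.

    Large values mark blocks with strong local intensity changes, which is
    where high-contrast strokes such as text are likely to sit.
    """
    _, row_diff, col_diff, diag = bands
    return [
        [abs(row_diff[r][c]) + abs(col_diff[r][c]) + abs(diag[r][c])
         for c in range(len(row_diff[0]))]
        for r in range(len(row_diff))
    ]


bands = haar_dwt2([[1, 2], [3, 4]])
energy = detail_energy(bands)
```

A flat region produces zero detail coefficients, so thresholding `energy` is one simple way to separate smooth background from candidate text blocks before a region detector such as MSER is run on the result.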

Acknowledgments

The authors received no funding for this study, which they carried out independently. The authors thank the Satya Institute of Technology and Management for providing the laboratory facilities used in this work.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Satya Institute Of Technology and Management, Vizianagaram, Andhra Pradesh, 535003, India
Meesala Sravani
Department of Computer Science and Engineering, Vignan Institute of Engineering for Women, Visakhapatnam, Andhra Pradesh, 530049, India
Aggala Maheswararao
Department of Biotechnology and Bioinformatics, AMIT, Affiliated to Utkal University, Bhubaneswar, Khurda, Odisha, 752050, India
Meesala Krishna Murthy

Corresponding author

Correspondence to Meesala Krishna Murthy.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Sravani, M., Maheswararao, A. & Murthy, M.K. Robust detection of video text using an efficient hybrid method via key frame extraction and text localization. Multimed Tools Appl 80, 9671–9686 (2021). https://doi.org/10.1007/s11042-020-10113-2

Received: 03 May 2020
Revised: 18 September 2020
Accepted: 19 October 2020
Published: 13 November 2020
Issue Date: March 2021
DOI: https://doi.org/10.1007/s11042-020-10113-2
