Abstract
Video text detection is a challenging problem, since the background of the video image is generally complex and its subtitles often have colour bleeding problems, blurred boundaries and low contrast due to video loss compression and low resolution. Text detection is an important method for many image processing tasks that are focused on text. In this paper, we put forward a robust detection method for extracting video text using hybrid method of MSER via morphological filtering for solving these problems. This can also solve the problems of bleeding in colour and floured boundaries. In this we added 2-D DWT (discrete wavelet transforms) is developed to remove background noise and improve sound and text contrast. SO that components are extracted with MSER from origin and processed images. In this work, the proposed method develops an efficient method of extracting and recognizing text, using the principle of morphological operations using MATLAB. Current text extraction methods–edge dependent and connected components when implemented separately yield better results. But using these approaches sometimes cannot get better results as well as its time taken. Therefore it is suggested that combine both methods, the outcome shows that the approach suggested produces better results than the other two approaches.
Similar content being viewed by others
References
Abualigah LMQ (2019) Feature selection and enhanced krill herd algorithm for text document clustering. Springer, Berlin, pp 1–165
Abualigah LMQ, Khader AT, Hanandeh ES (2018) A new feature selection method to improve the document clustering using particle swarm optimization algorithm. Journal of Computational Science 25:456–466
Chen X, Yuille A (2004) Detecting and reading text in natural scenes. Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit 2004:366–373
Chen D, Odobez JM, Bourlard H (2004) Text detection and recognition in images and video frames. Pattern Recogn 37(3):595–608
Chen H, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B (2011) Robust Text detection in natural images with edge-enhanced maximally stable extremal regions. In: Proceedings of the 18th IEEE International Conference on Image Processing (ICIP), 2011, Brussels, Belgium, September 11–14, 2011, pp 2609–2612
Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: Proceedings of the IEEE Conference on CVPR, vol 2010. San Francisco, CA, USA, pp 2963–2970
Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. Proc IEEE Conf Comput Vis Pattern Recognit 1:2963–2970
González A, Bergasa LM (2013) A text reading, algorithm for natural images. Image Vis Comput 31(1):255–274
Huo Y, Wei G, Zhang Y, Wu L (2010) An adaptive threshold for the Canny Operator of edge detection. In: Proceedings of the IEEE International Conference on Image Analysis and Signal Processing (IASP), vol 2010. Zhejiang, China, pp 371–374
Karatzas D, Shafait F, Uchida S, Iwamura M, Gomez L, Robles S, Mas J, Fernandez D, Almazan J, Heras LP l (2013) Robust reading competition. In: Proceedings of the 12th International Conference of Document Analysis and Recognition (ICDAR), 2013, Washington, DC, United States Aug 25–28, pp 1115–1124
Kim H (1996) Efficient automatic text location method and content-based indexing and structuring of video database. J Vis Commun Image Represent 7(4):336–344
Koo H, Kim DH (2013) Scene text detection via connected component clustering and non-text filtering. IEEE Trans Image Process 22(6):2296–2305
Liang C-W, Chen P-Y (2004) DWT based text localization. Int J Appl Sci Eng 2(1):105–116
Liu W, Liu H, Tao D, Wang Y, Lu K (2015) Multi view Hessian regularized logistic regression for action recognition. Signal Process 110(2015):101–107
Matas J, Chum O, Urban M, Pajdla T (2002) Robust wide baseline stereo from maximally stable extremal regions. Proc Br Mach Vis Conf 2002:384–393
Mosleh A, Bouguila N, Hamza AB (2012) Image text detection using a bandlet based edge detector and stroke width transform. Proc Br Mach Vis Conf 2012:1–2
Neumann L, Matas J (2013) On combining multiple segmentations in scene text recognition. Proc IEEE Intl Conf Doc Anal Recognit 2013:523–527
Pan YF, Hou X, Liu CL (2011) A hybrid approach to detect and localize texts in natural scene images. IEEE Trans Image Process 20(3):800–813
Shekar B, Kumari MS, Holla R (2011) An efficient and accurate shot boundary detection technique based on colour moments. Int J Artif Intell Knowl Disc 1(2011):77–80
Shi C, Wang C, Xiao B, Zhang Y, Gao S (2013) Scene text detection using graph model built up on maximally stable extremal regions. Pattern Recogn Lett 34(2):107–116
Shi C, Wang C, Xiao B, Zhang Y, Gao S, Zhang Z (2013) Scene text recognition using part based tree structured character detections. Proc IEEE Intl Conf Comput Vis Pattern Recognit 2013:2961–2968
Shivakumara P, Phan TQ, Tan CL (2011) A Laplacian Approach to multioriented text detection in video. IEEE Trans Pattern Anal Mach Intell (TPAMI): IEEE Transactions on Software Engineering 33(2):1–8
Shivakumara P, Basavaraju HT, Guru DS, Tan CL (2013) Detection of curved text in video: quad tree based method. 12th International Conference on Document Analysis and Recognition (ICDAR), IEEE (2013), Washington, DC, USA, pp 594–598
Singh M, Kaur A (2015) An efficient hybrid scheme for key frame extraction and text localization in video: international conference on advances in computing. Commun Inf 2015:1250–1254
Wei YC, Lin CH (2012) A robust video text detection approach using SVM. Expert Syst Appl 39(12):10832–10840
Weifeng L, Hongli L, Dapeng T, Wang Y, Ke L (2015) Multi view Hessian regularized logistic regression for action recognition. Signal Process 110(2015):101–107
Xu C, Tao D, Xu C (2015) Multi view intact space learning. IEEE Trans Pattern Anal Mach Intell 37(12):2531–2544
Ye Q, Doermann D (2014) Text detection and recognition in imagery : a survey. IEEE Trans Pattern Anal Mach Intell 99:1–20
Ye Q, Doermann D (2014) Robust scene text detection using integrated feature discrimination. In: Proceedings of the IEEE International Conference on Image Processing (ICIP), vol 2014. France, Paris, pp 1678–1682
Yi C, Tian Y (2011) Text string detection from natural scenes by structure based partition and grouping. IEEE Trans Image Process 20(9):2594–2605
Yin X, Yin X, Hao H, Iqbal K (2012) Effective Text Localization in Natural Scene Images with MSER, Geometry-based Grouping and Ada Boost: In: Proceedings of the 21st International IAPR Conference on Pattern Recognition (ICPR'12), 2012, November 11-15, 2012. Tsukuba, Japan, pp 725–728
Yin X , Yin X, Huang K, Hao H (2013) Accurate and robust text detection: a step in for text retrieval in natural scene images. In: Proceedings of the 36th International ACMSIGIR Conference on Research and Development in Information Retrieval (SIGIR'13), 2013
Yin X, Yin X, Huang K, Hao H (2014) Robust text detection in natural scene images. IEEE Trans Pattern Anal Mach Intell 36(5):970–982
Zhang J, Kasturi R (2010) Character energy and link energy-based text extraction in scene images: In: Proc the 10th Asian Conference on Computer Vision, Springer. Berlin, Heidelberg, pp 308–320
Acknowledgments
Authors of the study did not acknowledge to any funding agency. Because, authors has done this work by own. Authors acknowledge the Satya institutes of technology and managements to provide good lab facilities to carry out this work.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sravani, M., Maheswararao, A. & Murthy, M.K. Robust detection of video text using an efficient hybrid method via key frame extraction and text localization. Multimed Tools Appl 80, 9671–9686 (2021). https://doi.org/10.1007/s11042-020-10113-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-10113-2