Abstract
Text detection from scenic photographs with text is a difficult issue that has recently attracted a lot of attention. There are two main elements in scenery photographs (1) Recognizing text in photographs and (2) Character recognition. The model’s entire accuracy depends on the output of this phase, finding the text in the photos is the most crucial aspect. An approach consisting of two phases has been proposed in this article. (1) Text recognition and (2) Text checker. Text detection is accomplished using the Maximally Stable Extremal Regions (MSER) feature detector. The output of the MSER feature detector is subjected to various filters in order to exclude components, i.e., unlikely to contain text. The second phase uses a machine learning methodology to classify the text and non-text on phase-1 final output. It has been discovered that the proposed method nearly removes all false-positive results on the MSER method’s final output.
Similar content being viewed by others
References
Baran R, Partila P, Wilk R (2018) Automated text detection and character recognition in natural scenes based on local image features and contour processing techniques, in: Intelligent human systems integration: proceedings of the 1st international conference on intelligent human systems integration (IHSI 2018): Integrating people and intelligent systems, January 7-9, 2018, Dubai, United Arab Emirates, Springer, pp. 42–48
Chen H, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B, Robust text detection in natural images with edge-enhanced maximally stable extremal regions, in, (2011) 18th IEEE international conference on image processing. IEEE 2011:2609–2612
Ch’ng C-K, Chan CS, Liu C-L (2020) Total-text: toward orientation robustness in scene text detection. Int J Doc Anal Recognit (IJDAR) 23(1):31–52
Epshtein B, Ofek E, Wexler Y, Detecting text in natural scenes with stroke width transform, in, (2010) IEEE computer society conference on computer vision and pattern recognition. IEEE 2010:2963–2970
Gllavata J, Ewerth R, Freisleben B (2004) Text detection in images based on unsupervised classification of high-frequency wavelet coefficients, in: Proceedings of the 17th international conference on pattern recognition, 2004. ICPR 2004., Vol. 1, IEEE, pp. 425–428
Gupta N, Jalal AS (2019) A robust model for salient text detection in natural scene images using mser feature detector and grabcut. Multimed Tools Appl 78:10821–10835
Hao M, Shi W, Zhang H, Wang Q, Deng K (2016) A scale-driven change detection method incorporating uncertainty analysis for remote sensing images. Remote Sens 8(9):745
He T, Huang W, Qiao Y, Yao J (2016) Text-attentional convolutional neural network for scene text detection. IEEE Trans Image Process 25(6):2529–2541
Li H, Doermann D, Kia O (2000) Automatic text detection and tracking in digital video. IEEE Trans Image Process 9(1):147–156
Liang J, Doermann D, Li H (2005) Camera-based analysis of text and documents: a survey. IJDAR 7:84–104
Imam NH, Vassilakis VG, Kolovos D (2022) Ocr post-correction for detecting adversarial text images. J Inform Secur Appl 66:103170
Naiemi F, Ghods V, Khalesi H (2021) Mostl: an accurate multi-oriented scene text localization. Circuits Syst Signal Process 40:4452–4473
Naiemi F, Ghods V, Khalesi H (2021) A novel pipeline framework for multi oriented scene text image detection and recognition. Expert Syst Appl 170:114549
Neumann L, Matas J (2011) A method for text localization and recognition in real-world images, in: Computer Vision–ACCV 2010: 10th Asian conference on computer vision, Queenstown, New Zealand, November 8-12, 2010, Revised Selected Papers, Part III 10, Springer, pp. 770–783
Panchal BY, Chauhan G, Panchal SR, Chaudhari UM (2022) An investigation on feature and text extraction from images using image recognition in android. Mater Today Proc 51:798–802
Rajeswari R, Aradhana B (2021) Character recognition in scene images using mser and cnn, in: Cognition and recognition: 8th international conference, ICCR 2021, Mandya, India, December 30–31, 2021, Revised Selected Papers, Springer,2023, pp. 99–107
Rashtehroudi AR, Akoushideh A, Shahbahrami A (2023) Pestd: a large-scale persian-english scene text dataset. Multimed Tools Appl. https://doi.org/10.1007/s11042-023-15062-0
Soni R, Kumar B, Chand S (2020) Text region extraction from scene images using agf and mser. Int J Image Graphics 20(02):2050009
Tong G, Dong M, Sun X, Song Y (2022) Natural scene text detection and recognition based on saturation-incorporated multi-channel mser. Knowl-Based Syst 250:109040
Wang X, Liu S, Du P, Liang H, Xia J, Li Y (2018) Object-based change detection in urban areas from high spatial resolution images based on multiple features and ensemble learning. Remote Sens 10(2):276
Weinman J, Hanson A, McCallum A (2004) Sign detection in natural images with conditional random fields, in: Proceedings of the 2004 14th IEEE signal processing society workshop machine learning for signal processing, IEEE, 2004, pp. 549–558
Yadav AK, Maurya AK, Yadav RS, et al (2021) Extractive text summarization using recent approaches: a survey, Ingénierie des Systèmes d’Information 26(1)
Yadav AK, Yadav RS, Maurya AK, et al (2023) State-of-the-art approach to extractive text summarization: a comprehensive review, Multimed Tools Appl 1–63
Yang X, Wang H, Xie D, Deng C, Tao D (2022) Object-agnostic transformers for video referring segmentation. IEEE Trans Image Process 31:2839–2849
Yang Z, Dong J, Liu P, Yang Y, Yan S (2019) Very long natural scenery image prediction by outpainting, in: Proceedings of the IEEE/CVF international conference on computer vision, pp. 10561–10570
Ye Q, Huang Q, Gao W, Zhao D (2005) Fast and robust text detection in images and video frames. Image Vis Comput 23(6):565–576
Acknowledgements
We would like to thank to reviewers.
Funding
No funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
None.
Informed Consent
There is no plagiarized.
Human and animals participants
None.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yadav, A.K., Sharma, A., Yadav, V. et al. Rfpssih: reducing false positive text detection sequels in scenery images using hybrid technique. Int J Syst Assur Eng Manag 14, 2289–2300 (2023). https://doi.org/10.1007/s13198-023-02070-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13198-023-02070-4