Text localization in camera captured images using fuzzy distance transform based adaptive stroke filter

Paul, Shauvik; Saha, Satadal; Basu, Subhadip; Saha, Punam Kumar; Nasipuri, Mita

doi:10.1007/s11042-019-7178-3

Published: 17 January 2019

Multimedia Tools and Applications, volume 78, pages 18017–18036 (2019)

Abstract

Localizing text in camera-captured images with complex backgrounds is a growing demand in modern IT-enabled services. Most current text localization techniques are sensitive to text features such as color, size and style, and to background clutter. Among the methods proposed in the literature, the stroke filter is particularly effective at localizing text. However, a traditional stroke filter has a fixed width, so it can segment only strokes or text within a predefined range of widths. The proposed method uses a fuzzy distance transform based adaptive stroke filter, which can effectively localize text regions in camera-captured images with complex backgrounds. The method is applied to a database of 600 images, and the visual quality of the resulting text segmentation is quite impressive. To measure accuracy, the method is applied to a set of 16 test images and the segmentation results are compared with ground-truth images, giving recall, precision and F-measure values of 96.65%, 87.77% and 91.89% respectively.
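
The accuracy figures quoted in the abstract come from comparing segmentation output with ground-truth images. The abstract does not say whether that comparison is made per pixel or per text region, so the following is only a minimal sketch, assuming pixel-level binary masks. The function name `segmentation_scores` is hypothetical and does not come from the paper.

```python
import numpy as np


def segmentation_scores(predicted, ground_truth):
    """Return (recall, precision, f_measure) for two binary masks.

    Assumption: evaluation is pixel-level, with non-zero pixels
    marking text. The paper may use a different protocol.
    """
    pred = np.asarray(predicted, dtype=bool)
    truth = np.asarray(ground_truth, dtype=bool)
    if pred.shape != truth.shape:
        raise ValueError("masks must have the same shape")

    tp = np.count_nonzero(pred & truth)    # text pixels found
    fp = np.count_nonzero(pred & ~truth)   # background marked as text
    fn = np.count_nonzero(~pred & truth)   # text pixels missed

    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return recall, precision, f_measure


if __name__ == "__main__":
    # Toy 2x3 masks: 2 true positives, 1 false positive, 1 false negative.
    pred = [[1, 1, 0],
            [0, 1, 0]]
    truth = [[1, 0, 0],
             [0, 1, 1]]
    print(segmentation_scores(pred, truth))
```

The harmonic mean of the reported recall and precision is roughly 92.0%, marginally above the reported 91.89%. One possible reading is that the F-measure was averaged over the 16 images rather than computed from the averaged recall and precision, but the abstract does not confirm this.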

Author information

Authors and Affiliations

MCA Department, Techno India, Salt Lake, Kolkata, India
Shauvik Paul
ECE Department, MCKV Institute of Engineering, Howrah, India
Satadal Saha
CSE Department, Jadavpur University, Kolkata, India
Subhadip Basu & Mita Nasipuri
Department of Radiology, University of Iowa, Iowa City, USA
Punam Kumar Saha

Corresponding author

Correspondence to Satadal Saha.

About this article

Cite this article

Paul, S., Saha, S., Basu, S. et al. Text localization in camera captured images using fuzzy distance transform based adaptive stroke filter. Multimed Tools Appl 78, 18017–18036 (2019). https://doi.org/10.1007/s11042-019-7178-3

Received: 29 May 2018
Revised: 21 November 2018
Accepted: 06 January 2019
Published: 17 January 2019
Issue Date: 15 July 2019
DOI: https://doi.org/10.1007/s11042-019-7178-3
