Skip to main content
Log in

Text localization in camera captured images using fuzzy distance transform based adaptive stroke filter

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Localization of text from camera captured images with complex background is now-a-days a growing demand of modern IT enable service. Most of the current text localization techniques are sensitive to text features like color, size, style and also to the background clutter. Among all the methods proposed in different literatures, Stroke Filter is much more effective in localization of text. The effectiveness of traditional stroke filter is limited because of its fixed width and is capable of segmenting strokes/texts of predefined range of width. The proposed method uses Fuzzy Distance Transform based adaptive stroke filter which can effectively localize text regions from camera captured images with complex background. The method is applied by experiment on a database containing 600 images and the visual response of text segmentation is quite impressive. To get the accuracy of the proposed method, it is applied on a set of 16 test images and the segmentation result is compared with the ground truth images resulting in a recall, precision and f-measure values of 96.65%, 87.77% and 91.89% respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  1. Anthimopoulos M, Gatos B, Pratikakis I (2010) A two-stage scheme for text detection in video images. Image Vis Comput 28(9):1413–1426

    Article  Google Scholar 

  2. Bai X, Shi B, Zhang C, Cai X, Qi L (2017) Text/non-text image classification in the wild with convolutional neural networks. Pattern Recogn 66:437–446

    Article  Google Scholar 

  3. Bezdek JC, Pal SK (1992) Fuzzy models for pattern recognition, vol. 267. IEEE press New York

  4. Borgefors G (1986) Distance transformations in digital images. Comput Vis, Graph, Image Proc 34(3):344–371

    Article  Google Scholar 

  5. Bušta M, Neumann L, Matas J (2017) Deep textspotter: An end-to-end trainable scene text localization and recognition framework. in Computer Vision (ICCV), 2017 IEEE International Conference on 2223–2231

  6. X. Chen and AL. Yuille, “Detecting and reading text in natural scenes,” in Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proc 2004 IEEE Comput Soc Conf, 2004, vol. 2, p. II–366

  7. Danielsson P-E (1980) Euclidean distance mapping. Comput Graph Image Proc 14(3):227–248

    Article  Google Scholar 

  8. Dimitrova N, Zhang H-J, Shahraray B, Sezan I, Huang T, Zakhor A (2002) Applications of video-content analysis and retrieval. Multi Media, IEEE 9(3):42–55

    Article  Google Scholar 

  9. Dutta IN, Chakraborty N, Mollah AF, Basu S, Sarkar R (2019) Multi-lingual Text Localization from Camera Captured Images Based on Foreground Homogenity Analysis, in Recent Developments in Machine Learning and Data Analytics, Springer, 149–158

  10. Emmanouilidis C, Batsalas C, Papamarkos N (2009) Development and Evaluation of Text Localization Techniques Based on Structural Texture Features and Neural Classifiers, 2009 10th International Conference on Document Analysis and Recognition 1270–1274

  11. Gavrila DM, Davis LS (1996) 3-D model-based tracking of humans in action: a multi-view approach. in IEEE Computer Society Conference on CVPR 73–80

  12. Gómez L, Karatzas D (2017) Textproposals: a text-specific selective search algorithm for word spotting in the wild. Pattern Recogn 70:60–74

    Article  Google Scholar 

  13. Gonzales RC, Woods RE (2002) Digital Image Processing, vol. 6. Prentice Hall

  14. Jin D, Saha PK (2013) A new fuzzy skeletonization algorithm and its applications to medical imaging. in International Conference on Image Analysis and Processing 662–671

  15. Jin D, Liu Y, Saha PK (2013) Application of fuzzy skeletonization ot quantitatively assess trabecular bone micro-architecture. in Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE. 3682–3685

  16. Jin D, Chen C, Saha PK (2015) Filtering non-significant quench points using collision impact in grassfire propagation. in International Conference on Image Analysis and Processing 432–443

  17. K. Jung (2004) In Kim, and A. K Jain. Text information extraction in images and video: a survey. Pattern recognition 37(5):977–997

  18. Jung C, Liu Q, Kim J (2008) A new approach for text segmentation using a stroke filter. Signal Process 88(7):1907–1916

    Article  MATH  Google Scholar 

  19. Jung C, Liu Q, Kim J (Jan. 2009) A stroke filter and its application to text localization. Pattern Recogn Lett 30(2):114–122

    Article  Google Scholar 

  20. Kaufmann A, Swanson DL (1975) Introduction to the theory of fuzzy subsets, vol. 1. Academic Press New York

  21. Liang J, Doermann D, Li H (2005) Camera-based analysis of text and documents: a survey. IJDAR 7(2–3):84–104

    Article  Google Scholar 

  22. Liu Y, Nie L, Liu L, Rosenblum DS (2016) From action to activity: sensor-based activity recognition. Neurocomputing 181:108–115

    Article  Google Scholar 

  23. Lyu MR, Song J, Cai M (2005) A comprehensive method for multilingual video text detection, localization, and extraction. Circ Syst Video Technol, IEEE Trans 15(2):243–255

    Article  Google Scholar 

  24. Ma J, Shao W, Ye H, Wang L, Wang H, Zheng Y, Xue X (2018) Arbitrary-oriented scene text detection via rotation proposals, IEEE Transactions on Multimedia

  25. Paul S, Saha S, Basu S, Nasipuri M (2015) Text Localization in Camera Captured Images Using Adaptive Stroke Filter. in Information Systems Design and Intelligent Applications, J. Mandal, S. Satpathy, M. Sanyal, P. Sarkar, and A. Mukhopadhyay, Eds. Springer, 217–225

  26. Rong X, Yi C, Tian Y (2017) Unambiguous text localization and retrieval for cluttered scenes, in Computer Vision and Pattern Recognition (CVPR), 2017 IEEE Conference on, 3279–3287

  27. Rosenfeld A, Pfaltz JL (1968) Distance functions on digital pictures. Pattern Recogn 1(1):33–61

    Article  MathSciNet  Google Scholar 

  28. Saha PK, Wehrli FW, Gomberg BR (2002) Fuzzy distance transform: theory, algorithms, and applications. Comput Vis Image Underst 86(3):171–190

    Article  MATH  Google Scholar 

  29. Saha S, Basu S, Nasipuri M, Basu DK (2009) Development of an automated Red Light Violation Detection System ( RLVDS ) for Indian vehicles, in National Conference on Computing and Communication Systems (COCOSYS-09). 59–64

  30. Saha S, Basu S, Nasipuri M, Basu DK (2009) License plate localization from vehicle images: an edge based multi-stage approach. Int J Recent Trends Eng (Comput Sci) 1(1):284–288

    Google Scholar 

  31. Saha S, Basu S, Nasipuri M, Basu DK (2011) Localization of license plates from Indian vehicle images using iterative edge map generation technique. J Comput 3(6):48–57

    Google Scholar 

  32. S. Saha, S. Basu, and M. Nasipuri (2012) License Plate Localization Using Vertical Edge Map and Hough Transform Based Technique,” in Proceedings of the International Conference on Information Systems Design and Intelligent Applications (INDIA 2012) held in Visakhapatnam, India, pp. 649–656

  33. S. Saha, S. Basu, and M. Nasipuri (2013) Development of a Stop-Line Violation Detection System for Indian Vehicles, in Handbook of Research on Computational Intelligence for Engineering, Science and Business, S. Bhattacharyya and P. Dutta, Eds. IGI Global. 200–227

  34. Saha S, Basu S, Nasipuri M (2014) iLPR: an Indian license plate recognition system. Multimed Tools Appl 1–36

  35. Saha PK, Borgefors G, di Baja GS (2016) A survey on skeletonization algorithms and their applications. Pattern Recogn Lett 76:3–12

    Article  Google Scholar 

  36. Shi C, Wang C, Xiao B, Zhang Y, Gao S (2013) Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn Lett 34(2):107–116

    Article  Google Scholar 

  37. Shi B, Bai X, Belongie S (2017) Detecting oriented text in natural images by linking segments, arXiv preprint arXiv:1703.06520

  38. Shi B, Bai X, Yao C (2017) An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans Pattern Anal Mach Intell 39(11):2298–2304

    Article  Google Scholar 

  39. Subramanian K, Natarajan P, Decerbo M, Castañòn D (2007) Character-stroke detection for text-localization and extraction, in Document Analysis and Recognition, 2007. ICDAR 2007. Ninth Int Conf 1:33–37

    Google Scholar 

  40. Wei Y, Zhang Z, Shen W, Zeng D, Fang M, Zhou S (2017) Text detection in scene images based on exhaustive segmentation. Signal Process Image Commun 50:1–8

    Article  Google Scholar 

  41. Ye Q, Doermann D (2015) Text detection and recognition in imagery: a survey. IEEE Trans Pattern Anal Mach Intell 37(7):1480–1500

    Article  Google Scholar 

  42. Zadeh LA (1978) Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst 1(1):3–28

    Article  MathSciNet  MATH  Google Scholar 

  43. Zhou X, Yao C, Wen H, Wang Y, Zhou S, He W, Liang J (2017) EAST: an efficient and accurate scene text detector. in Proc. CVPR 2642–2651

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Satadal Saha.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Paul, S., Saha, S., Basu, S. et al. Text localization in camera captured images using fuzzy distance transform based adaptive stroke filter. Multimed Tools Appl 78, 18017–18036 (2019). https://doi.org/10.1007/s11042-019-7178-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-019-7178-3

Keywords

Navigation