Skip to main content

Advertisement

Log in

Optimal feature and classifier selection for text region classification in natural scene images using Weka tool

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

The problem of text detection and localization in scene images has always been challenging for the researchers over the years due to diversities present in these images. This diversity includes variation in fonts, size, color, different backgrounds, etc. The textual content in such images can be helpful for humans in many different domains like visually impaired people, scene understanding, intelligent navigation, etc. The natural scene contains some non-text objects along with relevant text objects, and it is necessary to classify them appropriately & accurately to increase the performance of the detection and localization method. The classification of text regions in scene images depends on the selection of optimal features and optimal classifier. This work contributes to finding both the optimal feature set and the optimal classifier with the help of weka tool. In this paper, first, we detect the possible text regions with the help of the improved MSER algorithm; then, we extract 11 features on these potential text regions. From these 11 features, we choose an optimal feature set for discrimination between text and non-text components with the help of the CfsSubsetEval and BFS parameter of the Weka Tool. We trained several classifiers using these optimal features with the help of Weka tool on the ICDAR 2013 training set. The performance of these classifiers is compared empirically based on the classification accuracy obtained using Weka tool. Based on this empirical estimation, Naive Bayes Classifier with the highest accuracy of 92.5% is proposed as an optimal choice for classification purpose.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

Notes

  1. The term ’region(s) and component(s) are used interchangeably in this paper

References

  1. Amancio DR, Comin CH, Casanova D, Travieso G, Bruno OM, Rodrigues FA, da Fontoura Costa L (2014) A systematic comparison of supervised classifiers. PloS one 9(4):e94137

    Article  Google Scholar 

  2. Ansari GJ, Shah JH, Yasmin M, Sharif M, Fernandes SL (2018) A novel machine learning approach for scene text extraction. Future Generation Computer Systems

  3. Baran R, Partila P, Wilk R (2018) Automated text detection and character recognition in natural scenes based on local image features and contour processing techniques. In: International conference on intelligent human systems integration. Springer, pp 42–48

  4. Bušta M, Neumann L, Matas J (2017) Deep textspotter: an end-to-end trainable scene text localization and recognition framework. In: IEEE International conference on computer vision (ICCV). Venice, pp 22–29

  5. Chang CC, Lin CJ (2011) Libsvm: a library for support vector machines. ACM Trans Intell Syst Technol (TIST) 2(3):27

    Google Scholar 

  6. Chen X, Yuille AL (2004) Detecting and reading text in natural scenes. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004, vol 2. IEEE, pp II–II

  7. Chen H, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B (2011) Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 2011 18th IEEE International conference on image processing (ICIP). IEEE, pp 2609–2612

  8. da Silva BLS, Ciarelli PM (2016) Edge detection and confidence map applied to identify textual elements in images

  9. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE Computer society conference on computer vision and pattern recognition, 2005. CVPR 2005, vol 1. IEEE, pp 886–893

  10. Dash JK, Mukhopadhyay S (2018) Similarity learning for texture image retrieval using multiple classifier system. Multimed Tools Appl 77(1):459–483

    Article  Google Scholar 

  11. Dash JK, Mukhopadhyay S, Gupta RD (2017) Multiple classifier system using classification confidence for texture classification. Multimed Tools Appl 76(2):2535–2556

    Article  Google Scholar 

  12. Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Conference on computer vision and pattern recognition (CVPR). IEEE, pp 2963–2970

  13. Extremal MS, Matas J, Chum O, Urban M, Pajdla T (2002) Robust wide baseline stereo from in. In: British machine vision conference. Citeseer

  14. Fabrizio J, Robert-Seidowsky M, Dubuisson S, Calarasanu S, Boissel R (2016) Textcatcher: a method to detect curved and challenging text in natural scenes. Int J Doc Anal Recogn (IJDAR) 19(2):99–117

    Article  Google Scholar 

  15. Feng Y, Song Y, Zhang Y (2016) Scene text detection based on multi-scale swt and edge filtering. In: 2016 23rd International conference on pattern recognition (ICPR). IEEE, pp 645–650

  16. Frank E (2014) Fully supervised training of Gaussian radial basis function networks in weka

  17. Freund Y, Schapire RE (1999) Large margin classification using the perceptron algorithm. Mach Learn 37(3):277–296

    Article  MATH  Google Scholar 

  18. Friedman N, Geiger D, Goldszmidt M (1997) Bayesian network classifiers. Mach Learn 29(2–3):131–163

    Article  MATH  Google Scholar 

  19. Genkin A, Lewis DD, Madigan D (2007) Large-scale bayesian logistic regression for text categorization. Technometrics 49(3):291–304

    Article  MathSciNet  Google Scholar 

  20. Ghanei S, Faez K (2017) A robust approach for scene text localization using rule-based confidence map and grouping. Int J Pattern Recogn Artif Intell 31(03):1753002

    Article  Google Scholar 

  21. Ghoshal R, Roy A, Dhara BC, Parui SK (2015) Text extraction from scene images through local binary pattern and business features based color image segmentation. In: Information systems design and intelligent applications. Springer, pp 475–482

  22. Gllavata J, Ewerth R, Freisleben B (2004) Text detection in images based on unsupervised classification of high-frequency wavelet coefficients. In: Proceedings of the 17th International conference on pattern recognition, 2004. ICPR 2004, vol 1. IEEE, pp 425–428

  23. Gonzalez A, Bergasa LM, Yebes JJ, Bronte S (2012) Text location in complex images. In: 2012 21st International conference on pattern recognition (ICPR). IEEE, pp 617–620

  24. Guan L, Chu J (2017) Natural scene text detection based on swt, mser and candidate classification, IEEE

  25. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. ACM SIGKDD Explor Newslett 11(1):10–18

    Article  Google Scholar 

  26. Hanif SM, Prevost L (2009) Text detection and localization in complex scene images using constrained adaboost algorithm. In: 10th International conference on document analysis and recognition, 2009. ICDAR’09. IEEE, pp 1–5

  27. He T, Huang W, Qiao Y, Yao J (2016) Text-attentional convolutional neural network for scene text detection. IEEE Trans Image Process 25(6):2529–2541

    Article  MathSciNet  MATH  Google Scholar 

  28. Heckerman D, Geiger D, Chickering DM (1995) Learning bayesian networks: the combination of knowledge and statistical data. Mach Learn 20(3):197–243

    MATH  Google Scholar 

  29. Iqbal K, Yin XC, Yin X, Ali H, Hao HW (2013) Classifier comparison for mser-based text classification in scene images. In: The 2013 international joint conference on neural networks (IJCNN). IEEE, pp 1–6

  30. Jaderberg M, Simonyan K, Vedaldi A, Zisserman A (2014) Deep structured output learning for unconstrained text recognition. arXiv:14125903

  31. Jian H, Xiaopei L, Qian Z (2016) A swt verified method of natural scene text location. In: 2016 International symposium on computer, consumer and control (IS3C). IEEE, pp 980–984

  32. Jiang R, Qi F, Xu L, Wu G (2006) Using connected-components’ features to detect and segment text. J Image Graph 11:1653–1656

    Google Scholar 

  33. Jiang M, Cheng J, Chen M, Ku X (2018) An improved text localization method for natural scene images. In: Journal of physics: conference series, vol 960. IOP Publishing, p 012027

  34. Joan SF, Valli S (2017) An enhanced text detection technique for the visually impaired to read text. Inf Syst Front 19(5):1039–1056

    Article  Google Scholar 

  35. Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recog 37(5):977–997

    Article  Google Scholar 

  36. Karatzas D, Shafait F, Uchida S, Iwamura M, i Bigorda LG, Mestre SR, Mas J, Mota DF, Almazan JA, de las Heras LP (2013) Icdar 2013 robust reading competition. In: 2013 12th International conference on document analysis and recognition (ICDAR). IEEE, pp 1484–1493

  37. Kim HK, Kim M (2016) Model-induced term-weighting schemes for text classification. Appl Intell 45(1):30–43

    Article  Google Scholar 

  38. Kim KI, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans Pattern Anal Mach Intell 25(12):1631–1639

    Article  Google Scholar 

  39. Klein DA, Frintrop S (2011) Center-surround divergence of feature statistics for salient object detection. In: 2011 IEEE International conference on computer vision (ICCV). IEEE, pp 2214–2219

  40. Koo HI, Kim DH (2013) Scene text detection via connected component clustering and nontext filtering. IEEE Trans Image Process 22(6):2296–2305

    Article  MathSciNet  MATH  Google Scholar 

  41. Lee S, Cho MS, Jung K, Kim JH (2010) Scene text extraction with edge constraint and text collinearity. In: 2010 20th International conference on pattern recognition (ICPR). IEEE, pp 3983–3986

  42. Lee JJ, Lee PH, Lee SW, Yuille A, Koch C (2011) Adaboost for text detection in natural scene. In: 2011 International conference on document analysis and recognition (ICDAR). IEEE, pp 429–434

  43. Li M, Wang C (2008) An adaptive text detection approach in images and video frames. In: IEEE International joint conference on neural networks 2008 IJCNN 2008 IEEE World congress on computational intelligence. IEEE, pp 72–77

  44. Li Y, Lu H (2012) Scene text detection via stroke width. In: 2012 21st International conference on pattern recognition (ICPR). IEEE, pp 681–684

  45. Li Y, Jia W, Shen C, van den Hengel A (2014) Characterness: an indicator of text in the wild. IEEE Trans Image Process 23(4):1666–1677

    Article  MathSciNet  MATH  Google Scholar 

  46. Liu X, Samarabandu J (2006) Multiscale edge-based text extraction from complex images. In: 2006 IEEE International conference on multimedia and expo. IEEE, pp 1721–1724

  47. Liu J, Su H, Yi Y, Hu W (2016) Robust text detection via multi-degree of sharpening and blurring. Signal Process 124:259–265

    Article  Google Scholar 

  48. Lucas SM (2005) Icdar 2005 text locating competition results. In: 2005 Proceedings. Eighth international conference on document analysis and recognition. IEEE, pp 80–84

  49. Lucas SM, Panaretos A, Sosa L, Tang A, Wong S, Young R (2003) Icdar 2003 robust reading competitions. In: ICDAR, vol 2003. Citeseer, p 682

  50. Ma L, Wang C, Xiao B (2010) Text detection in natural images based on multi-scale edge detetion and classification. In: 2010 3rd International congress on image and signal processing (CISP), vol 4. IEEE, pp 1961–1965

  51. Majtey A, Lamberti P, Prato D (2005) Jensen-shannon divergence as a measure of distinguishability between mixed quantum states. Phys Rev A 72(5):052310

    Article  Google Scholar 

  52. Maruyama M, Yamaguchi T (2009) Extraction of characters on signboards in natural scene images by stump classifiers. In: 10th International conference on document analysis and recognition, 2009. ICDAR’09. IEEE, pp 1365–1369

  53. McCallum A, Nigam K, et al. (1998) A comparison of event models for naive bayes text classification. In: AAAI-98 workshop on learning for text categorization, vol 752. Citeseer, pp 41–48

  54. Misra C, Swain P, Mantri J (2012) Text extraction and recognition from image using neural network. Int J Comput Appl 40(2):13–19

    Google Scholar 

  55. Mukhopadhyay A, Kumar S, Chowdhury SR, Chakraborty N, Mollah AF, Basu S, Sarkar R (2019) Multi-lingual scene text detection using one-class classifier. Int J Comput Vis Image Process (IJCVIP) 9(2):48–65

    Article  Google Scholar 

  56. Neumann L, Matas J (2010) A method for text localization and recognition in real-world images. In: Asian conference on computer vision. Springer, pp 770–783

  57. Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: 2012 IEEE Conference on computer vision and pattern recognition (CVPR). IEEE, pp 3538–3545

  58. Nguyen K, Thanh ND (2016) Scene text detection based on structural features. In: 2016 International conference on computer, control, informatics and its applications (IC3INA). IEEE, pp 48–53

  59. Ou Ww, Zhu J, Liu C (2004) Text location in natural scene. J Chin Inf Process 5:006

    Google Scholar 

  60. Pan YF, Hou X, Liu CL (2011) A hybrid approach to detect and localize texts in natural scene images. IEEE Trans Image Process 20(3):800–813

    Article  MathSciNet  MATH  Google Scholar 

  61. Pan YF, Liu CL, Hou X (2010) Fast scene text localization by learning-based filtering and verification. In: 2010 17th IEEE International conference on image processing (ICIP). IEEE, pp 2269–2272

  62. Panda M, Abraham A, Patra MR (2010) Discriminative multinomial naive bayes for network intrusion detection. In: 2010 Sixth International conference on information assurance and security (IAS). IEEE, pp 5–10

  63. Pietikainen M, Okun O (2001) Edge-based method for text detection from complex document images. In: 2001 Proceedings. Sixth international conference on document analysis and recognition. IEEE, pp 286–291

  64. Platt JC (1999) 12 fast training of support vector machines using sequential minimal optimization. Adv Kernel Methods, 185–208

  65. Rish I, et al. (2001) An empirical study of the naive bayes classifier. In: IJCAI 2001 workshop on empirical methods in artificial intelligence, vol 3, pp 41–46

  66. Salvithal NN, Kulkarni R (2013) Evaluating performance of data mining classification algorithm in weka

  67. Shahab A, Shafait F, Dengel A (2011) Icdar 2011 robust reading competition challenge 2: reading text in scene images. In: 2011 International conference on document analysis and recognition (ICDAR). IEEE, pp 1491–1496

  68. Shi H, Liu Y (2011) Naïve bayes vs. support vector machine: resilience to missing data. In: International conference on artificial intelligence and computational intelligence. Springer, pp 680–687

  69. Shi C, Wang C, Xiao B, Zhang Y, Gao S (2013) Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn Lett 34(2):107–116

    Article  Google Scholar 

  70. Singh A, Halgamuge MN, Lakshmiganthan R (2017) Impact of different data types on classifier performance of random forest, naive bayes, and k-nearest neighbors algorithms. Int J Adv Comput Sci Appl 8(12):1–10

    Google Scholar 

  71. Soni R, Kumar B, Chand S (2018) Text detection and localization in natural scene images based on text awareness score. Appl Intell, 1–30

  72. Sun L, Huo Q, Jia W, Chen K (2015) A robust approach for text detection from natural scene images. Pattern Recogn 48(9):2906–2920

    Article  Google Scholar 

  73. Šarić M (2017) Scene text segmentation using low variation extremal regions and sorting based character grouping. Neurocomputing 266:56–65

    Article  Google Scholar 

  74. Tang B, Kay S, He H (2016) Toward optimal feature selection in naive bayes for text categorization. arXiv:160202850

  75. Unar S, Hussain A, Shaikh M, Memon KH, Ansari MA, Memon Z (2018) A study on text detection and localization techniques for natural scene images. IJCSNS 18(1):100

    Google Scholar 

  76. Wang K, Belongie S (2010) Word spotting in the wild. In: European conference on computer vision. Springer, pp 591–604

  77. Wang Q, Lu Y, Sun S (2015) Text detection in nature scene images using two-stage nontext filtering. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 106–110

  78. Wei Y, Zhang Z, Shen W, Zeng D, Fang M, Zhou S (2017) Text detection in scene images based on exhaustive segmentation. Signal Process Image Commun 50:1–8

    Article  Google Scholar 

  79. Wu H, Zou B, Yq Zhao, Guo J (2017) Scene text detection using adaptive color reduction, adjacent character model and hybrid verification strategy. Vis Comput 33(1):113–126

    Article  Google Scholar 

  80. Xiang ZL, Yu XR, Kang DK (2016) Experimental analysis of naïve bayes classifier based on an attribute weighting framework with smooth kernel density estimations. Appl Intell 44(3):611–620

    Article  Google Scholar 

  81. Xu S, Krauthammer M (2010) A new pivoting and iterative text detection algorithm for biomedical images. J Biomed Inform 43(6):924–931

    Article  Google Scholar 

  82. Xu S, Li Y, Wang Z (2017) Bayesian multinomial naïve bayes classifier to text classification. In: Advanced multimedia and ubiquitous engineering. Springer, pp 347–352

  83. Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: 2012 IEEE Conference on computer vision and pattern recognition (CVPR). IEEE, pp 1083–1090

  84. Ye Q, Doermann D (2015) Text detection and recognition in imagery: a survey. IEEE Trans Pattern Anal Mach Intell 37(7):1480–1500

    Article  Google Scholar 

  85. Ye Q, Jiao J, Huang J, Yu H (2007) Text detection and restoration in natural scene images. J Vis Commun Image Represent 18(6):504–513

    Article  Google Scholar 

  86. Yi C, Tian Y (2011) Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans Image Process 20(9):2594–2605

    Article  MathSciNet  MATH  Google Scholar 

  87. Yu C, Song Y, Zhang Y (2016) Scene text localization using edge analysis and feature pool. Neurocomputing 175:652–661

    Article  Google Scholar 

  88. Zhang J, Kasturi R (2010) Text detection using edge gradient and graph spectrum. In: 2010 20th International conference on pattern recognition (ICPR). IEEE, pp 3979–3982

  89. Zhang H, Zhao K, Song YZ, Guo J (2013) Text extraction from natural scene image: a survey. Neurocomputing 122:310–323

    Article  Google Scholar 

  90. Zhang G, Kai H, Zhang B, Fu H, ZHAO J (2017) A natural scene text extraction method based on the maximum stable extremal region and stroke width transform. J Xi’an Jiaotong Univ 1:021

    Google Scholar 

  91. Zhou G, Liu Y, Meng Q, Zhang Y (2011) Detecting multilingual text in natural scene. In: 2011 1st International symposium on access spaces (ISAS). IEEE, pp 116–120

  92. Zhu A, Wang G, Dong Y (2015) Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification. Pattern Recogn Lett 67:153–162

    Article  Google Scholar 

  93. Zhu Y, Huang X, Huang Q, Tian Q (2016) Large-scale video copy retrieval with temporal-concentration sift. Neurocomputing 187:83–91

    Article  Google Scholar 

  94. Zhu Y, Yao C, Bai X (2016) Scene text detection and recognition: recent advances and future trends. Front Comput Sci 10(1):19–36

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by UPE-II, Jawaharlal University, New Delhi, India.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rituraj Soni.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Soni, R., Kumar, B. & Chand, S. Optimal feature and classifier selection for text region classification in natural scene images using Weka tool. Multimed Tools Appl 78, 31757–31791 (2019). https://doi.org/10.1007/s11042-019-07998-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-019-07998-z

Keywords

Navigation