Abstract
The problem of text detection and localization in scene images has always been challenging for the researchers over the years due to diversities present in these images. This diversity includes variation in fonts, size, color, different backgrounds, etc. The textual content in such images can be helpful for humans in many different domains like visually impaired people, scene understanding, intelligent navigation, etc. The natural scene contains some non-text objects along with relevant text objects, and it is necessary to classify them appropriately & accurately to increase the performance of the detection and localization method. The classification of text regions in scene images depends on the selection of optimal features and optimal classifier. This work contributes to finding both the optimal feature set and the optimal classifier with the help of weka tool. In this paper, first, we detect the possible text regions with the help of the improved MSER algorithm; then, we extract 11 features on these potential text regions. From these 11 features, we choose an optimal feature set for discrimination between text and non-text components with the help of the CfsSubsetEval and BFS parameter of the Weka Tool. We trained several classifiers using these optimal features with the help of Weka tool on the ICDAR 2013 training set. The performance of these classifiers is compared empirically based on the classification accuracy obtained using Weka tool. Based on this empirical estimation, Naive Bayes Classifier with the highest accuracy of 92.5% is proposed as an optimal choice for classification purpose.
Similar content being viewed by others
Notes
The term ’region(s) and component(s) are used interchangeably in this paper
References
Amancio DR, Comin CH, Casanova D, Travieso G, Bruno OM, Rodrigues FA, da Fontoura Costa L (2014) A systematic comparison of supervised classifiers. PloS one 9(4):e94137
Ansari GJ, Shah JH, Yasmin M, Sharif M, Fernandes SL (2018) A novel machine learning approach for scene text extraction. Future Generation Computer Systems
Baran R, Partila P, Wilk R (2018) Automated text detection and character recognition in natural scenes based on local image features and contour processing techniques. In: International conference on intelligent human systems integration. Springer, pp 42–48
Bušta M, Neumann L, Matas J (2017) Deep textspotter: an end-to-end trainable scene text localization and recognition framework. In: IEEE International conference on computer vision (ICCV). Venice, pp 22–29
Chang CC, Lin CJ (2011) Libsvm: a library for support vector machines. ACM Trans Intell Syst Technol (TIST) 2(3):27
Chen X, Yuille AL (2004) Detecting and reading text in natural scenes. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004, vol 2. IEEE, pp II–II
Chen H, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B (2011) Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 2011 18th IEEE International conference on image processing (ICIP). IEEE, pp 2609–2612
da Silva BLS, Ciarelli PM (2016) Edge detection and confidence map applied to identify textual elements in images
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE Computer society conference on computer vision and pattern recognition, 2005. CVPR 2005, vol 1. IEEE, pp 886–893
Dash JK, Mukhopadhyay S (2018) Similarity learning for texture image retrieval using multiple classifier system. Multimed Tools Appl 77(1):459–483
Dash JK, Mukhopadhyay S, Gupta RD (2017) Multiple classifier system using classification confidence for texture classification. Multimed Tools Appl 76(2):2535–2556
Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Conference on computer vision and pattern recognition (CVPR). IEEE, pp 2963–2970
Extremal MS, Matas J, Chum O, Urban M, Pajdla T (2002) Robust wide baseline stereo from in. In: British machine vision conference. Citeseer
Fabrizio J, Robert-Seidowsky M, Dubuisson S, Calarasanu S, Boissel R (2016) Textcatcher: a method to detect curved and challenging text in natural scenes. Int J Doc Anal Recogn (IJDAR) 19(2):99–117
Feng Y, Song Y, Zhang Y (2016) Scene text detection based on multi-scale swt and edge filtering. In: 2016 23rd International conference on pattern recognition (ICPR). IEEE, pp 645–650
Frank E (2014) Fully supervised training of Gaussian radial basis function networks in weka
Freund Y, Schapire RE (1999) Large margin classification using the perceptron algorithm. Mach Learn 37(3):277–296
Friedman N, Geiger D, Goldszmidt M (1997) Bayesian network classifiers. Mach Learn 29(2–3):131–163
Genkin A, Lewis DD, Madigan D (2007) Large-scale bayesian logistic regression for text categorization. Technometrics 49(3):291–304
Ghanei S, Faez K (2017) A robust approach for scene text localization using rule-based confidence map and grouping. Int J Pattern Recogn Artif Intell 31(03):1753002
Ghoshal R, Roy A, Dhara BC, Parui SK (2015) Text extraction from scene images through local binary pattern and business features based color image segmentation. In: Information systems design and intelligent applications. Springer, pp 475–482
Gllavata J, Ewerth R, Freisleben B (2004) Text detection in images based on unsupervised classification of high-frequency wavelet coefficients. In: Proceedings of the 17th International conference on pattern recognition, 2004. ICPR 2004, vol 1. IEEE, pp 425–428
Gonzalez A, Bergasa LM, Yebes JJ, Bronte S (2012) Text location in complex images. In: 2012 21st International conference on pattern recognition (ICPR). IEEE, pp 617–620
Guan L, Chu J (2017) Natural scene text detection based on swt, mser and candidate classification, IEEE
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. ACM SIGKDD Explor Newslett 11(1):10–18
Hanif SM, Prevost L (2009) Text detection and localization in complex scene images using constrained adaboost algorithm. In: 10th International conference on document analysis and recognition, 2009. ICDAR’09. IEEE, pp 1–5
He T, Huang W, Qiao Y, Yao J (2016) Text-attentional convolutional neural network for scene text detection. IEEE Trans Image Process 25(6):2529–2541
Heckerman D, Geiger D, Chickering DM (1995) Learning bayesian networks: the combination of knowledge and statistical data. Mach Learn 20(3):197–243
Iqbal K, Yin XC, Yin X, Ali H, Hao HW (2013) Classifier comparison for mser-based text classification in scene images. In: The 2013 international joint conference on neural networks (IJCNN). IEEE, pp 1–6
Jaderberg M, Simonyan K, Vedaldi A, Zisserman A (2014) Deep structured output learning for unconstrained text recognition. arXiv:14125903
Jian H, Xiaopei L, Qian Z (2016) A swt verified method of natural scene text location. In: 2016 International symposium on computer, consumer and control (IS3C). IEEE, pp 980–984
Jiang R, Qi F, Xu L, Wu G (2006) Using connected-components’ features to detect and segment text. J Image Graph 11:1653–1656
Jiang M, Cheng J, Chen M, Ku X (2018) An improved text localization method for natural scene images. In: Journal of physics: conference series, vol 960. IOP Publishing, p 012027
Joan SF, Valli S (2017) An enhanced text detection technique for the visually impaired to read text. Inf Syst Front 19(5):1039–1056
Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recog 37(5):977–997
Karatzas D, Shafait F, Uchida S, Iwamura M, i Bigorda LG, Mestre SR, Mas J, Mota DF, Almazan JA, de las Heras LP (2013) Icdar 2013 robust reading competition. In: 2013 12th International conference on document analysis and recognition (ICDAR). IEEE, pp 1484–1493
Kim HK, Kim M (2016) Model-induced term-weighting schemes for text classification. Appl Intell 45(1):30–43
Kim KI, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans Pattern Anal Mach Intell 25(12):1631–1639
Klein DA, Frintrop S (2011) Center-surround divergence of feature statistics for salient object detection. In: 2011 IEEE International conference on computer vision (ICCV). IEEE, pp 2214–2219
Koo HI, Kim DH (2013) Scene text detection via connected component clustering and nontext filtering. IEEE Trans Image Process 22(6):2296–2305
Lee S, Cho MS, Jung K, Kim JH (2010) Scene text extraction with edge constraint and text collinearity. In: 2010 20th International conference on pattern recognition (ICPR). IEEE, pp 3983–3986
Lee JJ, Lee PH, Lee SW, Yuille A, Koch C (2011) Adaboost for text detection in natural scene. In: 2011 International conference on document analysis and recognition (ICDAR). IEEE, pp 429–434
Li M, Wang C (2008) An adaptive text detection approach in images and video frames. In: IEEE International joint conference on neural networks 2008 IJCNN 2008 IEEE World congress on computational intelligence. IEEE, pp 72–77
Li Y, Lu H (2012) Scene text detection via stroke width. In: 2012 21st International conference on pattern recognition (ICPR). IEEE, pp 681–684
Li Y, Jia W, Shen C, van den Hengel A (2014) Characterness: an indicator of text in the wild. IEEE Trans Image Process 23(4):1666–1677
Liu X, Samarabandu J (2006) Multiscale edge-based text extraction from complex images. In: 2006 IEEE International conference on multimedia and expo. IEEE, pp 1721–1724
Liu J, Su H, Yi Y, Hu W (2016) Robust text detection via multi-degree of sharpening and blurring. Signal Process 124:259–265
Lucas SM (2005) Icdar 2005 text locating competition results. In: 2005 Proceedings. Eighth international conference on document analysis and recognition. IEEE, pp 80–84
Lucas SM, Panaretos A, Sosa L, Tang A, Wong S, Young R (2003) Icdar 2003 robust reading competitions. In: ICDAR, vol 2003. Citeseer, p 682
Ma L, Wang C, Xiao B (2010) Text detection in natural images based on multi-scale edge detetion and classification. In: 2010 3rd International congress on image and signal processing (CISP), vol 4. IEEE, pp 1961–1965
Majtey A, Lamberti P, Prato D (2005) Jensen-shannon divergence as a measure of distinguishability between mixed quantum states. Phys Rev A 72(5):052310
Maruyama M, Yamaguchi T (2009) Extraction of characters on signboards in natural scene images by stump classifiers. In: 10th International conference on document analysis and recognition, 2009. ICDAR’09. IEEE, pp 1365–1369
McCallum A, Nigam K, et al. (1998) A comparison of event models for naive bayes text classification. In: AAAI-98 workshop on learning for text categorization, vol 752. Citeseer, pp 41–48
Misra C, Swain P, Mantri J (2012) Text extraction and recognition from image using neural network. Int J Comput Appl 40(2):13–19
Mukhopadhyay A, Kumar S, Chowdhury SR, Chakraborty N, Mollah AF, Basu S, Sarkar R (2019) Multi-lingual scene text detection using one-class classifier. Int J Comput Vis Image Process (IJCVIP) 9(2):48–65
Neumann L, Matas J (2010) A method for text localization and recognition in real-world images. In: Asian conference on computer vision. Springer, pp 770–783
Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: 2012 IEEE Conference on computer vision and pattern recognition (CVPR). IEEE, pp 3538–3545
Nguyen K, Thanh ND (2016) Scene text detection based on structural features. In: 2016 International conference on computer, control, informatics and its applications (IC3INA). IEEE, pp 48–53
Ou Ww, Zhu J, Liu C (2004) Text location in natural scene. J Chin Inf Process 5:006
Pan YF, Hou X, Liu CL (2011) A hybrid approach to detect and localize texts in natural scene images. IEEE Trans Image Process 20(3):800–813
Pan YF, Liu CL, Hou X (2010) Fast scene text localization by learning-based filtering and verification. In: 2010 17th IEEE International conference on image processing (ICIP). IEEE, pp 2269–2272
Panda M, Abraham A, Patra MR (2010) Discriminative multinomial naive bayes for network intrusion detection. In: 2010 Sixth International conference on information assurance and security (IAS). IEEE, pp 5–10
Pietikainen M, Okun O (2001) Edge-based method for text detection from complex document images. In: 2001 Proceedings. Sixth international conference on document analysis and recognition. IEEE, pp 286–291
Platt JC (1999) 12 fast training of support vector machines using sequential minimal optimization. Adv Kernel Methods, 185–208
Rish I, et al. (2001) An empirical study of the naive bayes classifier. In: IJCAI 2001 workshop on empirical methods in artificial intelligence, vol 3, pp 41–46
Salvithal NN, Kulkarni R (2013) Evaluating performance of data mining classification algorithm in weka
Shahab A, Shafait F, Dengel A (2011) Icdar 2011 robust reading competition challenge 2: reading text in scene images. In: 2011 International conference on document analysis and recognition (ICDAR). IEEE, pp 1491–1496
Shi H, Liu Y (2011) Naïve bayes vs. support vector machine: resilience to missing data. In: International conference on artificial intelligence and computational intelligence. Springer, pp 680–687
Shi C, Wang C, Xiao B, Zhang Y, Gao S (2013) Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn Lett 34(2):107–116
Singh A, Halgamuge MN, Lakshmiganthan R (2017) Impact of different data types on classifier performance of random forest, naive bayes, and k-nearest neighbors algorithms. Int J Adv Comput Sci Appl 8(12):1–10
Soni R, Kumar B, Chand S (2018) Text detection and localization in natural scene images based on text awareness score. Appl Intell, 1–30
Sun L, Huo Q, Jia W, Chen K (2015) A robust approach for text detection from natural scene images. Pattern Recogn 48(9):2906–2920
Šarić M (2017) Scene text segmentation using low variation extremal regions and sorting based character grouping. Neurocomputing 266:56–65
Tang B, Kay S, He H (2016) Toward optimal feature selection in naive bayes for text categorization. arXiv:160202850
Unar S, Hussain A, Shaikh M, Memon KH, Ansari MA, Memon Z (2018) A study on text detection and localization techniques for natural scene images. IJCSNS 18(1):100
Wang K, Belongie S (2010) Word spotting in the wild. In: European conference on computer vision. Springer, pp 591–604
Wang Q, Lu Y, Sun S (2015) Text detection in nature scene images using two-stage nontext filtering. In: 2015 13th International conference on document analysis and recognition (ICDAR). IEEE, pp 106–110
Wei Y, Zhang Z, Shen W, Zeng D, Fang M, Zhou S (2017) Text detection in scene images based on exhaustive segmentation. Signal Process Image Commun 50:1–8
Wu H, Zou B, Yq Zhao, Guo J (2017) Scene text detection using adaptive color reduction, adjacent character model and hybrid verification strategy. Vis Comput 33(1):113–126
Xiang ZL, Yu XR, Kang DK (2016) Experimental analysis of naïve bayes classifier based on an attribute weighting framework with smooth kernel density estimations. Appl Intell 44(3):611–620
Xu S, Krauthammer M (2010) A new pivoting and iterative text detection algorithm for biomedical images. J Biomed Inform 43(6):924–931
Xu S, Li Y, Wang Z (2017) Bayesian multinomial naïve bayes classifier to text classification. In: Advanced multimedia and ubiquitous engineering. Springer, pp 347–352
Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: 2012 IEEE Conference on computer vision and pattern recognition (CVPR). IEEE, pp 1083–1090
Ye Q, Doermann D (2015) Text detection and recognition in imagery: a survey. IEEE Trans Pattern Anal Mach Intell 37(7):1480–1500
Ye Q, Jiao J, Huang J, Yu H (2007) Text detection and restoration in natural scene images. J Vis Commun Image Represent 18(6):504–513
Yi C, Tian Y (2011) Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans Image Process 20(9):2594–2605
Yu C, Song Y, Zhang Y (2016) Scene text localization using edge analysis and feature pool. Neurocomputing 175:652–661
Zhang J, Kasturi R (2010) Text detection using edge gradient and graph spectrum. In: 2010 20th International conference on pattern recognition (ICPR). IEEE, pp 3979–3982
Zhang H, Zhao K, Song YZ, Guo J (2013) Text extraction from natural scene image: a survey. Neurocomputing 122:310–323
Zhang G, Kai H, Zhang B, Fu H, ZHAO J (2017) A natural scene text extraction method based on the maximum stable extremal region and stroke width transform. J Xi’an Jiaotong Univ 1:021
Zhou G, Liu Y, Meng Q, Zhang Y (2011) Detecting multilingual text in natural scene. In: 2011 1st International symposium on access spaces (ISAS). IEEE, pp 116–120
Zhu A, Wang G, Dong Y (2015) Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification. Pattern Recogn Lett 67:153–162
Zhu Y, Huang X, Huang Q, Tian Q (2016) Large-scale video copy retrieval with temporal-concentration sift. Neurocomputing 187:83–91
Zhu Y, Yao C, Bai X (2016) Scene text detection and recognition: recent advances and future trends. Front Comput Sci 10(1):19–36
Acknowledgements
This work is supported by UPE-II, Jawaharlal University, New Delhi, India.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Soni, R., Kumar, B. & Chand, S. Optimal feature and classifier selection for text region classification in natural scene images using Weka tool. Multimed Tools Appl 78, 31757–31791 (2019). https://doi.org/10.1007/s11042-019-07998-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-07998-z