Abstract
Feature selection plays a key role to reduce the high-dimensionality of feature space in machine learning applications by discarding irrelevant and redundant features with the aim of obtaining a subset of features that accurately describe a given problem with a minimum or no degradation of performance. In this paper, a Memetic Algorithm (MA) based Wrapper-filter feature selection framework is proposed for the recognition of handwritten Bangla city names. For evaluating the MA framework, a recently published feature extraction technique, reported in [1], is used for the said pattern recognition problem. Experimentation is conducted on an in-house dataset of 6000 words written in Bangla script. Here, 40 most popular city names of West Bengal, a state in India, have been considered to prepare the dataset. Proposed technique not only reduces the feature dimension, but also enhances the performance of the word recognition technique significantly.
This is a preview of subscription content, log in via an institution.
References
Barua, S., Malakar, S., Bhowmik, S., Sarkar, R., Nasipuri, M.: Bangla handwritten city name recognition using gradient based feature. In: 5th International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA) (2016)
Narendra, P.M., Fukunaga, K.: A branch and bound algorithm for feature subset selection. IEEE Trans. Comput. 26(9), 917–922 (1977)
Chen, X.W.: An improved branch and bound algorithm for feature selection. Pattern Recogn. Lett. 24(12), 1925–1933 (2003)
Raymer, M.L., Punch, W.F., Goodman, E.D., Kuhn, L.A., Jain, A.K.: Dimensionality reduction using genetic algorithms. IEEE Trans. Evol. Comput. 4(2), 164–171 (2000)
Oh, I.S., Lee, J.S., Moon, B.R.: Hybrid genetic algorithms for feature selection. IEEE Trans. Pattern Anal. Mach. Intell. 26(11), 1424–1437 (2004)
Pudil, P., Novovičová, J., Kittler, J.: Floating search methods in feature selection. Pattern Recogn. Lett. 15(11), 1119–1125 (1994)
Chandrashekar, G., Sahin, F.: A survey on feature selection methods. Comput. Electr. Eng. 40(1), 16–28 (2014)
Valdez, F., Melin, P., Castillo, O.: A survey on nature-inspired optimization algorithms with fuzzy logic for dynamic parameter adaptation. Expert Syst. Appl. 41(14), 6459–6466 (2014)
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3((Mar)), 1157–1182 (2003)
Kotsiantis, S.: Feature selection for machine learning classification problems: a recent overview. Artif. Intell. Rev., 1–20 (2011)
Law, M.H., Figueiredo, M.A., Jain, A.K.: Simultaneous feature selection and clustering using mixture models. IEEE Trans. Pattern Anal. Mach. Intell. 26(9), 1154–1166 (2004)
Sánchez-Maroño, N., Alonso-Betanzos, A., Tombilla-Sanromán, M.: Filter methods for feature selection – a comparative study. In: Yin, H., Tino, P., Corchado, E., Byrne, W., Yao, X. (eds.) IDEAL 2007. LNCS, vol. 4881, pp. 178–187. Springer, Heidelberg (2007). doi:10.1007/978-3-540-77226-2_19
Xue, B., Zhang, M., Browne, W.N.: Particle swarm optimization for feature selection in classification: a multi-objective approach. IEEE Trans. Cybern. 43(6), 1656–1671 (2013)
García-Pedrajas, N., de Haro-García, A., Pérez-Rodríguez, J.: A scalable memetic algorithm for simultaneous instance and feature selection. Evol. Comput. 22(1), 1–45 (2014)
Montazeri, M., Montazeri, M., Naji, H.R., Faraahi, A.: A novel memetic feature selection algorithm. In: 5th Conference on Information and Knowledge Technology (IKT), pp. 295–300. IEEE Press, New York (2013)
Yang, C.S., Chuang, L.Y., Chen, Y.J., Yang, C.H.: Feature selection using memetic algorithms. In: Third International Conference on Convergence and Hybrid Information Technology (ICCIT 2008), vol. 1, pp. 416–423. IEEE Press, New York (2008)
Cateni, S., Colla, V., Vannucci, M.: A hybrid feature selection method for classification purposes. In: European Modelling Symposium (EMS), pp. 39–44. IEEE Press, New York (2014)
Zhu, Z., Ong, Y.S., Dash, M.: Wrapper–filter feature selection algorithm using a memetic framework. IEEE Trans. Syst. Man Cybern. Part B (Cybernetics) 37(1), 70–76 (2007)
Roy, K., Vajda, S., Pal, U., Chaudhuri, B.B.: A system towards Indian postal automation. In: Ninth International Workshop on Frontiers in Handwriting Recognition (IWFHR-9), pp. 580–585. IEEE Press, New York (2004)
Bhowmik, S., Polley, S., Roushan, M.G., Malakar, S., Sarkar, R., Nasipuri, M.: A holistic word recognition technique for handwritten Bangla words. Int. J. Appl. Pattern Recogn. 2(2), 142–159 (2015)
Bhowmik, S., Malakar, S., Sarkar, R., Nasipuri, M.: Handwritten bangla word recognition using elliptical features. In: International Conference on Computational Intelligence and Communication Networks (CICN), pp. 257–261. IEEE Press, New York (2014)
Bhowmik, S., Roushan, M.G., Sarkar, R., Nasipuri, M., Polley, S., Malakar, S.: Handwritten bangla word recognition using hog descriptor. In: Fourth International Conference of Emerging Applications of Information Technology (EAIT), pp. 193–197. IEEE Press, New York (2014)
Frinken, V., Fischer, A., Manmatha, R., Bunke, H.: A novel word spotting method based on recurrent neural networks. IEEE Trans. Pattern Anal. Mach. Intell. 34(2), 211–224 (2012)
Doermann, D.: The indexing and retrieval of document images: a survey. Comput. Vis. Image Underst. 70(3), 287–298 (1998)
Manmatha, R., Han, C., Riseman, E.M.: Word spotting: a new approach to indexing handwriting. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 631–637. IEEE Press, New York (1996)
Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: A hierarchical approach to recognition of handwritten Bangla characters. Pattern Recogn. 42(7), 1467–1484 (2009)
Ishidera, E., Lucas, S.M., Downton, A.C.: Top-down likelihood word image generation model for holistic word recognition. In: Lopresti, D., Hu, J., Kashi, R. (eds.) DAS 2002. LNCS, vol. 2423, pp. 82–94. Springer, Heidelberg (2002). doi:10.1007/3-540-45869-7_11
Acharyya, A., Rakshit, S., Sarkar, R., Basu, S., Nasipuri, M.: Handwritten word recognition using MLP based classifier: a holistic approach. Int. J. Comput. Sci. Issues 10(2), 422–427 (2013)
Malakar, S., Sharma, P., Singh, P.K., Das, M., Sarkar, R., Nasipuri, M.: A holistic approach for handwritten hindi word recognition. Int. J. Comput. Vision Image Process. (IJCVIP) 7(1), 59–78 (2017)
AlKhateeb, J.H., Pauplin, O., Ren, J., Jiang, J.: Performance of hidden Markov model and dynamic Bayesian network classifiers on handwritten Arabic word recognition. Knowl.-Based Syst. 24(5), 680–688 (2011)
Gao, J., Li, M., Wu, A., Huang, C.N.: Chinese word segmentation and named entity recognition: a pragmatic approach. Computational Linguistics 31(4), 531–574 (2005)
Sarkar, R., Malakar, S., Das, N., Basu, S., Kundu, M., Nasipuri, M.: Word extraction and character segmentation from text lines of unconstrained handwritten Bangla document images. J. Intell. Syst. 20(3), 227–260 (2011)
Pal, U., Roy, K., Kimura, F.: A lexicon-driven handwritten city-name recognition scheme for Indian postal automation. IEICE Trans. Inf. Syst. 92(5), 1146–1158 (2009)
Pal, U., Roy, R.K., Kimura, F.: Bangla and English city name recognition for Indian postal automation. In: 20th International Conference on Pattern Recognition (ICPR), pp. 1985–1988. IEEE Press, New York (2010)
Pal, U., Roy, R.K., Kimura, F.: Handwritten street name recognition for Indian postal automation. In: International Conference on Document Analysis and Recognition (ICDAR), pp. 483–487. IEEE Press, New York (2011)
Roy, P.P., Bhunia, A.K., Das, A., Dey, P., Pal, U.: HMM-based Indic handwritten word recognition using zone segmentation. Pattern Recogn. 60, 1057–1075 (2016)
Pal, U., Roy, R.K., Kimura, F.: Multi-lingual city name recognition for Indian postal automation. In: International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 169–173. IEEE Press, New York (2012)
Vajda, S., Roy, K., Pal, U., Chaudhuri, B.B., Belaid, A.: Automation of Indian postal documents written in Bangla and English. Int. J. Pattern Recognit Artif Intell. 23(08), 1599–1632 (2009)
Roy, P.P., Dey, P., Roy, S., Pal, U., Kimura, F.: A novel approach of Bangla handwritten text recognition using HMM. In: 14th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 661–666. IEEE Press, New York (2014)
Joarder, M.M.A., Mahmud, K., Ahmed, T., Kawser, M., Ahamed, B.: Bangla automatic number plate recognition system using artificial neural network. Asian Trans. Sci. Technol. (ATST) 2(1), 1–10 (2012)
Witkin, A.: Scale-space filtering: a new approach to multi-scale description. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 9, pp. 150–153. IEEE Press, New York (1984)
Gardner, M.W., Dorling, S.R.: Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences. Atmos. Environ. 32(14), 2627–2636 (1998)
De Jong, K.A., Spears, W.M.: A formal analysis of the role of multi-point crossover in genetic algorithms. Ann. Math. Artif. Intell. 5(1), 1–26 (1992)
Goldberg, D.E.: Genetic algorithms. Pearson Education India
Wold, S., Esbensen, K., Geladi, P.: Principal component analysis. Chemometr. Intell. Lab. Syst. 2(1–3), 37–52 (1987)
Robnik-Šikonja, M., Kononenko, I.: Theoretical and empirical analysis of ReliefF and RReliefF. Mach. Learn. 53(1–2), 23–69 (2003)
Censor, Y.: Pareto optimality in multi objective problems. Appl. Mathe. Optim. 4(1), 41–59 (1977)
Glover, F., Laguna, M.: Tabu Search. Springer, New York (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ghosh, M., Malakar, S., Bhowmik, S., Sarkar, R., Nasipuri, M. (2017). Memetic Algorithm Based Feature Selection for Handwritten City Name Recognition. In: Mandal, J., Dutta, P., Mukhopadhyay, S. (eds) Computational Intelligence, Communications, and Business Analytics. CICBA 2017. Communications in Computer and Information Science, vol 776. Springer, Singapore. https://doi.org/10.1007/978-981-10-6430-2_47
Download citation
DOI: https://doi.org/10.1007/978-981-10-6430-2_47
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6429-6
Online ISBN: 978-981-10-6430-2
eBook Packages: Computer ScienceComputer Science (R0)