Abstract
Unsupervised data classification (data clustering) is one of the mostly used data analysis methods which groups the unlabeled data into identical clusters (groups). Classical clustering methods do not perform effectively while clustering high dimensional datasets viz micro array datasets. Therefore, a novel clustering method based on Biogeography based optimization is proposed to extend the capabilities of traditional clustering methods. Performance of proposed method has been tested on the four micro-array datasets. Experimental results validate the effectiveness of proposed method.
Similar content being viewed by others
References
Abaei G, Selamat A, Fujita F (2015) An empirical study based on semi-supervised hybrid self-organizing map for software fault prediction. Knowledge-Based Systems 74:28–39
Bansal JC, Sharma H, Jadon SS, Clerc M (2014) Spider monkey optimization algorithm for numerical optimization. Memetic Comput 1:31–47
Bansal JC, Sharma H, Arya K, Deep K, Pant M (2014) Self-adaptive artificial bee colony. Optimization 63:1513–1532
Berkhin P (2006) A survey of clustering data mining techniques. In: Grouping multidimensional data, Springer, pp. 25–71
Chapelle O, Schölkopf B, Zien et al (2009) A semi-supervised learning
Feng Q, Liu S, Zhang J, Yang G, Yong L (2017) Improved biogeography-based optimization with random ring topology and powell’s method. Appl Math Model 41:630–649
Gantz J, Reinsel D (2012) The digital universe in 2020. http://www.emc.com/leadership/digital-universe/2012iview/executive-summary-a-universe-of.htm
Gaur D, Gaur S (2013) Comprehensive analysis of data clustering algorithms. Future information communication technology and applications. Springer, Netherlands, pp 753–762
Gong W, Cai Z, Ling CX (2010) De/bbo: a hybrid differential evolution with biogeography-based optimization for global numerical optimization. Soft Comput 15:645–665
Jadon SS, Bansal JC, Tiwari R, Sharma H (2014) Artificial bee colony algorithm with global and local neighborhoods. Int J Syst Assur Eng Manag. doi:10.1007/s13198-014-0286-6
Jain AK (2010) Data clustering: 50 years beyond k-means. Pattern Recognit Lett 31:651–666
Jenifer S, Parasuraman S, Kadirvelu A (2016) Contrast enhancement and brightness preserving of digital mammograms using fuzzy clipped contrast-limited adaptive histogram equalization algorithm. Appl Soft Comp 42:167–177
Kaur M, Garg SK (2014) Survey on clustering techniques in data mining for software engineering. Int J Adv Innov Res 3:238–243
Kennedy J, Eberhart R (1995) Particle swarm optimization. Neural Netw 4:1942–1948
Kotsiantis SB, Zaharakis I, Pintelas P (2007) Supervised machine learning: a review of classification techniques. Informatica 31(3):3–24
Kulhari A, Pandey A, Pal R, Mittal H (2016) Unsupervised data classification using modified cuckoo search method. In: Proceedings ninth international conference on contemporary computing (IC3), 2016, IEEE, pp. 1–5
Lohokare M, Panigrahi BK, Pattnaik SS, Devi S, Mohapatra A (2012) Neighborhood search-driven accelerated biogeography-based optimization for optimal load dispatch. IEEE Trans Syst Man, and Cybernet Part C (Appli Rev) 42:641–652
MacArthur RH, Wilson EO (2015) Theory of Island Biogeography. (MPB-1), vol 1. Princeton University Press, New Jersey
Manikandan P, Selvarajan S (2014) Data clustering using cuckoo search algorithm (csa). In: Proceedings of the second international conference on soft computing for problem solving (SocProS 2012), December 28-30, pp. 1275–1283
Maulik U, Bandyopadhyay S (2000) Genetic algorithm-based clustering technique. Pattern Recognit 33:1455–1465
Meng L, Kang Q, Han C, Hu S (2017) Biogeography-based optimisation for road recovery problem considering value of delay after urban waterlog disaster. Int J Bio-Inspired Comput 9:157–164
Ovreiu M, Simon D (2010) Biogeography-based optimization of neuro-fuzzy system parameters for diagnosis of cardiac disease. In: Proceedings of the 12th annual conference on Genetic and evolutionary computation, ACM, pp. 1235–1242
Pal R, Pandey HMA, Saraswat M (2016) Beecp: biogeography optimization-based energy efficient clustering protocol for hwsns, In: Proceedings ninth international conference of contemporary computing (IC3), 2016, IEEE, pp. 1–6
Pandey AC, Rajpoot DS, Saraswat M (2016) Data clustering using hybrid improved cuckoo search method, In: Proceedings ninth international conference on contemporary computing (IC3), IEEE, 2016, pp. 1–6
Pandey AC, Rajpoot DS, Saraswat M (2017) Twitter sentiment analysis using hybrid cuckoo search method. Inform Proc Manag 53:764–779
Park HS, Jun CH (2009) A simple and fast algorithm for k-medoids clustering. Exp Syst Appl 36:3336–3341
Sharma PK, Sharma H, Sharma N (2016) Gbest inspired biogeography based optimization algorithm, In: Proceedings international conference of power electronics, intelligent control and energy systems (ICPEICES), IEEE, pp. 1–6
Sharma PK, Sharma H, Sharma N (2016) Gbest inspired biogeography based optimization algorithm, In: Proceedings international conference of power electronics, intelligent control and energy systems (ICPEICES), IEEE, pp. 1–6
Sharma K, Chhamunya V, Gupta P, Sharma H, Bansa JC (2015) Fitness based particle swarm optimization. Int J S Assur Eng Manag 6:319–329
Simon D (2008) Biogeography-based optimization. IEEE Trans Evolut Comput 12:702–713
Storn R, Price K (1997) Differential evolution-a simple and efficient heuristic for global optimization over continuous spaces. J Global Optim 11:341–359
Tan P-N, Steinbach M, Kumar V (2013) Data mining cluster analysis: basic concepts and algorithms. In: Xiong H (ed) Introduction to data mining. Lecture Notes for Chapter 8. Rutgers University, New Jersey
Valian E, Mohanna S, Tavakoli S (2011) Improved cuckoo search algorithm for feedforward neural network training. Int J Artif Intell Appl 2:36–43
Van der Merwe D, Engelbrecht AP (2003) Data clustering using particle swarm optimization. In: Evolutionary computation, 2003. CEC’03. The 2003 Congress on, pp. 215–220
Wang S, Li P, Chen P, Phillips P, Liu G, Du S, Zhang Y (2017a) Pathological brain detection via wavelet packet tsallis entropy and real-coded biogeography-based optimization. Fundam Inform 151:275–291
Wang C, Wang Y, Wang K, Dong Y, Yang Y (2017b) An improved hybrid algorithm based on biogeography/complex and metropolis for many-objective optimization. Math Prob Eng 2017:2462891. doi:10.1155/2017/2462891
Yang X-S, Deb S (2009) Cuckoo search via lévy flights. In: World congress on nature and biologically inspired computing, NaBIC 2009, pp. 210–214
Yogesh C, Hariharan M, Ngadiran R, Adom A H, Yaacob S, Berkai C, Polat K (2017) A new hybrid pso assisted biogeography-based optimization for emotion and stress recognition from speech signal. Exp Syst Appl 69:149–158
Žalik KR (2008) An efficient k’-means clustering algorithm. Pattern Recognit Lett 29:1385–1391
Zhang Y, Wu X, Lu S, Wang H, Phillips P, Wang S (2016a) Smart detection on abnormal breasts in digital mammography based on contrast-limited adaptive histogram equalization and chaotic adaptive real-coded biogeography-based optimization. Simulation 92(9):873–885
Zhang Y, Phillips P, Wang S, Ji G, Yang J, Wu J (2016b) Fruit classification by biogeography-based optimization and feedforward neural network. Exp Syst 33(3):239–253
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Pandey, A.C., Pal, R. & Kulhari, A. Unsupervised data classification using improved biogeography based optimization. Int J Syst Assur Eng Manag 9, 821–829 (2018). https://doi.org/10.1007/s13198-017-0660-2
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13198-017-0660-2