Skip to main content
Log in

Unsupervised data classification using improved biogeography based optimization

  • Original Article
  • Published:
International Journal of System Assurance Engineering and Management Aims and scope Submit manuscript

Abstract

Unsupervised data classification (data clustering) is one of the mostly used data analysis methods which groups the unlabeled data into identical clusters (groups). Classical clustering methods do not perform effectively while clustering high dimensional datasets viz micro array datasets. Therefore, a novel clustering method based on Biogeography based optimization is proposed to extend the capabilities of traditional clustering methods. Performance of proposed method has been tested on the four micro-array datasets. Experimental results validate the effectiveness of proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  • Abaei G, Selamat A, Fujita F (2015) An empirical study based on semi-supervised hybrid self-organizing map for software fault prediction. Knowledge-Based Systems 74:28–39

    Article  Google Scholar 

  • Bansal JC, Sharma H, Jadon SS, Clerc M (2014) Spider monkey optimization algorithm for numerical optimization. Memetic Comput 1:31–47

    Article  Google Scholar 

  • Bansal JC, Sharma H, Arya K, Deep K, Pant M (2014) Self-adaptive artificial bee colony. Optimization 63:1513–1532

    Article  MathSciNet  MATH  Google Scholar 

  • Berkhin P (2006) A survey of clustering data mining techniques. In: Grouping multidimensional data, Springer, pp. 25–71

  • Chapelle O, Schölkopf B, Zien et al (2009) A semi-supervised learning

  • Feng Q, Liu S, Zhang J, Yang G, Yong L (2017) Improved biogeography-based optimization with random ring topology and powell’s method. Appl Math Model 41:630–649

    Article  MathSciNet  Google Scholar 

  • Gantz J, Reinsel D (2012) The digital universe in 2020. http://www.emc.com/leadership/digital-universe/2012iview/executive-summary-a-universe-of.htm

  • Gaur D, Gaur S (2013) Comprehensive analysis of data clustering algorithms. Future information communication technology and applications. Springer, Netherlands, pp 753–762

    Chapter  Google Scholar 

  • Gong W, Cai Z, Ling CX (2010) De/bbo: a hybrid differential evolution with biogeography-based optimization for global numerical optimization. Soft Comput 15:645–665

    Article  Google Scholar 

  • Jadon SS, Bansal JC, Tiwari R, Sharma H (2014) Artificial bee colony algorithm with global and local neighborhoods. Int J Syst Assur Eng Manag. doi:10.1007/s13198-014-0286-6

    Google Scholar 

  • Jain AK (2010) Data clustering: 50 years beyond k-means. Pattern Recognit Lett 31:651–666

    Article  Google Scholar 

  • Jenifer S, Parasuraman S, Kadirvelu A (2016) Contrast enhancement and brightness preserving of digital mammograms using fuzzy clipped contrast-limited adaptive histogram equalization algorithm. Appl Soft Comp 42:167–177

    Article  Google Scholar 

  • Kaur M, Garg SK (2014) Survey on clustering techniques in data mining for software engineering. Int J Adv Innov Res 3:238–243

    Google Scholar 

  • Kennedy J, Eberhart R (1995) Particle swarm optimization. Neural Netw 4:1942–1948

    Google Scholar 

  • Kotsiantis SB, Zaharakis I, Pintelas P (2007) Supervised machine learning: a review of classification techniques. Informatica 31(3):3–24

  • Kulhari A, Pandey A, Pal R, Mittal H (2016) Unsupervised data classification using modified cuckoo search method. In: Proceedings ninth international conference on contemporary computing (IC3), 2016, IEEE, pp. 1–5

  • Lohokare M, Panigrahi BK, Pattnaik SS, Devi S, Mohapatra A (2012) Neighborhood search-driven accelerated biogeography-based optimization for optimal load dispatch. IEEE Trans Syst Man, and Cybernet Part C (Appli Rev) 42:641–652

    Article  Google Scholar 

  • MacArthur RH, Wilson EO (2015) Theory of Island Biogeography. (MPB-1), vol 1. Princeton University Press, New Jersey

    Google Scholar 

  • Manikandan P, Selvarajan S (2014) Data clustering using cuckoo search algorithm (csa). In: Proceedings of the second international conference on soft computing for problem solving (SocProS 2012), December 28-30, pp. 1275–1283

  • Maulik U, Bandyopadhyay S (2000) Genetic algorithm-based clustering technique. Pattern Recognit 33:1455–1465

    Article  Google Scholar 

  • Meng L, Kang Q, Han C, Hu S (2017) Biogeography-based optimisation for road recovery problem considering value of delay after urban waterlog disaster. Int J Bio-Inspired Comput 9:157–164

    Article  Google Scholar 

  • Ovreiu M, Simon D (2010) Biogeography-based optimization of neuro-fuzzy system parameters for diagnosis of cardiac disease. In: Proceedings of the 12th annual conference on Genetic and evolutionary computation, ACM, pp. 1235–1242

  • Pal R, Pandey HMA, Saraswat M (2016) Beecp: biogeography optimization-based energy efficient clustering protocol for hwsns, In: Proceedings ninth international conference of contemporary computing (IC3), 2016, IEEE, pp. 1–6

  • Pandey AC, Rajpoot DS, Saraswat M (2016) Data clustering using hybrid improved cuckoo search method, In: Proceedings ninth international conference on contemporary computing (IC3), IEEE, 2016, pp. 1–6

  • Pandey AC, Rajpoot DS, Saraswat M (2017) Twitter sentiment analysis using hybrid cuckoo search method. Inform Proc Manag 53:764–779

    Article  Google Scholar 

  • Park HS, Jun CH (2009) A simple and fast algorithm for k-medoids clustering. Exp Syst Appl 36:3336–3341

    Article  Google Scholar 

  • Sharma PK, Sharma H, Sharma N (2016) Gbest inspired biogeography based optimization algorithm, In: Proceedings international conference of power electronics, intelligent control and energy systems (ICPEICES), IEEE, pp. 1–6

  • Sharma PK, Sharma H, Sharma N (2016) Gbest inspired biogeography based optimization algorithm, In: Proceedings international conference of power electronics, intelligent control and energy systems (ICPEICES), IEEE, pp. 1–6

  • Sharma K, Chhamunya V, Gupta P, Sharma H, Bansa JC (2015) Fitness based particle swarm optimization. Int J S Assur Eng Manag 6:319–329

    Article  Google Scholar 

  • Simon D (2008) Biogeography-based optimization. IEEE Trans Evolut Comput 12:702–713

    Article  Google Scholar 

  • Storn R, Price K (1997) Differential evolution-a simple and efficient heuristic for global optimization over continuous spaces. J Global Optim 11:341–359

    Article  MathSciNet  MATH  Google Scholar 

  • Tan P-N, Steinbach M, Kumar V (2013) Data mining cluster analysis: basic concepts and algorithms. In: Xiong H (ed) Introduction to data mining.  Lecture Notes for Chapter 8. Rutgers University, New Jersey

  • Valian E, Mohanna S, Tavakoli S (2011) Improved cuckoo search algorithm for feedforward neural network training. Int J Artif Intell Appl 2:36–43

    Google Scholar 

  • Van der Merwe D, Engelbrecht AP (2003) Data clustering using particle swarm optimization. In: Evolutionary computation, 2003. CEC’03. The 2003 Congress on, pp. 215–220

  • Wang S, Li P, Chen P, Phillips P, Liu G, Du S, Zhang Y (2017a) Pathological brain detection via wavelet packet tsallis entropy and real-coded biogeography-based optimization. Fundam Inform 151:275–291

    Article  MathSciNet  Google Scholar 

  • Wang C, Wang Y, Wang K, Dong Y, Yang Y (2017b) An improved hybrid algorithm based on biogeography/complex and metropolis for many-objective optimization. Math Prob Eng 2017:2462891. doi:10.1155/2017/2462891

    MathSciNet  Google Scholar 

  • Yang X-S, Deb S (2009) Cuckoo search via lévy flights. In: World congress on nature and biologically inspired computing, NaBIC 2009, pp. 210–214

  • Yogesh C, Hariharan M, Ngadiran R, Adom A H, Yaacob S, Berkai C, Polat K (2017) A new hybrid pso assisted biogeography-based optimization for emotion and stress recognition from speech signal. Exp Syst Appl 69:149–158

    Article  Google Scholar 

  • Žalik KR (2008) An efficient k’-means clustering algorithm. Pattern Recognit Lett 29:1385–1391

    Article  Google Scholar 

  • Zhang Y, Wu X, Lu S, Wang H, Phillips P, Wang S (2016a) Smart detection on abnormal breasts in digital mammography based on contrast-limited adaptive histogram equalization and chaotic adaptive real-coded biogeography-based optimization. Simulation 92(9):873–885

    Article  Google Scholar 

  • Zhang Y, Phillips P, Wang S, Ji G, Yang J, Wu J (2016b) Fruit classification by biogeography-based optimization and feedforward neural network. Exp Syst 33(3):239–253

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Avinash Chandra Pandey.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pandey, A.C., Pal, R. & Kulhari, A. Unsupervised data classification using improved biogeography based optimization. Int J Syst Assur Eng Manag 9, 821–829 (2018). https://doi.org/10.1007/s13198-017-0660-2

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13198-017-0660-2

Keywords

Navigation