Skip to main content
Log in

Enhancement of artificial neural network learning using centripetal accelerated particle swarm optimization for medical diseases diagnosis

  • Methodologies and Application
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

In recent decades, artificial neural networks (ANNs) have been extensively applied in different areas such as engineering, medicine, business, education, manufacturing and so on. Nowadays, ANNs are as a hot research in medicine especially in the fields of medical disease diagnosis. To have a high efficiency in ANN, selection of an appropriate architecture and learning algorithm is very important. ANN learning is a complex task and an efficient learning algorithm has a significant role to enhance ANN performance. In this paper, a new meta-heuristic algorithm, centripetal accelerated particle swarm optimization (CAPSO), is applied to evolve the ANN learning and accuracy. The algorithm is based on an improved scheme of particle swarm algorithm and Newton’s laws of motion. The hybrid learning of CAPSO and multi-layer perceptron (MLP) network, CAPSO-MLP, is used to classify the data of nine standard medical datasets of Hepatitis, Heart Disease, Pima Indian Diabetes, Wisconsin Prognostic Breast Cancer, Parkinson’s disease, Echocardiogram, Liver Disorders, Laryngeal 1 and Acute Inflammations. The performance of CAPSO-MLP is compared with those of PSO, gravitational search algorithm and imperialist competitive algorithm on MLP. The efficiency of methods are evaluated based on mean square error, accuracy, sensitivity, specificity, area under the receiver operating characteristics curve and statistical tests of \(t\)-test and Wilcoxon’s signed ranks test. The results indicate that CAPSO-MLP provides more effective performance than the others for medical disease diagnosis especially in term of unseen data (testing data) and datasets with high missing data values.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  • Abbod MF, Cheng KY, Cui XR, Huang SJ, Han YY, Shieh JS (2011) Ensembled neural networks for brain death prediction for patients with severe head injury. Biomed Signal Process Control 6:414–421

    Article  Google Scholar 

  • Ahmadi MA, Ahmadi MR, Shadizadeh SR (2013) Evolving artificial neural network and imperialist competitive algorithm for prediction permeability of the reservoir. Neural Comput Appl 23:563

    Google Scholar 

  • Anderson JA (2006) An introduction to neural networks. Prentice-Hall of India Private Limited

  • Ari S, Saha G (2009) In search of an optimization technique for artificial neural network to classify abnormal heart sounds. Appl Soft Comput 9:330–340

    Article  Google Scholar 

  • Ashizawa K, Ishida T, MacMahon H, Vyborny CJ, Katsuragawa S, Doi K (1999) Artificial neural networks in chest radiography: application to the differential diagnosis of interstitial lung disease. Acad Radiol 6:2–9

    Google Scholar 

  • Atashpaz-Gargari E, Lucas C (2007) Imperialist competitive algorithm: an algorithm for optimization inspired by imperialistic competition. In: Proceedings of IEEE congress on evolutionary computation, pp 4661–4667

  • Awad M, Motai Y (2008) Dynamic classification for video stream using support vector machine. Appl Soft Comput 8:1314–1325

    Article  Google Scholar 

  • Bache K, Lichman M (2013) UCI machine learning repository. School of Information and Computer Science, University of California, Irvine. http://archive.ics.uci.edu/ml

  • Beheshti Z, Shamsuddin SM (2013a) CAPSO: centripetal accelerated particle swarm optimization. Inf Sci. doi:10.1016/j.ins.2013.08.015

  • Beheshti Z, Shamsuddin SM (2013b) A review of population-based meta-heuristic algorithm. Int J Adv Soft Comput Appl 5(1):1–35

    Google Scholar 

  • Bennett KP, Mangasarian OL (1992) Robust linear programming discrimination of two linearly inseparable sets. Optim Method Softw 1:23–34

    Article  Google Scholar 

  • Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press, Oxford

  • Breiman L, Friedman JH, Olshen RA, Stone C (1984) Classification and regression trees. Wadsworth

  • Brent RP (1991) Fast training algorithms for multi-layer neural nets. IEEE Trans Neural Netw 2:346–354

    Article  Google Scholar 

  • Box GE-P, Hunter JS, Hunter WG (2005) Statistics for experiments: design, innovation, and discovery, 2nd edn. Wiley, New York

    Google Scholar 

  • Chau KW (2007) Application of a PSO-based neural network in analysis of outcomes of construction claims. Autom Constr 16:642– 646

    Google Scholar 

  • Chen MY, Chen KK, Chiang HK, Huang HS, Huang MJ (2007) Comparing extended classifier system and genetic programming for financial forecasting: an empirical study. Soft Comput. 11:1173–1183

    Article  MATH  Google Scholar 

  • Coppini G, Miniati M, Paterni M, Monti S, Ferdeghini EM (2007) Computer-aided diagnosis of emphysema in COPD patients: neural-network-based analysis of lung shape in digital chest radiographs. Med Eng Phys 29:76–86

    Article  Google Scholar 

  • Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, Cambridge

  • Delen D, Walker G, Kadam A (2005) Predicting breast cancer survivability: a comparison of three data mining methods. Artif Intell Med 34:113–127

    Article  Google Scholar 

  • Dorigo M, Maniezzo V, Colorni A (1996) The ant system: optimization by a colony of cooperating agents. IEEE Trans Syst Man Cybern Part C Appl Rev 26:29–41

    Article  Google Scholar 

  • Ekici S (2012) Support vector machines for classification and locating faults on transmission lines. Appl Soft Comput 12:1650–1658

    Article  Google Scholar 

  • Elveren E, Yumuşak N (2011) Tuberculosis disease diagnosis using artificial neural network trained with genetic algorithm. J Med Syst 35:329–332

    Article  Google Scholar 

  • Er O, Sertkaya C, Temurtas F, Tanrikulu AC (2009) A comparative study on chronic obstructive pulmonary and pneumonia diseases diagnosis using neural networks and artificial immune system. J Med Syst 33:485–492

    Article  Google Scholar 

  • Fana CY, Changb PC, Linb JJ, Hsiehb JC (2011) A hybrid model combining case-based reasoning and fuzzy decision tree for medical data classification. Appl Soft Comput 11:632–644

    Article  Google Scholar 

  • Farmer JD, Packard NH, Perelson AS (1986) The immune system, adaptation and machine learning. Phys D 2:187–204

    Article  MathSciNet  Google Scholar 

  • Fausett LV (1994) Fundamentals of neural networks: architectures, algorithms, and applications. Prentice-Hall, New Jersey

  • Fawcett T (2006) An introduction to ROC analysis. Pattern Recognit Lett 27:861–874

    Article  Google Scholar 

  • Folland R, Hines E, Dutta R, Boilot P, Morgan D (2004) Comparison of neural network predictors in the classification of tracheal-bronchial breath sounds by respiratory auscultation. Artif Intell Med 31:211–220

    Article  Google Scholar 

  • Gao WF, Liu SY, Huang LL (2012) Particle swarm optimization with chaotic opposition based population initialization and stochastic search technique. Commun Nonlinear Sci Numer Simul 17:4316–4327

    Article  MathSciNet  MATH  Google Scholar 

  • García S, Fernández A, Herrera F (2009) Enhancing the effectiveness and interpretability of decision tree and rule induction classifiers with evolutionary training set selection over imbalanced problems. Appl Soft Comput 9:1304–1314

    Article  Google Scholar 

  • Gori M, Tesi A (1992) On the problem of local minima in backpropagation. IEEE Trans Pattern Anal Mach Intell 14:76–85

    Article  Google Scholar 

  • Gurney KN (1997) An introduction to neural networks. Routledge

  • Handels H, Roß TH, Kreusch J, Wolff HH, Poppl SJ (1999) Feature selection for optimized skin tumor recognition using genetic algorithms. Artif Intell Med 16:283–297

    Article  Google Scholar 

  • Hecht-Nielsen R (1987) Kolmogorov’s mapping neural network existence theorem. In: Proceedings of IEEE international conference on neural networks, pp 11–14

  • Heckerling PS, Gerber BS, Tape TG, Wigton RS (2004) Use of genetic algorithms for neural networks to predict community-acquired pneumonia. Artif Intell Med 30:71–84

    Article  Google Scholar 

  • Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2:359–366

    Article  Google Scholar 

  • Hrstka O, Kučerová A (2004) Improvements of real coded genetic algorithms based on differential operators preventing premature convergence. Adv Eng Softw 35:237–246

    Article  Google Scholar 

  • Huang ML, Hung YH, Chen WY (2010) Neural network classifier with entropy based feature selection on breast cancer diagnosis. J Med Syst 34:865–873

    Article  Google Scholar 

  • Jerez JM, Molina I, García-Laencina PJ, Alba E, Ribelles N, Martín M, Franco L (2010) Missing data imputation using statistical and machine learning methods in a real breast cancer problem. Artif Intell Med 50:105–115

    Article  Google Scholar 

  • Karabatak M, Ince MC (2009) An expert system for detection of breast cancer based on association rules and neural network. Expert Syst Appl 36:3465–3469

    Article  Google Scholar 

  • Karaboga D (2005) An idea based on honey bee swarm for numerical optimization. Technical Report, TR06, Erciyes University

  • Kayaer K, Yildirim T (2003) Medical diagnosis on Pima Indian diabetes using general regression neural networks. In: Proceedings of international conference on artificial neural networks and neural information, pp 181–184

  • Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of IEEE international conference on neural networks, pp 1942–1948

  • Kisi O, Ozturk O (2007) Adaptive neuro-fuzzy computing technique for evapotranspiration estimation. J Irrig Drainage Eng ASCE 133:368–379

    Article  Google Scholar 

  • Kuncheva LI (2005) Real medical data sets. Tech. Rep. School of Informatics: University of Wales, Bangor. http://www.bangor.ac.uk/~mas00a/activities/real_data.htm

  • Leung Y, Gao Y, Xu ZB (1997) Degree of population diversity—a perspective on premature convergence in genetic algorithms and its Markov chain analysis. IEEE Trans Neural Netw 8:1165–1176

    Article  Google Scholar 

  • Liang JJ, Qin AK, Suganthan PN, Baskar S (2006) Comprehensive learning particle swarm optimizer for global optimization of multimodal functions. IEEE Trans Evol Comput 10:281–295

    Article  Google Scholar 

  • Mitra V, Wang CJ (2008) Content based audio classification: a neural network approach. Soft Comput 12:639–646

    Article  Google Scholar 

  • Moslemipour G, Lee TS, Rilling D (2012) A review of intelligent approaches for designing dynamic and robust layouts in flexible manufacturing systems. Int J Adv Manuf Technol 60:11–27

    Article  Google Scholar 

  • Oreski S, Oreski D, Oreski G (2012) Hybrid system with genetic algorithm and artificial neural networks and its application to retail credit risk assessment. Expert Syst Appl 39:12605–12617

    Article  Google Scholar 

  • Ozkan C, Kisi O, Akay B (2011) Neural networks with artificial bee colony algorithm for modeling daily reference evapotranspiration. Irrig Sci 29:431–441

    Article  Google Scholar 

  • Park J, Sandberg I (1991) Universal approximation using radial-basis function networks. Neural Comput 3:246–257

    Article  Google Scholar 

  • Prechelt L (1995) Some notes on neural learning algorithm benchmarking. Neurocomputing 9:343–347

    Google Scholar 

  • Qasem SN, Shamsuddin SM (2011) Radial basis function network based on time variant multi-objective particle swarm optimization for medical diseases diagnosis. Appl Soft Comput 11:1427–1438

    Article  Google Scholar 

  • Quinlan JR (1993) C4.5: Programs for Machine Learning. Morgan Kaufmann

  • Rashedi E, Nezamabadi-pour H, Saryazdi S (2009) GSA: a gravitational search algorithm. Inf Sci 179:2232–2248

    Article  MATH  Google Scholar 

  • Reynolds AP, Iglesia B (2009) A multi-objective GRASP for partial classification. Soft Comput 13:227–243

    Article  Google Scholar 

  • Rumelhart DE, McClelland JL (1986) Parallel distributed processing: explorations in the microstructure of cognition. MIT, Cambridge

    Google Scholar 

  • Samanta B, Al-Balushi KR, Al-Araimi SA (2003) Artificial neural networks and support vector machines with genetic algorithm for bearing fault detection. Eng Appl Artif Intell 16:657–665

    Article  Google Scholar 

  • Saxena A, Saad A (2007) Evolving an artificial neural network classifier for condition monitoring of rotating mechanical systems. Appl Soft Comput 7:441–454

    Article  Google Scholar 

  • Schutz B (2003) Gravity from the ground up. Cambridge University Press, Cambridge

  • Setiono R, Hui LCK (1995) Use of a quasinewton method in a feedforward neural network construction algorithm. IEEE Trans Neural Netw 6:740–747

    Article  Google Scholar 

  • Shi Y, Eberhart R (1998) A modified particle swarm optimizer. In: Proceedings of IEEE international conference on evolutionary computation, pp 69–73

  • Sivagaminathan RK, Ramakrishnan S (2007) A hybrid approach for feature subset selection using neural networks and ant colony optimization. Expert Syst Appl 33:49–60

    Article  Google Scholar 

  • Socha K, Blum C (2007) An ant colony optimization algorithm for continuous optimization: application to feed-forward neural network training. Neural Comput Appl 16:235–247

    Article  Google Scholar 

  • Sulaiman SI, Abdul-Rahman TK, Musirin I, Shaari S (2012) An artificial immune-based hybrid multi-layer feedforward neural network for predicting grid-connected photovoltaic system output. Energy Proc 14:260–264

    Article  Google Scholar 

  • Tang KS, Man KF, Kwong S, He Q (1996) Genetic algorithms and their applications. IEEE Signal Process Mag 13:22–37

    Article  Google Scholar 

  • Tayefeh-Mahmoudi M, Taghiyareh F, Forouzideh N, Lucas C (2013) Evolving artificial neural network structure using grammar encoding and colonial competitive algorithm. Neural Comput Appl 22:1–16

    Article  Google Scholar 

  • Temurtas F (2007) A comparative study on thyroid disease diagnosis using neural networks. Expert Syst Appl 36:944–949

    Article  Google Scholar 

  • Tsoulos I, Gavrilis D, Glavas E (2008) Neural network construction and training using grammatical evolution. Neurocomputing 72:269–277

    Article  Google Scholar 

  • Wilcoxon F (1945) Individual comparisons by ranking methods. Biometrics Bull 1:80–83

    Article  Google Scholar 

  • Yao X (1999) Evolving artificial neural networks. Proc IEEE 87:1423–1447

    Article  Google Scholar 

  • Zhang C, Shao H, Li Y (2000) Particle swarm optimization for evolving artificial neural network. In: Proceedings of IEEE international conference on systems, man, and sybernetics, pp 2487–2490

  • Ziver AK, Pain CC, Carter JN, de Oliveira CRE, Goddard AJ, Overton RS (2004) Genetic algorithms and artificial neural networks for loading pattern optimization of advanced gas-cooled reactors. Ann Nucl Energy 13:431–457

    Article  Google Scholar 

Download references

Acknowledgments

This research is supported by Research University Grant (03H72), Universiti Teknologi Malaysia. The authors would like to thank Soft Computing Research Group (SCRG), Universiti Teknologi Malaysia (UTM), Johor Bahru Malaysia, for supporting this study. Also, they give kind respect and special thanks to Dr. Zhaleh Beheshti, for proof reading the manuscript and providing valuable comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zahra Beheshti.

Additional information

Communicated by D. Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Beheshti, Z., Shamsuddin, S.M.H., Beheshti, E. et al. Enhancement of artificial neural network learning using centripetal accelerated particle swarm optimization for medical diseases diagnosis. Soft Comput 18, 2253–2270 (2014). https://doi.org/10.1007/s00500-013-1198-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-013-1198-0

Keywords

Navigation