Skip to main content

Advertisement

Log in

A hybrid genetic based functional link artificial neural network with a statistical comparison of classifiers over multiple datasets

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

This paper proposed a hybrid genetic based functional link artificial neural network (HFLANN) with simultaneous optimization of input features for the purpose of solving the problem of classification in data mining. The aim of the proposed approach is to choose an optimal subset of input features using genetic algorithm by eliminating features with little or no predictive information and increase the comprehensibility of resulting HFLANN. Using the functionally expanded of selected features, HFLANN overcomes the nonlinearity nature of problems, which is commonly encountered in single-layer neural networks. The features like simplicity of the architecture and low computational complexity of the network encourage us to use it in classification task of data mining. Further, the issue of statistical tests for comparison of algorithms on multiple datasets, which is even more essential to typical machine learning and data mining studies, has been all but ignored. In this work, we recommend a set of simple, yet safe and robust parametric and nonparametric tests for statistical comparisons of HFLANN with FLANN and RBF classifiers over multiple datasets by an extensive simulation studies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Abbreviations

Ω:

Universal set of individuals

M :

Number of classes

X :

Number of patterns

N :

Number of datasets used for experimental studies

K :

Number of algorithms (both proposed and used for comparisons)

\( P_{1}^{j} \) :

Performance of the jth algorithms on the ith dataset

\( \bar{P} \) :

Mean performance difference of algorithms

σ i :

Standard deviation of the ith algorithms over multiple datasets

σ p :

Variance of the difference between two means

R pos :

Summation of all positive ranks

R neg :

Summation of all negative ranks

R s :

The smallest rank among R pos and R neg

α:

Level of significance

Z :

z-Distributions

N :

Original set of features

D :

Selected set of features

T :

Number of iterations

E :

Error criterion

T 1 :

Test set 2

T 2 :

Test set 1

T :

Training/test set 1/2

τ:

Tradeoff between criteria

References

  1. Kriegel H-P et al (2007) Future trends in data mining. Data Min Knowl Disc 15(1):87–97

    Article  MathSciNet  Google Scholar 

  2. Fayyad UM, Piatetsky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery: an overview. In: Fayyad UM, Piatetsky-Shapiro G, Smyth P (eds) Advances in knowledge discovery and data mining. AAAI Press, Menlo Park, CA, pp 1–34

    Google Scholar 

  3. Misra BB, Dehuri S (2007) Functional link artificial neural network for classification task in data mining. J Comput Sci 3(12):948–955

    Article  Google Scholar 

  4. Oh II-S et al (2004) Hybrid genetic algorithms for feature selection. IEEE Trans Pattern Anal Mach Intell 20(11):1424–1437

    Google Scholar 

  5. Goldberg DE (1989) Genetic algorithms in search, optimization and machine learning. Morgan Kaufmann, Los Altos

    MATH  Google Scholar 

  6. Duda RO, Hart PE, Stork DG (2001) Pattern classification. Wiley, New York

    MATH  Google Scholar 

  7. Zhang GP (2000) Neural networks for classification: a survey. IEEE Trans Syst Man Cybern Part C Appl Rev 30(4):451–461

    Article  Google Scholar 

  8. Pao Y-H et al (1992) Neural-net computing and intelligent control systems. Int J Control 56(2):263–289. doi:10.1080/00207179208934315

    Article  MATH  MathSciNet  Google Scholar 

  9. Siedlecki W, Skalansky J (1988) On automatic feature selection. Int J Pattern Recognit Artif Intell 2(2):197–220. doi:10.1142/S0218001488000145

    Article  Google Scholar 

  10. Koza JR (1994) Genetic programming: on the programming of computers by means of natural selection. MIT Press, Cambridge

    Google Scholar 

  11. Chang C-G et al (2007) Application of particle swarm optimization based BP neural network on engineering project risk evaluating. In: Proceedings of 3rd international conference on natural computation (ICNC 2007), pp 750–754

  12. Wilcoxon F (1945) Individual comparisons by ranking methods. Biometrics 1:80–83. doi:10.2307/3001968

    Article  Google Scholar 

  13. Blake CL, Merz CJ UCI repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html

  14. Pal SK, Majumdar DD (1977) Fuzzy sets and decision making approaches in vowel and speaker recognition. IEEE Trans Syst Man Cybern 7:625–629

    Article  MATH  Google Scholar 

Download references

Acknowledgments

Dr. S. Dehuri would like to thank Department of Science and Technology, Govt. of India, vide letter number SR/BY/E-07/2007 dated 03-01-2008 for financial support under the BOYSCAST Program 2007–2008 and the financial support of BK21 Project, Korea’s Ministry of Education and Human Research Development. Prof. S. B. Cho would like to thank the financial support of BK21 Project initiated by Korea’s Ministry of Education and Human Resource Development.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Satchidananda Dehuri.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dehuri, S., Cho, SB. A hybrid genetic based functional link artificial neural network with a statistical comparison of classifiers over multiple datasets. Neural Comput & Applic 19, 317–328 (2010). https://doi.org/10.1007/s00521-009-0310-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-009-0310-y

Keywords

Navigation