Abstract
A number of parameters must be specified for a data-mining algorithm. Default values of these parameters are given and generally accepted as ‘good’ estimates for any data set. However, data mining models are known to be data dependent, and so are for their parameters. Default values may be good estimates, but they are often not the best parameter values for a particular data set. A tuned set of parameter values is able to produce a data-mining model of better classification and higher prediction accuracy. However parameter search is known to be expensive. This paper investigates GA-based heuristic techniques in a case study of optimizing parameters of back-propagation neural network classifier. Our experiments show that GA-based optimization technique is capable of finding a better set of parameter values than random search. In addition, this paper extends the island-model of Parallel GA (PGA) and proposes a VC-PGA, which communicates globally fittest individuals to local population with reduced communication overhead. Our result shows that GA-based parallel heuristic optimization technique provides a solution to large parametric optimization problems.
Chapter PDF
Similar content being viewed by others
References
D.E. Goldberg, Genetic Algorithms in Search, Optimization, and Machine Learning, Addison-Wesley Publishing Co., 1989.
T. Starkweather, D. Whitley, and K. Mathias, “Optimization using distributed genetic algorithm”, Parallel Problem Solving from Nature, pp. 176–185, Springer Verlag, 1991.
M. Gorges-Schleuter, “Explicit parallelism of genetic algorithms through population structures”, Parallel Problem Solving from Nature, pp 150–159, Springer Verlag, 1991.
V. S. Gordon and D. Whitley, “A Machine-Independent Analysis of Parallel Genetic Algorithms”, Complex Systems, 8:181–214, 1994.
I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, Morgan Kaufmann, 2000.
S. M. Sait and Y. Youusef, “Iterative Computer Algorithms with Application in Engineering”, Solving Combinatorial Optimization Problems, IEEE Computer Society, 1999.
L.B. Booker, D.E. Goldberg, and J.H. Holland, “Classifier Systems and Genetic Algorithms”, Artificial Intelligence, Vol 40, pp. 235–282, 1989.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tam, L., Taniar, D., Smith, K. (2002). Parametric Optimization in Data Mining Incorporated with GA-Based Search. In: Sloot, P.M.A., Hoekstra, A.G., Tan, C.J.K., Dongarra, J.J. (eds) Computational Science — ICCS 2002. ICCS 2002. Lecture Notes in Computer Science, vol 2329. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46043-8_59
Download citation
DOI: https://doi.org/10.1007/3-540-46043-8_59
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43591-4
Online ISBN: 978-3-540-46043-5
eBook Packages: Springer Book Archive