Skip to main content

Automatic Clustering Using a Genetic Algorithm with New Solution Encoding and Operators

  • Conference paper
Book cover Computational Science and Its Applications – ICCSA 2014 (ICCSA 2014)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8580))

Included in the following conference series:

Abstract

Genetic algorithms (GA) are randomized search and optimization techniques which have proven to be robust and effective in large scale problems. In this work, we propose a new GA approach for solving the automatic clustering problem, ACGA - Automatic Clustering Genetic Algorithm. It is capable of finding the optimal number of clusters in a dataset, and correctly assign each data point to a cluster without any prior knowledge about the data. An encoding scheme which had not yet been tested with GA is adopted and new genetic operators are developed. The algorithm can use any cluster validity function as fitness function. Experimental validation shows that this new approach outperforms the classical clustering methods K-means and FCM. The method provides good results, and requires a small number of iterations to converge.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Belahbib, F., Souami, F.: Genetic algorithm clustering for color image quantization. In: 3rd European Workshop on Visual Information Processing (EUVIP), pp. 83–87 (2011)

    Google Scholar 

  2. Mecca, G., Raunich, S., Pappalardo, A.: A New Algorithm for Clustering Search Results. Data and Knowledge Engineering 62, 504–522 (2007)

    Article  Google Scholar 

  3. Valafar, F.: Pattern Recognition Techniques in Microarray Data Analysis: A Survey. Annals of New York Academy of Sciences 980, 41–64 (2002)

    Article  Google Scholar 

  4. Hartigan, J., Wong, M.: Algorithm AS 136: A K-Means Clustering Algorithm. Applied Statistics 28(1), 100–108 (1979)

    Article  MATH  Google Scholar 

  5. Bezdek, J., Ehrlich, R., Full, W.: FCM: The fuzzy c-means clustering algorithm. Computers and Geosciences 10(2-3), 191–203 (1984)

    Article  Google Scholar 

  6. Holland, J.: Genetic algorithms. Scientific American (1992)

    Google Scholar 

  7. Srinivas, M., Patnaik, M.: Genetic algorithm: A survey. IEEE Computer 27(6), 17–26 (1994)

    Article  Google Scholar 

  8. Murthy, C., Chowdhury, N.: In search of optimal clusters using GA. Pattern Recognition Letters 17, 825–832 (1996)

    Article  Google Scholar 

  9. Tseng, L., Yang, S.: A genetic approach to the automatic clustering problem. Pattern Recognition 34(2), 415–424 (2001)

    Article  MATH  Google Scholar 

  10. Agustin-Blas, L., Salcedo-Sanz, S., Jimenez-Fernandez, S., Carro-Calvo, L., Del Ser, J., Portilla-Figueras, J.A.: A new grouping GA for clustering problems. Expert Systems with Applications 39(10) (2012)

    Google Scholar 

  11. Sheikh, R., Raghuwanshi, M., Jaiswal, A.: Genetic Algorithm Based Clustering: A Survey. In: First International Conference on Emerging Trends in Engineering and Technology, vol. 2(6), pp. 314–319 (2008)

    Google Scholar 

  12. Liu, Y., Wu, X., Shen, Y.: Automatic clustering using genetic algorithms. Applied Mathematics and Computation 218(4), 1267–1279 (2011)

    Article  MATH  MathSciNet  Google Scholar 

  13. He, H., Tan, Y.: A two-stage genetic algorithm for automatic clustering. Neurocomputing 81, 49–59 (2012)

    Article  Google Scholar 

  14. Das, S., Abraham, A., Konar, A.: Automatic Clustering Using an Improved Differential Evolution Algorithm. IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans 38(1), 218–237 (2008)

    Article  Google Scholar 

  15. Calinski, R., Harabasz, J.: A dendrite method for cluster analysis. Communications in Statistics 3(1), 1–27 (1974)

    MATH  MathSciNet  Google Scholar 

  16. Asuncion, A., Newman, J.: UCI Machine Learning Repository. University of California, Department of Information and Computer Science, Irvine, CA (2007), http://www.ics.uci.edu/~mlearn/MLRepository.html

  17. Speech and Image Processing Unit. Clustering datasets, http://www.cs.joensuu.fi/sipu/datasets/

  18. Hubert, L., Arabie, P.: Comparing Partitions. Journal of Classification (2), 193–218 (1985)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Raposo, C., Antunes, C.H., Barreto, J.P. (2014). Automatic Clustering Using a Genetic Algorithm with New Solution Encoding and Operators. In: Murgante, B., et al. Computational Science and Its Applications – ICCSA 2014. ICCSA 2014. Lecture Notes in Computer Science, vol 8580. Springer, Cham. https://doi.org/10.1007/978-3-319-09129-7_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09129-7_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09128-0

  • Online ISBN: 978-3-319-09129-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics