Skip to main content

A Heuristic Automatic Clustering Method Based on Hierarchical Clustering

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8946))

Abstract

We propose a clustering method which produces valid results while automatically determining an optimal number of clusters. The proposed method achieves these results with minimal user input, of which none pertains to a number of clusters. Our method’s effectiveness in clustering, including its ability to produce valid results on data sets presenting nested or interlocking shapes, is demonstrated and compared with cluster validity analysis to other methods to which a known optimal number of clusters is provided, and to other automatic clustering methods. Depending on the particularities of the data set used, our method has produced results which are roughly equivalent or better than those of the compared methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Gan, G.: Data Clustering in C++: An Object-Oriented Approach. Chapman and Hall/CRC, Boca Raton (2011)

    Book  MATH  Google Scholar 

  2. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31, 264–323 (1999)

    Article  Google Scholar 

  3. Guan, Y., Ghorbani, A., Belacel, N.: Y-means: a clustering method for intrusion detection. In: Canadian Conference on Electrical and Computer Engineering 2003, IEEE CCECE 2003, vol. 2, pp. 1083–1086, IEEE (2003)

    Google Scholar 

  4. Mok, P., Huang, H., Kwok, Y., Au, J.: A robust adaptive clustering analysis method for automatic identification of clusters. Pattern Recogn. 45, 3017–3033 (2012)

    Article  Google Scholar 

  5. Wu, K.L., Yang, M.S.: A cluster validity index for fuzzy clustering. Pattern Recogn. Lett. 26, 1275–1291 (2005)

    Article  Google Scholar 

  6. Xie, X., Beni, G.: A validity measure for fuzzy clustering. IEEE Trans. Pattern Anal. Mach. Intell. 13, 841–847 (1991)

    Article  Google Scholar 

  7. Pal, N., Bezdek, J.: On cluster validity for the fuzzy c-means model. IEEE Trans. Fuzzy Syst. 3, 370–379 (1995)

    Article  Google Scholar 

  8. Fukuyama, Y., Sugeno, M.: A new method of choosing the number of clusters for the fuzzy c-means method. In: Proceedings of Fifth Fuzzy Systems Symposium, pp. 247–250 (1989)

    Google Scholar 

  9. Pakhira, M.K., Bandyopadhyay, S., Maulik, U.: Validity index for crisp and fuzzy clusters. Pattern Recogn. 37, 487–501 (2004)

    Article  MATH  Google Scholar 

  10. Rezaee, M.R., Lelieveldt, B., Reiber, J.: A new cluster validity index for the fuzzy c-mean. Pattern Recogn. Lett. 19, 237–246 (1998)

    Article  MATH  Google Scholar 

  11. Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)

    Article  MATH  Google Scholar 

  12. Kaufman, L.R., Rousseeuw, P.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York (1990)

    Book  MATH  Google Scholar 

  13. MacNaughton-Smith, P.: Dissimilarity analysis: a new technique of hierarchical sub-division. Nature 202, 1034–1035 (1964)

    Article  MATH  Google Scholar 

  14. Bezdek, J.C., Ehrlich, R., Full, W.: Fcm: the fuzzy c-means clustering algorithm. Comput. Geosci. 10, 191–203 (1984)

    Article  Google Scholar 

  15. Fisher, R.A.: The use of multiple measurements in taxonomic problems. Ann. Eugen. 7, 179–188 (1936)

    Article  Google Scholar 

  16. Gionis, A., Mannila, H., Tsaparas, P.: Clustering aggregation. ACM Trans. Knowl. Discov. Data 1, Article 4 (2007)

    Article  Google Scholar 

Download references

Acknowledgements

We gratefully acknowledge the support from NBIF’s (RAI 2012-047) New Brunswick Innovation Funding granted to Dr. Nabil Belacel.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to François LaPlante .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

LaPlante, F., Belacel, N., Kardouchi, M. (2015). A Heuristic Automatic Clustering Method Based on Hierarchical Clustering. In: Duval, B., van den Herik, J., Loiseau, S., Filipe, J. (eds) Agents and Artificial Intelligence. ICAART 2014. Lecture Notes in Computer Science(), vol 8946. Springer, Cham. https://doi.org/10.1007/978-3-319-25210-0_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25210-0_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25209-4

  • Online ISBN: 978-3-319-25210-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics