Skip to main content

A Quality-Driven Ensemble Approach to Automatic Model Selection in Clustering

  • Conference paper

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 26))

Abstract

A fundamental limitation of the data clustering task is that it has an inherent, ill-defined model selection problem: the choice of a clustering technique also implies some a-priori decision on cluster geometry. In this work we explore the combined use of two different clustering paradigms and their combination by means of an ensemble technique. Mixing coefficients are computed on the basis of partition quality, so that the ensemble is automatically tuned so as to give more weight to the best-performing (in terms of the selected quality indices) clustering method.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Asuncion, A., Newman, D.J.: UCI machine learning repository (2007)

    Google Scholar 

  2. Bach, F.R., Jordan, M.I.: Learning spectral clustering. Tech. Rep. UCB/CSD-03-1249, EECS Department, University of California, Berkeley (2003)

    Google Scholar 

  3. Baraldi, A., Blonda, P.: A survey of fuzzy clustering algorithms for pattern recognition. I. IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics) 29, 778–785 (1999)

    Article  Google Scholar 

  4. Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, Norwell (1981)

    Book  MATH  Google Scholar 

  5. Chaudhuri, K., Chung, F., Tsiatas, A.: Spectral clustering of graphs with general degrees in the extended planted partition model. Journal of Machine Learning Research 2012, 1–23 (2012)

    Google Scholar 

  6. Chung, F.R.K.: Spectral Graph Theory. CBMS Regional Conference Series in Mathematics, vol. 92. American Mathematical Society (February 1997)

    Google Scholar 

  7. Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI 1(2), 224–227 (1979)

    Google Scholar 

  8. Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. Journal of Cybernetics 3, 32–57 (1974)

    Article  Google Scholar 

  9. Fiedler, M.: Algebraic connectivity of graphs. Czechoslovak Mathematical Journal 23(2), 298–305 (1973)

    MathSciNet  Google Scholar 

  10. Filippone, M., Camastra, F., Masulli, F., Rovetta, S.: A survey of kernel and spectral methods for clustering. Pattern Recognition 40(1), 176–190 (2008)

    Article  Google Scholar 

  11. Fischer, I., Poland, J.: New methods for spectral clustering. Tech. rep., IDSIA/USI-SUPSI (2004)

    Google Scholar 

  12. Fred, A.L.N., Jain, A.K.: Data clustering using evidence accumulation. In: International Conference on Pattern Recognition, vol. 4 (2002)

    Google Scholar 

  13. Gower, J.C., Ross, G.J.S.: Minimum spanning trees and single linkage cluster analysis. Journal of the Royal Statistical Society 18(1), 54–64 (1969)

    MathSciNet  Google Scholar 

  14. Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice Hall, Englewood Cliffs (1988)

    MATH  Google Scholar 

  15. Kriegel, H.P., Kröger, P., Sander, J., Zimek, A.: Density-based clustering. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 1(3), 231–240 (2011)

    Google Scholar 

  16. Krishnapuram, R., Keller, J.M.: A possibilistic approach to clustering. IEEE Transactions on Fuzzy Systems 1(2), 98–110 (1993)

    Article  Google Scholar 

  17. Kuncheva, L.: Combining pattern classifiers. Methods and Algorithms. Wiley, Chichester (2004)

    Google Scholar 

  18. Masulli, F., Rovetta, S.: Soft transition from probabilistic to possibilistic fuzzy clustering. IEEE Transactions on Fuzzy Systems 14(4), 516–527 (2006)

    Article  Google Scholar 

  19. Nadler, B., Galun, M.: Fundamental limitations of spectral clustering. In: Advances in Neural Information Processing Systems, vol. 19, p. 1017 (2007)

    Google Scholar 

  20. Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: Analysis and an algorithm. In: Dietterich, T.G., Becker, S., Ghahramani, Z. (eds.) Advances in Neural Information Processing Systems, vol. 14. MIT Press, Cambridge (2002)

    Google Scholar 

  21. Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8), 888–905 (2000)

    Article  Google Scholar 

  22. Strehl, A., Ghosh, J.: Cluster ensembles — a knowledge reuse framework for combining multiple partitions. J. Mach. Learn. Res. 3, 583–617 (2003)

    MATH  MathSciNet  Google Scholar 

  23. Von Luxburg, U.: A tutorial on spectral clustering. Statistics and Computing 17(4), 395–416 (2007)

    Article  MathSciNet  Google Scholar 

  24. Xie, X.L., Beni, G.: A validity measure for fuzzy clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 13(8), 841–847 (1991)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Raffaella Rosasco .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Rosasco, R., Mahmoud, H., Rovetta, S., Masulli, F. (2014). A Quality-Driven Ensemble Approach to Automatic Model Selection in Clustering. In: Bassis, S., Esposito, A., Morabito, F. (eds) Recent Advances of Neural Network Models and Applications. Smart Innovation, Systems and Technologies, vol 26. Springer, Cham. https://doi.org/10.1007/978-3-319-04129-2_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-04129-2_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-04128-5

  • Online ISBN: 978-3-319-04129-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics