An Overview of Hierarchical and Non-hierarchical Algorithms of Clustering for Semi-supervised Classification

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7647)

Abstract

This paper gives an overview of agglomerative hierarchical and non-hierarchical clustering methods for semi-supervised classification. Two formulations of semi-supervised classification are introduced: one uses pairwise constraints, and the other does not. Two methods, the mixture of densities and fuzzy c-means, are contrasted, and their theoretical properties are discussed. A number of agglomerative hierarchical algorithms are then discussed. It is shown that single linkage has different characteristics from complete linkage and average linkage. The centroid method and the Ward method are also discussed. It is also shown that must-link constraints and cannot-link constraints are handled in different ways in these methods.
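As a rough illustration of the ideas named in the abstract, the following Python sketch shows one generic way pairwise constraints can enter agglomerative clustering. It is not taken from the paper: the function name `constrained_agglomerative`, its parameters, and the policy used here (merge must-link pairs before clustering starts, and never merge two clusters that would join a cannot-link pair) are assumptions chosen for simplicity. The `linkage` argument switches between single, complete, and average linkage so their behaviour can be compared on the same data.

```python
from itertools import combinations
from math import dist


def constrained_agglomerative(points, n_clusters, must_link=(),
                              cannot_link=(), linkage="single"):
    """Hypothetical sketch: agglomerative clustering with pairwise constraints.

    Assumed policy (not the paper's): must-link pairs are merged up front,
    and a merge is skipped if it would join a cannot-link pair.
    """
    # Every point starts in its own cluster (sets of point indices).
    clusters = [{i} for i in range(len(points))]

    def find(i):
        return next(c for c in clusters if i in c)

    # Must-link: force the two points into the same initial cluster.
    for a, b in must_link:
        ca, cb = find(a), find(b)
        if ca is not cb:
            clusters.remove(cb)
            ca |= cb

    def violates(c1, c2):
        # True if merging c1 and c2 would join a cannot-link pair.
        return any((a in c1 and b in c2) or (a in c2 and b in c1)
                   for a, b in cannot_link)

    def distance(c1, c2):
        d = [dist(points[i], points[j]) for i in c1 for j in c2]
        if linkage == "single":
            return min(d)
        if linkage == "complete":
            return max(d)
        if linkage == "average":
            return sum(d) / len(d)
        raise ValueError(f"unknown linkage: {linkage}")

    while len(clusters) > n_clusters:
        candidates = [
            (distance(c1, c2), i, j)
            for (i, c1), (j, c2) in combinations(enumerate(clusters), 2)
            if not violates(c1, c2)
        ]
        if not candidates:
            break  # every remaining merge is forbidden by a cannot-link
        _, i, j = min(candidates)
        clusters[i] |= clusters[j]
        del clusters[j]  # j > i, so index i is unaffected

    return [sorted(c) for c in clusters]


points = [(0, 0), (0, 1), (0, 2.1), (5, 0), (5, 1)]
unconstrained = constrained_agglomerative(points, 2)
separated = constrained_agglomerative(points, 2, cannot_link=[(1, 2)])
```

In this toy data set the unconstrained run groups the three left-hand points together, while the cannot-link on points 1 and 2 forces them into different clusters. Other ways of handling constraints, such as penalty terms, are possible and would behave differently.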

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Miyamoto, S. (2012). An Overview of Hierarchical and Non-hierarchical Algorithms of Clustering for Semi-supervised Classification. In: Torra, V., Narukawa, Y., López, B., Villaret, M. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2012. Lecture Notes in Computer Science, vol 7647. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34620-0_1

  • DOI: https://doi.org/10.1007/978-3-642-34620-0_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34619-4

  • Online ISBN: 978-3-642-34620-0

  • eBook Packages: Computer Science, Computer Science (R0)
