Skip to main content

Part of the book series: Studies in Computational Intelligence ((SCI,volume 245))

Abstract

In this work we combine clustering ensembles and semi-supervised clustering to address the ill-posed nature of clustering. We introduce a hybrid approach that extends our previous work on clustering ensembles to situations where some knowledge from the end user is available, by enforcing constraints during the partitioning process. The experimental results show that our constrained ensemble technique is capable of producing a partition that is as good as, or better, than those computed by other semi-supervised clustering approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Al-Razgan, M., Domeniconi, C.: Weighted clustering ensembles. In: Proc. 2006 SIAM Int. Conf. Data Mining, Bethesda, MD, pp. 258–269. SIAM, Philadelphia (2006)

    Google Scholar 

  2. Asuncion, A., Newman, D.J.: UCI Machine Learning Repository. University of California, Irvine, School of Information and Computer Sciences (2007)

    Google Scholar 

  3. Bar-Hillel, A., Hertz, T., Shental, N., Weinshall, D.: Learning distance functions using equivalence relations. In: Fawcett, T., Mishra, N. (eds.) Proc. 20th Int. Conf. Mach. Learn., pp. 11–18. AAAI Press, Menlo Park (2003)

    Google Scholar 

  4. Basu, S., Banerjee, A., Mooney, R.: Semi-supervised clustering by seeding. In: Sammut, C., Hoffmann, A.G. (eds.) Proc. 19th Int. Conf. Mach. Learn., Sydney, NSW, Australia, pp. 27–34. Morgan Kaufmann, San Francisco (2002)

    Google Scholar 

  5. Dimitriadou, E., Weingessel, A., Hornik, K.: A mixed ensemble approach for the semi-supervised problem. In: Dorronsoro, J.R. (ed.) ICANN 2002. LNCS, vol. 2415, pp. 571–576. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  6. Domeniconi, C., Papadopoulos, D., Gunopulos, D., Ma, S.: Subspace clustering of high dimensional data. In: Proc. 2004 SIAM Int. Conf. Data Mining, Arlington, VA, pp. 517–521. SIAM, Philadelphia (2004)

    Google Scholar 

  7. Domeniconi, C., Gunopulos, D., Ma, S., Yan, B., Al-Razgan, M., Papadopoulos, D.: Locally adaptive metrics for clustering high dimensional data. Data Mining and Knowl. Discovery J. 14(1), 63–97 (2007)

    Article  MathSciNet  Google Scholar 

  8. Fred, A., Jain, A.: Data clustering using evidence accumulation. In: Proc. 16th Int. Conf. Patt. Recogn., Quebec, QB, pp. 276–280. IEEE Comp. Soc., Los Alamitos (2002)

    Google Scholar 

  9. Greene, D., Cunningham, P.: An ensemble approach to identifying informative constraints for semi-supervised clustering. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 140–151. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  10. Hu, X.: Integration of cluster ensemble and text summarization for gene expression analysis. In: Proc. 4th IEEE Symp. Bioinformatics and Bioengineering, Taichung, Taiwan, pp. 251–258. IEEE Comp. Soc., Los Alamitos (2004)

    Google Scholar 

  11. Kang, N., Domeniconi, C., Barbará, D.: Categorization and keyword identification of unlabeled documents. In: Proc. 5th IEEE Int. Conf. Data Mining, Houston, TX, pp. 677–680. IEEE Comp. Soc., Los Alamitos (2005)

    Google Scholar 

  12. Karypis, G., Kumar, V.: A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J. Scientific Computing 20(1), 359–392 (1998)

    Article  MathSciNet  Google Scholar 

  13. Kuncheva, L., Hadjitodorov, S.: Using diversity in cluster ensembles. In: Proc. 2004 Int. Conf. Syst. Man and Cybern., The Hague, The Netherlands, pp. 1214–1219. IEEE Comp. Soc., Los Alamitos (2004)

    Google Scholar 

  14. Mangasarian, O., Wolberg, W.: Cancer diagnosis via linear programming. SIAM News 23(5), 1–18 (1990)

    Google Scholar 

  15. Wagstaff, K., Cardie, C., Rogers, S., Schrödl, S.: Constrained k-means clustering with background knowledge. In: Brodley, C., Pohoreckyj-Danyluk, A. (eds.) Proc. 18th Int. Conf. Mach. Learn., Williamstown, MA, pp. 577–584. Morgan Kaufmann, San Francisco (2001)

    Google Scholar 

  16. Zeng, Y., Tang, J., Garcia-Frias, J., Gao, G.: An adaptive meta-clustering approach: combining the information from different clustering results. In: Proc. 1st IEEE Comp. Soc. Conf. Bioinformatics, Stanford, CA, pp. 276–287. IEEE Comp. Soc., Los Alamitos (2002)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Al-Razgan, M., Domeniconi, C. (2009). Clustering Ensembles with Active Constraints. In: Okun, O., Valentini, G. (eds) Applications of Supervised and Unsupervised Ensemble Methods. Studies in Computational Intelligence, vol 245. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03999-7_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03999-7_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03998-0

  • Online ISBN: 978-3-642-03999-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics