Abstract
In this work we combine clustering ensembles and semi-supervised clustering to address the ill-posed nature of clustering. We introduce a hybrid approach that extends our previous work on clustering ensembles to situations where some knowledge from the end user is available, by enforcing constraints during the partitioning process. The experimental results show that our constrained ensemble technique is capable of producing a partition that is as good as, or better, than those computed by other semi-supervised clustering approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Al-Razgan, M., Domeniconi, C.: Weighted clustering ensembles. In: Proc. 2006 SIAM Int. Conf. Data Mining, Bethesda, MD, pp. 258–269. SIAM, Philadelphia (2006)
Asuncion, A., Newman, D.J.: UCI Machine Learning Repository. University of California, Irvine, School of Information and Computer Sciences (2007)
Bar-Hillel, A., Hertz, T., Shental, N., Weinshall, D.: Learning distance functions using equivalence relations. In: Fawcett, T., Mishra, N. (eds.) Proc. 20th Int. Conf. Mach. Learn., pp. 11–18. AAAI Press, Menlo Park (2003)
Basu, S., Banerjee, A., Mooney, R.: Semi-supervised clustering by seeding. In: Sammut, C., Hoffmann, A.G. (eds.) Proc. 19th Int. Conf. Mach. Learn., Sydney, NSW, Australia, pp. 27–34. Morgan Kaufmann, San Francisco (2002)
Dimitriadou, E., Weingessel, A., Hornik, K.: A mixed ensemble approach for the semi-supervised problem. In: Dorronsoro, J.R. (ed.) ICANN 2002. LNCS, vol. 2415, pp. 571–576. Springer, Heidelberg (2002)
Domeniconi, C., Papadopoulos, D., Gunopulos, D., Ma, S.: Subspace clustering of high dimensional data. In: Proc. 2004 SIAM Int. Conf. Data Mining, Arlington, VA, pp. 517–521. SIAM, Philadelphia (2004)
Domeniconi, C., Gunopulos, D., Ma, S., Yan, B., Al-Razgan, M., Papadopoulos, D.: Locally adaptive metrics for clustering high dimensional data. Data Mining and Knowl. Discovery J. 14(1), 63–97 (2007)
Fred, A., Jain, A.: Data clustering using evidence accumulation. In: Proc. 16th Int. Conf. Patt. Recogn., Quebec, QB, pp. 276–280. IEEE Comp. Soc., Los Alamitos (2002)
Greene, D., Cunningham, P.: An ensemble approach to identifying informative constraints for semi-supervised clustering. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 140–151. Springer, Heidelberg (2007)
Hu, X.: Integration of cluster ensemble and text summarization for gene expression analysis. In: Proc. 4th IEEE Symp. Bioinformatics and Bioengineering, Taichung, Taiwan, pp. 251–258. IEEE Comp. Soc., Los Alamitos (2004)
Kang, N., Domeniconi, C., Barbará, D.: Categorization and keyword identification of unlabeled documents. In: Proc. 5th IEEE Int. Conf. Data Mining, Houston, TX, pp. 677–680. IEEE Comp. Soc., Los Alamitos (2005)
Karypis, G., Kumar, V.: A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J. Scientific Computing 20(1), 359–392 (1998)
Kuncheva, L., Hadjitodorov, S.: Using diversity in cluster ensembles. In: Proc. 2004 Int. Conf. Syst. Man and Cybern., The Hague, The Netherlands, pp. 1214–1219. IEEE Comp. Soc., Los Alamitos (2004)
Mangasarian, O., Wolberg, W.: Cancer diagnosis via linear programming. SIAM News 23(5), 1–18 (1990)
Wagstaff, K., Cardie, C., Rogers, S., Schrödl, S.: Constrained k-means clustering with background knowledge. In: Brodley, C., Pohoreckyj-Danyluk, A. (eds.) Proc. 18th Int. Conf. Mach. Learn., Williamstown, MA, pp. 577–584. Morgan Kaufmann, San Francisco (2001)
Zeng, Y., Tang, J., Garcia-Frias, J., Gao, G.: An adaptive meta-clustering approach: combining the information from different clustering results. In: Proc. 1st IEEE Comp. Soc. Conf. Bioinformatics, Stanford, CA, pp. 276–287. IEEE Comp. Soc., Los Alamitos (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Al-Razgan, M., Domeniconi, C. (2009). Clustering Ensembles with Active Constraints. In: Okun, O., Valentini, G. (eds) Applications of Supervised and Unsupervised Ensemble Methods. Studies in Computational Intelligence, vol 245. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03999-7_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-03999-7_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03998-0
Online ISBN: 978-3-642-03999-7
eBook Packages: EngineeringEngineering (R0)