Abstract
Clustering is widely used for knowledge discovery. In this paper, we propose an approach called Multi-Clustering, which mines the data generated by different clustering methods to discover relationships between clusters of data. The technique first generates combined vectors from the multiple clustering data. It then calculates the distances between the combined vectors using the Mahalanobis distance and clusters the combined vectors with the Agglomerative Hierarchical Clustering method. Finally, it generates relationship vectors that can be used to identify the cluster relationships. To illustrate the technique, we also discuss an example application that uses Multi-Clustering to mine author clusters and document clusters in order to identify the relationships between authors and the research areas they work on. The performance of the proposed technique is also evaluated.
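The middle steps of the pipeline described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not define how combined vectors or relationship vectors are built, so `combine` simply concatenates each object's numeric vector from every clustering result (an assumption), average linkage is assumed for the agglomerative step, a small `ridge` term is added so the covariance matrix always inverts, and relationship-vector generation is left out. All function names and the toy data are hypothetical.

```python
from itertools import combinations


def combine(*views):
    """Assumed combination rule: concatenate each object's vector from every view."""
    return [[v for part in parts for v in part] for parts in zip(*views)]


def covariance(rows):
    """Sample covariance matrix of equal-length numeric rows."""
    n, dim = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(dim)]
    return [[sum((r[a] - mean[a]) * (r[b] - mean[b]) for r in rows) / (n - 1)
             for b in range(dim)] for a in range(dim)]


def invert(matrix, ridge=1e-6):
    """Gauss-Jordan inverse; `ridge` on the diagonal is an assumption that
    keeps a singular covariance matrix invertible."""
    dim = len(matrix)
    aug = [[matrix[i][j] + (ridge if i == j else 0.0) for j in range(dim)]
           + [1.0 if i == j else 0.0 for j in range(dim)] for i in range(dim)]
    for col in range(dim):
        pivot = max(range(col, dim), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(dim):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[dim:] for row in aug]


def mahalanobis(x, y, inv_cov):
    """sqrt((x - y)^T S^-1 (x - y)) for an inverse covariance matrix S^-1."""
    diff = [a - b for a, b in zip(x, y)]
    proj = [sum(row[j] * diff[j] for j in range(len(diff))) for row in inv_cov]
    return max(0.0, sum(d * p for d, p in zip(diff, proj))) ** 0.5


def agglomerative(vectors, n_clusters):
    """Average-linkage agglomerative clustering on Mahalanobis distances."""
    inv_cov = invert(covariance(vectors))
    dist = {(i, j): mahalanobis(vectors[i], vectors[j], inv_cov)
            for i, j in combinations(range(len(vectors)), 2)}
    clusters = [[i] for i in range(len(vectors))]

    def average_link(a, b):
        return sum(dist[min(i, j), max(i, j)] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Merge the pair of clusters with the smallest average distance.
        a, b = min(combinations(range(len(clusters)), 2),
                   key=lambda p: average_link(clusters[p[0]], clusters[p[1]]))
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Toy data: one score per object from each of two hypothetical clustering results.
view_a = [[0.0], [0.2], [5.0], [5.1], [0.1], [0.0]]
view_b = [[0.1], [0.0], [0.2], [0.0], [5.0], [5.2]]
vectors = combine(view_a, view_b)
clusters = agglomerative(vectors, 3)
print(clusters)
```

Because the Mahalanobis distance rescales the data by its overall covariance, the number of objects should comfortably exceed the dimensionality of the combined vectors; with too few objects all pairwise distances become nearly equal and the grouping is arbitrary.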
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Quan, T.T., Hui, S.C., Fong, A. (2003). Mining Multiple Clustering Data for Knowledge Discovery. In: Grieser, G., Tanaka, Y., Yamamoto, A. (eds) Discovery Science. DS 2003. Lecture Notes in Computer Science, vol 2843. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39644-4_45
DOI: https://doi.org/10.1007/978-3-540-39644-4_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20293-6
Online ISBN: 978-3-540-39644-4
eBook Packages: Springer Book Archive