Mining Multiple Clustering Data for Knowledge Discovery

Quan, Thanh Tho; Hui, Siu Cheung; Fong, Alvis

doi:10.1007/978-3-540-39644-4_45

Thanh Tho Quan⁴,
Siu Cheung Hui⁴ &
Alvis Fong⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2843))

Included in the following conference series:

International Conference on Discovery Science

471 Accesses
2 Citations

Abstract

Clustering has been widely used for knowledge discovery. In this paper, we propose an effective approach known as Multi-Clustering to mine the data generated from different clustering methods for discovering relationships between clusters of data. In the proposed Multi-Clustering technique, it first generates combined vectors from the multiple clustering data. Then, the distances between the combined vectors are calculated using the Mahalanobis distance. The Agglomerative Hierarchical Clustering method is used to cluster the combined vectors. And finally, relationship vectors that can be used to identify the cluster relationships are generated. To illustrate the technique, we also discuss an application example that uses the proposed Multi-Clustering technique to mine the author clusters and document clusters for identifying the relationships on authors working on research areas. The performance of the proposed technique is also evaluated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Berkhin, P.: Survey of Clustering Data Mining Techniques. Technical Report. Accrue Soft-ware, Inc (2002)
Google Scholar
Cios, K.J., Pedrycz, W., Swiniarski, R.W.: Data Mining: Methods for Knowledge Discovery. Kluwer Academic Publisher, Norwell (1998)
MATH Google Scholar
Van Rijsbergen, C.: Information Retrieval. Utterworths, London (1979)
Google Scholar
He, Y., Hui, S.C.: Mining aWeb Citation Database for Author Co-citation Analysis. Information Processing and Management 38(4), 491–508 (2002)
Article MATH Google Scholar
He, Y., Hui, S.C., Fong, A.C.M.: Mining a Web Citation Database for Document Clustering. Applied Artificial Intelligence 16(4), 283–302 (2002)
Article Google Scholar
Bohm, C., Berchtold, S.: Keim: Searching in High-Dimensional Spaces – Index structures for Improving the Performance of Multimedia Databases. ACM Computing Surveys 33(8), 322–373 (2001)
Article Google Scholar
Carkacioglu, A., Vural, F.Y.: Learning Similarity Space. In: International Conference on Image Processing, pp. 405–408 (2002)
Google Scholar
Weinberg, S.: Applied linear regression. John Wiley and Sons, Chichester (1985)
Google Scholar
Everitt, B.: Cluster Analysis, 3rd edn. Edward Arnold, London (1993)
Google Scholar
Mitchell, T.M.: Machine Learning. McGraw Hill, United States (1997)
MATH Google Scholar
Boley, D.: Principal Direction Divisive Partitioning. Data Mining and Knowledge Discovery 2(4), 325–344 (1998)
Article Google Scholar
Zamir, O., Etzioni, O.: Web Document Clustering: a Feasibility Demonstration. In: Proceeding of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 46–54 (1998)
Google Scholar
Kohonen, T.: Self-Organizing Maps. Springer, Berlin (2001)
MATH Google Scholar
Grossberg, S.: The Adaptive Self-Organization of Serial Order in Behavior: Speech, Language and Motor Control. In: Pattern Recognition By Humans and Machines, vol. I, Speech Perception. Academic Press Inc., London (1986)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Engineering, Nanyang Technological University, Singapore
Thanh Tho Quan, Siu Cheung Hui & Alvis Fong

Authors

Thanh Tho Quan
View author publications
You can also search for this author in PubMed Google Scholar
Siu Cheung Hui
View author publications
You can also search for this author in PubMed Google Scholar
Alvis Fong
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

FG Knowledge Engineering, FB Informatik, Technical University Darmstadt, Hochschulstr. 10, 64289, Darmstadt
Gunter Grieser
Meme Media Laboratory, Hokkaido University, N13 W8, 0608628, Sapporo, Japan
Yuzuru Tanaka
Graduate School of Informatics, Kyoto University Yoshida Honmachi, Sakyo-ku, 606-850, Kyoto, Japan
Akihiro Yamamoto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Quan, T.T., Hui, S.C., Fong, A. (2003). Mining Multiple Clustering Data for Knowledge Discovery. In: Grieser, G., Tanaka, Y., Yamamoto, A. (eds) Discovery Science. DS 2003. Lecture Notes in Computer Science(), vol 2843. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39644-4_45

Download citation

DOI: https://doi.org/10.1007/978-3-540-39644-4_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20293-6
Online ISBN: 978-3-540-39644-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics