Abstract
An algorithm of agglomerative hierarchical clustering using an asymmetric similarity measure based on a bag model is proposed. This bag model is studied for document clustering and analysis of information on the web. The definition of an inter-cluster similarity is proposed and a dendrogram output reflecting asymmetry of the similarity measure is shown. It is also proved that the dendrogram has no reversals. An example of word clusters on Twitter shows how the method works.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anderberg, M.R.: Cluster Analysis for Applications. Academic Press, New York (1960)
Everitt, B.S.: Cluster Analysis, 3rd edn. Arnold, London (1993)
Hubert, L.: Min and max hierarchical clustering using asymmetric similarity measures. Psychometrika 38(1), 63–72 (1973)
Miyamoto, S.: Fuzzy Sets in Information Retrieval and Cluster Analysis. Kluwer, Dordrecht (1990)
Miyamoto, S.: Introduction to Cluster Analysis, Morikita-Shuppan, Tokyo (1999) (in Japanese)
Okada, A., Iwamoto, T.: A Comparison before and after the Joint First Stage Achievement Test by Asymmetric Cluster Analysis. Behaviormetrika 23(2), 169–185 (1996)
Saito, T., Yadohisa, H.: Data Analysis of Asymmetric Structures. Marcel Dekker, New York (2005)
Takeuchi, A., Saito, T., Yadohisa, H.: Asymmetric agglomerative hierarchical clustering algorithms and their evaluations. Journal of Classification 24, 123–143 (2007)
Takumi, S., Miyamoto, S.: Agglomerative Clustering Using Asymmetric Similarities. In: Torra, V., Narakawa, Y., Yin, J., Long, J. (eds.) MDAI 2011. LNCS (LNAI), vol. 6820, pp. 114–125. Springer, Heidelberg (2011)
Yadohisa, H.: Formulation of Asymmetric Agglomerative Clustering and Graphical Representation of Its Result. J. of Japanese Society of Computational Statistics 15(2), 309–316 (2002) (in Japanese)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Takumi, S., Miyamoto, S. (2011). Agglomerative Hierarchical Clustering Using Asymmetric Similarity Based on a Bag Model and Application to Information on the Web. In: Tang, Y., Huynh, VN., Lawry, J. (eds) Integrated Uncertainty in Knowledge Modelling and Decision Making. IUKM 2011. Lecture Notes in Computer Science(), vol 7027. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24918-1_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-24918-1_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24917-4
Online ISBN: 978-3-642-24918-1
eBook Packages: Computer ScienceComputer Science (R0)