Abstract
One way of representing semantics is via a high dimensional conceptual space constructed from lexical co-occurrence. Concepts (words) are represented as a vector whereby the dimensions are other words. As the words are represented as dimensional objects, clustering techniques can be applied to compute word clusters. Conventional clustering algorithms, e.g., the K-means method, however, normally produce crisp clusters, i.e., an object is assigned to only one cluster. This is sometimes not desirable. Therefore, a fuzzy membership function can be applied to the K-Means clustering, which models the degree of an object belonging to certain cluster. This paper introduces a fuzzy k-means clustering algorithm and how it is used to word clustering on the high dimensional semantic space constructed by a cognitively motivated semantic space model, namely Hyperspace Analogue to Language. A case study demonstrates the method is promising.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bezdek, J.C.: Pattern Recognition with Fussy Objective Function Algorithms. Plenum, New York (1981)
Gärdenfors, P.: Conceptual Spaces: The Geometry of Thought. MIT Press, Cambridge (2000)
Gärdenfors, P., Williams, M.: Reasoning about Categories in Conceptual Spaces. In: Proceedings of 14th International Joint Conference of Artificial intelligence (IJCAI 2001), pp. 385–392 (2001)
Hathaway, R.J., Bezdek, J.C., Hu, Y.: Generalized fuzzy c-means clustering strategies using Lp norm distances. IEEE Trans. on Fuzzy Systems 8, 576–582 (2000)
Landauer, T., Dumais, S.: A Solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review 104(2), 211–240 (1997)
Lund, K., Burgess, C.: Producing High-dimensional Semantic Spaces from Lexical Co-occurrence. Behavior research Methods, Instruments, & Computers 28(2), 203–208 (1996)
Song, D., Bruza, P.D.: Towards Context Sensitive Informational Inference. Journal of the American Society for Information Science and Technology (JASIST) 52(4), 321–334 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cao, G., Song, D., Bruza, P. (2004). Fuzzy K-Means Clustering on a High Dimensional Semantic Space. In: Yu, J.X., Lin, X., Lu, H., Zhang, Y. (eds) Advanced Web Technologies and Applications. APWeb 2004. Lecture Notes in Computer Science, vol 3007. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24655-8_103
Download citation
DOI: https://doi.org/10.1007/978-3-540-24655-8_103
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21371-0
Online ISBN: 978-3-540-24655-8
eBook Packages: Springer Book Archive