Abstract
As a model of information retrieval on the WWW, a fuzzy multiset model is overviewed and a family of fuzzy document clustering algorithms is developed. The fuzzy multiset model is enhanced in order to adapt clustering applications. The standard proximity measure of the cosine coefficient is generalized in the multiset model, and two basic objective functions of fuzzy c-means are considered. Moreover two methods of handling nonlinear classification is proposed: introduction of a cluster volume variable and a kernel trick used in support vector machines. A crisp c-means algorithm and clustering by competitive learning are also studied. A numerical example based on real documents is shown.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York (1981)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley, New York (2001)
Girolami, M.: Mercer kernel based clustering in feature space. IEEE Trans. on Neural Networks 13(3), 780–784 (2002)
Kohonen, T.: Self-Organization and Associative Memory. Springer-Verlag, Heiderberg (1989)
Liu, Z.Q., Miyamoto, S. (eds.): Soft Computing and Human-Centered Machines. Springer, Tokyo (2000)
Miyamoto, S., Mukaidono, M.: Fuzzy c - means as a regularization and maximum entropy approach. In: Proc. of the 7th International Fuzzy Systems Association World Congress (IFSA 1997), Prague, Chech, June 25-30, vol. II, pp. 86–92 (1997)
Miyamoto, S.: Fuzzy multisets and their generalizations. In: Calude, C.S., Pun, G., Rozenberg, G., Salomaa, A. (eds.) Multiset Processing. LNCS, vol. 2235, pp. 225–235. Springer, Heidelberg (2001)
Miyamoto, S.: Information clustering based on fuzzy multisets. Information Processing and Management 39(2), 195–213 (2003)
Yager, R.R.: On the theory of bags. Int. J. General Systems 13, 23–37 (1986)
Vapnik, V.: Statistical Learning Theory. Wiley, New York (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Miyamoto, S., Mizutani, K. (2004). Fuzzy Multiset Model and Methods of Nonlinear Document Clustering for Information Retrieval. In: Torra, V., Narukawa, Y. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2004. Lecture Notes in Computer Science(), vol 3131. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27774-3_26
Download citation
DOI: https://doi.org/10.1007/978-3-540-27774-3_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22555-3
Online ISBN: 978-3-540-27774-3
eBook Packages: Springer Book Archive