Abstract
Although the approaches are fundamentally different, the derivation of decision rules from information systems in the form of tables can be compared to supervised classification in pattern recognition; in the latter case classification rules should be derived from the classes of given points in a feature space. We also notice that methods of unsupervised classification (in other words, data clustering) in pattern recognition are closely related to supervised classification techniques. This observation leads us to the discussion of clustering for information systems by investigating relations between the two methods in the pattern classification. We thus discuss a number of methods of data clustering of information tables without decision attributes on the basis of rough set approach in this paper. Current clustering algorithms using rough sets as well as new algorithms motivated from pattern classification techniques are considered. Agglomerative clustering are generalized into a method of poset-valued clustering for discussing structures of information systems using new notations in relational databases. On the other hand K-means algorithms are developed using the kernel function approach. Illustrative examples are given.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York (1981)
Birkhoff, G.: Lattice Theory. Amer. Math. Soc. (1967)
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley, Chichester (2001)
Everitt, B.S.: Cluster Analysis, 3rd edn. Arnold, London (1993)
Hirano, S., Tsumoto, S.: A framework for unsupervised selection of indiscernibility threshold in rough clustering. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 872–881. Springer, Heidelberg (2006)
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, Chichester (1990)
Lingras, P., West, C.: Interval set clustering of web users with rough K-means. J. of Intel. Informat. Sci. 23(1), 5–16 (2004)
Liu, Z.Q., Miyamoto, S. (eds.): Soft Computing and Human-Centered Machines. Springer, Tokyo (2000)
MacLane, S., Birkhoff, G.: Algebra, 2nd edn. Macmillan, Basingstoke (1979)
Miyamoto, S.: Fuzzy Sets in Information Retrieval and Cluster Analysis. Kluwer, Dordrecht (1990)
Miyamoto, S.: Introduction to Cluster Analysis: Theory and Applications of Fuzzy Clustering (in Japanese). Morikita-Shuppan, Tokyo (1990)
Miyamoto, S., Suizu, D.: Fuzzy c-means clustering using transformations into high-dimensional spaces. In: Proc. of FSKD’02: 1st International Conference on Fuzzy Systems and Knowledge Discovery, vol. 2 Singapore, Nov. 18-22, 2002, pp. 656–660 (2002)
Miyamoto, S., Nakayama, Y.: Algorithms of hard c-means clustering using kernel functions in support vector machines. Journal of Advanced Computational Intelligence and Intelligent Informatics 7(1), 19–24 (2003)
Miyamoto, S., Suizu, D.: Fuzzy c-means clustering using kernel functions in support vector machines. Journal of Advanced Computational Intelligence and Intelligent Informatics 7(1), 25–30 (2003)
Miyamoto, S., Hayakawa, S.: A fuzzy neighborhood model for clustering, classification, and approximations. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 882–890. Springer, Heidelberg (2006)
Miyamoto, S.: Lattice-valued hierarchical clustering for analyzing information systems. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 909–917. Springer, Heidelberg (2006)
Miyamoto, S., Mizutani, K.: Fuzzy multiset space and c-means clustering using kernels with application to information retrieval. In: De Baets, B., Kaynak, O., Bilgiç, T. (eds.) IFSA 2003. LNCS, vol. 2715, pp. 387–395. Springer, Heidelberg (2003)
Okuzaki, T., et al.: A rough set based clustering method by knowledge combination. IEICE Trans. on Informat. Syst. E85-D(12), 1898–1908 (2002)
Pawlak, Z.: Rough sets. International Journal of Computer and Information Sciences 11, 341–356 (1982)
Pawlak, Z.: Rough Sets. Kluwer Academic Publishers, Dordrecht (1991)
Pawlak, Z., Skowron, A.: Rudiments of rough sets. Information Sciences 177, 3–27 (2007)
Peters, G., Lampert, M.: A partitive rough clustering algorithm. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 657–666. Springer, Heidelberg (2006)
Ullman, J.D.: Database and Knowledge-base Systems: Volume I. Computer Science Press, Rockville (1988)
Vapnik, V.N.: Statistical Learning Theory. Wiley, Chichester (1998)
Vapnik, V.N.: The Nature of the Statistical Learning Theory, 2nd edn. Springer, Heidelberg (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Miyamoto, S. (2007). Data Clustering Algorithms for Information Systems. In: An, A., Stefanowski, J., Ramanna, S., Butz, C.J., Pedrycz, W., Wang, G. (eds) Rough Sets, Fuzzy Sets, Data Mining and Granular Computing. RSFDGrC 2007. Lecture Notes in Computer Science(), vol 4482. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72530-5_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-72530-5_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72529-9
Online ISBN: 978-3-540-72530-5
eBook Packages: Computer ScienceComputer Science (R0)