Data Clustering Algorithms for Information Systems

Miyamoto, Sadaaki

doi:10.1007/978-3-540-72530-5_2

Sadaaki Miyamoto²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4482))

Included in the following conference series:

International Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing

1511 Accesses
3 Citations

Abstract

Although the approaches are fundamentally different, the derivation of decision rules from information systems in the form of tables can be compared to supervised classification in pattern recognition; in the latter case classification rules should be derived from the classes of given points in a feature space. We also notice that methods of unsupervised classification (in other words, data clustering) in pattern recognition are closely related to supervised classification techniques. This observation leads us to the discussion of clustering for information systems by investigating relations between the two methods in the pattern classification. We thus discuss a number of methods of data clustering of information tables without decision attributes on the basis of rough set approach in this paper. Current clustering algorithms using rough sets as well as new algorithms motivated from pattern classification techniques are considered. Agglomerative clustering are generalized into a method of poset-valued clustering for discussing structures of information systems using new notations in relational databases. On the other hand K-means algorithms are developed using the kernel function approach. Illustrative examples are given.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York (1981)
Book MATH Google Scholar
Birkhoff, G.: Lattice Theory. Amer. Math. Soc. (1967)
Google Scholar
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000)
Book MATH Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley, Chichester (2001)
MATH Google Scholar
Everitt, B.S.: Cluster Analysis, 3rd edn. Arnold, London (1993)
MATH Google Scholar
Hirano, S., Tsumoto, S.: A framework for unsupervised selection of indiscernibility threshold in rough clustering. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 872–881. Springer, Heidelberg (2006)
Chapter Google Scholar
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, Chichester (1990)
Book MATH Google Scholar
Lingras, P., West, C.: Interval set clustering of web users with rough K-means. J. of Intel. Informat. Sci. 23(1), 5–16 (2004)
Article MATH Google Scholar
Liu, Z.Q., Miyamoto, S. (eds.): Soft Computing and Human-Centered Machines. Springer, Tokyo (2000)
MATH Google Scholar
MacLane, S., Birkhoff, G.: Algebra, 2nd edn. Macmillan, Basingstoke (1979)
MATH Google Scholar
Miyamoto, S.: Fuzzy Sets in Information Retrieval and Cluster Analysis. Kluwer, Dordrecht (1990)
Book MATH Google Scholar
Miyamoto, S.: Introduction to Cluster Analysis: Theory and Applications of Fuzzy Clustering (in Japanese). Morikita-Shuppan, Tokyo (1990)
Google Scholar
Miyamoto, S., Suizu, D.: Fuzzy c-means clustering using transformations into high-dimensional spaces. In: Proc. of FSKD’02: 1st International Conference on Fuzzy Systems and Knowledge Discovery, vol. 2 Singapore, Nov. 18-22, 2002, pp. 656–660 (2002)
Google Scholar
Miyamoto, S., Nakayama, Y.: Algorithms of hard c-means clustering using kernel functions in support vector machines. Journal of Advanced Computational Intelligence and Intelligent Informatics 7(1), 19–24 (2003)
Article Google Scholar
Miyamoto, S., Suizu, D.: Fuzzy c-means clustering using kernel functions in support vector machines. Journal of Advanced Computational Intelligence and Intelligent Informatics 7(1), 25–30 (2003)
Article Google Scholar
Miyamoto, S., Hayakawa, S.: A fuzzy neighborhood model for clustering, classification, and approximations. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 882–890. Springer, Heidelberg (2006)
Chapter Google Scholar
Miyamoto, S.: Lattice-valued hierarchical clustering for analyzing information systems. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 909–917. Springer, Heidelberg (2006)
Chapter Google Scholar
Miyamoto, S., Mizutani, K.: Fuzzy multiset space and c-means clustering using kernels with application to information retrieval. In: De Baets, B., Kaynak, O., Bilgiç, T. (eds.) IFSA 2003. LNCS, vol. 2715, pp. 387–395. Springer, Heidelberg (2003)
Chapter Google Scholar
Okuzaki, T., et al.: A rough set based clustering method by knowledge combination. IEICE Trans. on Informat. Syst. E85-D(12), 1898–1908 (2002)
Google Scholar
Pawlak, Z.: Rough sets. International Journal of Computer and Information Sciences 11, 341–356 (1982)
Article MathSciNet MATH Google Scholar
Pawlak, Z.: Rough Sets. Kluwer Academic Publishers, Dordrecht (1991)
Book MATH Google Scholar
Pawlak, Z., Skowron, A.: Rudiments of rough sets. Information Sciences 177, 3–27 (2007)
Article MathSciNet MATH Google Scholar
Peters, G., Lampert, M.: A partitive rough clustering algorithm. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 657–666. Springer, Heidelberg (2006)
Chapter Google Scholar
Ullman, J.D.: Database and Knowledge-base Systems: Volume I. Computer Science Press, Rockville (1988)
Google Scholar
Vapnik, V.N.: Statistical Learning Theory. Wiley, Chichester (1998)
MATH Google Scholar
Vapnik, V.N.: The Nature of the Statistical Learning Theory, 2nd edn. Springer, Heidelberg (2000)
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Risk Engineering, Faculty of Systems and Information Engineering, University of Tsukuba, Ibaraki 305-8573, Japan
Sadaaki Miyamoto

Authors

Sadaaki Miyamoto
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computer Science and Engineering, York University, M3J 1P3, Toronto, Ontario, Canada
Aijun An
Institute of Computing Sciences, Poznań University of Technology, ul. Piotrowo 2, 60–965, Poznań, Poland
Jerzy Stefanowski
Department of Applied Computer Science, University of Winnipeg, R3B 2E9, Winnipeg, Manitoba, Canada
Sheela Ramanna
Department of Computer Science, University of Regina, S4S 0A2, Regina, Saskatchewan, Canada
Cory J. Butz
Department of Electrical and Computer Engineering, University of Alberta, T6G 2V4, Edmonton, Alberta, Canada
Witold Pedrycz
Institute of Compuer Science and Technology, Chongqing University of Posts and Telecommunications, 40065, Chongqing, P.R. China
Guoyin Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Miyamoto, S. (2007). Data Clustering Algorithms for Information Systems. In: An, A., Stefanowski, J., Ramanna, S., Butz, C.J., Pedrycz, W., Wang, G. (eds) Rough Sets, Fuzzy Sets, Data Mining and Granular Computing. RSFDGrC 2007. Lecture Notes in Computer Science(), vol 4482. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72530-5_2

Download citation

DOI: https://doi.org/10.1007/978-3-540-72530-5_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72529-9
Online ISBN: 978-3-540-72530-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics