Skip to main content

Data Clustering Algorithms for Information Systems

  • Conference paper
Rough Sets, Fuzzy Sets, Data Mining and Granular Computing (RSFDGrC 2007)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4482))

Abstract

Although the approaches are fundamentally different, the derivation of decision rules from information systems in the form of tables can be compared to supervised classification in pattern recognition; in the latter case classification rules should be derived from the classes of given points in a feature space. We also notice that methods of unsupervised classification (in other words, data clustering) in pattern recognition are closely related to supervised classification techniques. This observation leads us to the discussion of clustering for information systems by investigating relations between the two methods in the pattern classification. We thus discuss a number of methods of data clustering of information tables without decision attributes on the basis of rough set approach in this paper. Current clustering algorithms using rough sets as well as new algorithms motivated from pattern classification techniques are considered. Agglomerative clustering are generalized into a method of poset-valued clustering for discussing structures of information systems using new notations in relational databases. On the other hand K-means algorithms are developed using the kernel function approach. Illustrative examples are given.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York (1981)

    Book  MATH  Google Scholar 

  2. Birkhoff, G.: Lattice Theory. Amer. Math. Soc. (1967)

    Google Scholar 

  3. Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000)

    Book  MATH  Google Scholar 

  4. Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley, Chichester (2001)

    MATH  Google Scholar 

  5. Everitt, B.S.: Cluster Analysis, 3rd edn. Arnold, London (1993)

    MATH  Google Scholar 

  6. Hirano, S., Tsumoto, S.: A framework for unsupervised selection of indiscernibility threshold in rough clustering. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 872–881. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  7. Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, Chichester (1990)

    Book  MATH  Google Scholar 

  8. Lingras, P., West, C.: Interval set clustering of web users with rough K-means. J. of Intel. Informat. Sci. 23(1), 5–16 (2004)

    Article  MATH  Google Scholar 

  9. Liu, Z.Q., Miyamoto, S. (eds.): Soft Computing and Human-Centered Machines. Springer, Tokyo (2000)

    MATH  Google Scholar 

  10. MacLane, S., Birkhoff, G.: Algebra, 2nd edn. Macmillan, Basingstoke (1979)

    MATH  Google Scholar 

  11. Miyamoto, S.: Fuzzy Sets in Information Retrieval and Cluster Analysis. Kluwer, Dordrecht (1990)

    Book  MATH  Google Scholar 

  12. Miyamoto, S.: Introduction to Cluster Analysis: Theory and Applications of Fuzzy Clustering (in Japanese). Morikita-Shuppan, Tokyo (1990)

    Google Scholar 

  13. Miyamoto, S., Suizu, D.: Fuzzy c-means clustering using transformations into high-dimensional spaces. In: Proc. of FSKD’02: 1st International Conference on Fuzzy Systems and Knowledge Discovery, vol. 2 Singapore, Nov. 18-22, 2002, pp. 656–660 (2002)

    Google Scholar 

  14. Miyamoto, S., Nakayama, Y.: Algorithms of hard c-means clustering using kernel functions in support vector machines. Journal of Advanced Computational Intelligence and Intelligent Informatics 7(1), 19–24 (2003)

    Article  Google Scholar 

  15. Miyamoto, S., Suizu, D.: Fuzzy c-means clustering using kernel functions in support vector machines. Journal of Advanced Computational Intelligence and Intelligent Informatics 7(1), 25–30 (2003)

    Article  Google Scholar 

  16. Miyamoto, S., Hayakawa, S.: A fuzzy neighborhood model for clustering, classification, and approximations. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 882–890. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  17. Miyamoto, S.: Lattice-valued hierarchical clustering for analyzing information systems. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 909–917. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  18. Miyamoto, S., Mizutani, K.: Fuzzy multiset space and c-means clustering using kernels with application to information retrieval. In: De Baets, B., Kaynak, O., Bilgiç, T. (eds.) IFSA 2003. LNCS, vol. 2715, pp. 387–395. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  19. Okuzaki, T., et al.: A rough set based clustering method by knowledge combination. IEICE Trans. on Informat. Syst. E85-D(12), 1898–1908 (2002)

    Google Scholar 

  20. Pawlak, Z.: Rough sets. International Journal of Computer and Information Sciences 11, 341–356 (1982)

    Article  MathSciNet  MATH  Google Scholar 

  21. Pawlak, Z.: Rough Sets. Kluwer Academic Publishers, Dordrecht (1991)

    Book  MATH  Google Scholar 

  22. Pawlak, Z., Skowron, A.: Rudiments of rough sets. Information Sciences 177, 3–27 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  23. Peters, G., Lampert, M.: A partitive rough clustering algorithm. In: Greco, S., et al. (eds.) RSCTC 2006. LNCS (LNAI), vol. 4259, pp. 657–666. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  24. Ullman, J.D.: Database and Knowledge-base Systems: Volume I. Computer Science Press, Rockville (1988)

    Google Scholar 

  25. Vapnik, V.N.: Statistical Learning Theory. Wiley, Chichester (1998)

    MATH  Google Scholar 

  26. Vapnik, V.N.: The Nature of the Statistical Learning Theory, 2nd edn. Springer, Heidelberg (2000)

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Miyamoto, S. (2007). Data Clustering Algorithms for Information Systems. In: An, A., Stefanowski, J., Ramanna, S., Butz, C.J., Pedrycz, W., Wang, G. (eds) Rough Sets, Fuzzy Sets, Data Mining and Granular Computing. RSFDGrC 2007. Lecture Notes in Computer Science(), vol 4482. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72530-5_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-72530-5_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-72529-9

  • Online ISBN: 978-3-540-72530-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics