Skip to main content

A Comparison of Four Classification Systems Using Rule Sets Induced from Incomplete Data Sets by Local Probabilistic Approximations

  • Conference paper
  • First Online:
Book cover Foundations of Intelligent Systems (ISMIS 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10352))

Included in the following conference series:

  • 1664 Accesses

Abstract

This paper is a continuation of our previous research in which we compared four classification strategies using rule sets induced from incomplete data sets and global probabilistic approximations. In our current research we use local probabilistic approximations. In our incomplete data sets, missing attribute values are interpreted as lost values and “do not care” conditions. Our current results are that for symbolic data and numerical data with a few attributes, the best strategy is strength with support, while for data sets with many numerical attributes the best strategy is probability only. Our results for incomplete data sets with many numerical attributes are supported by only three data sets so further research is required.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Clark, P.G., Grzymala-Busse, J.W.: A comparison of classification systems for rule sets induced from incomplete data by probabilistic approximations. In: Proceedings of the ALLDATA 1-st International Conference on Big Data, Small Data, Linked Data and Open Data, pp. 46–51 (2015)

    Google Scholar 

  2. Grzymala-Busse, J.W., Ziarko, W.: Data mining based on rough sets. In: Wang, J. (ed.) Data Mining: Opportunities and Challenges, pp. 142–173. Idea Group Publishing, Hershey (2003)

    Chapter  Google Scholar 

  3. Pawlak, Z., Wong, S.K.M., Ziarko, W.: Rough sets: probabilistic versus deterministic approach. Int. J. Man-Mach. Stud. 29, 81–95 (1988)

    Article  MATH  Google Scholar 

  4. Wong, S.K.M., Ziarko, W.: INFER–an adaptive decision support system based on the probabilistic approximate classification. In: Proceedings of the 6-th International Workshop on Expert Systems and their Applications, pp. 713–726 (1986)

    Google Scholar 

  5. Yao, Y.Y.: Probabilistic rough set approximations. Int. J. Approximate Reasoning 49, 255–271 (2008)

    Article  MATH  Google Scholar 

  6. Ziarko, W.: Variable precision rough set model. J. Comput. Syst. Sci. 46, 39–59 (1993)

    Article  MathSciNet  MATH  Google Scholar 

  7. Clark, P.G., Grzymala-Busse, J.W.: Experiments on probabilistic approximations. In: Proceedings of the 2011 IEEE International Conference on Granular Computing, pp. 144–149 (2011)

    Google Scholar 

  8. Bruha, I.: Missing attribute values. In: Sammut, C., Webb, G.I. (eds.) Encyclopedia of Machine Learning, pp. 674–680. Springer, Heidelberg (2010)

    Google Scholar 

  9. Grzymala-Busse, J.W., Grzymala-Busse, W.J.: Handling missing attribute values. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, 2nd edn, pp. 33–51. Springer, Heidelberg (2010)

    Google Scholar 

  10. Grzymala-Busse, J.W.: A rough set approach to incomplete data. In: Ciucci, D., Wang, G., Mitra, S., Wu, W.-Z. (eds.) RSKT 2015. LNCS, vol. 9436, pp. 3–14. Springer, Cham (2015). doi:10.1007/978-3-319-25754-9_1

    Chapter  Google Scholar 

  11. Grzymala-Busse, J.W.: Three approaches to missing attribute values–a rough set perspective. In: Proceedings of the Workshop on Foundation of Data Mining, in Conjunction with the Fourth IEEE International Conference on Data Mining, pp. 55–62 (2004)

    Google Scholar 

  12. Clark, P.G., Grzymala-Busse, J.W.: Experiments on rule induction from incomplete data using three probabilistic approximations. In: Proceedings of the 2012 IEEE International Conference on Granular Computing, pp. 90–95 (2012)

    Google Scholar 

  13. Grzymala-Busse, J.W., Rzasa, W.: Definability of approximations for a generalization of the indiscernibility relation. In: Proceedings of the 2007 IEEE Symposium on Foundations of Computational Intelligence, pp. 65–72 (2007)

    Google Scholar 

  14. Clark, P.G., Grzymala-Busse, J.W., Kuehnhausen, M.: Local probabilistic approximations for incomplete data. In: Proceedings of the 20-th International Symposium on Methodologies for Intelligent Systems, pp. 93–98 (2012)

    Google Scholar 

  15. Grzymala-Busse, J.W., Clark, P.G., Kuehnhausen, M.: Generalized probabilistic approximations of incomplete data. Int. J. Approximate Reasoning 132, 180–196 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  16. Grzymala-Busse, J.W.: Rough set strategies to data with missing attribute values. In: Notes of the Workshop on Foundations and New Directions of Data Mining, in Conjunction with the Third International Conference on Data Mining, pp. 56–63 (2003)

    Google Scholar 

  17. Lin, T.Y.: Topological and fuzzy rough sets. In: Slowinski, R. (ed.) Intelligent Decision Support. Handbook of Applications and Advances of the Rough Sets Theory, pp. 287–304. Kluwer Academic Publishers, Dordrecht (1992)

    Google Scholar 

  18. Grzymala-Busse, J.W., Rzasa, W.: A local version of the MLEM2 algorithm for rule induction. Fundamenta Informaticae 100, 99–116 (2010)

    MathSciNet  MATH  Google Scholar 

  19. Booker, L.B., Goldberg, D.E., Holland, J.H.: Classifier Systems and Genetic Algorithms. MIT Press, Boston (1990)

    Google Scholar 

  20. Holland, J.H., Holyoak, K.J., Nisbett, R.E.: Induction: Processes of Inference, Learning, and Discovery. MIT Press, Boston (1986)

    Google Scholar 

  21. Stefanowski, J.: Algorithms of Decision Rule Induction in Data Mining. Poznan University of Technology Press, Poznan (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jerzy W. Grzymala-Busse .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Clark, P.G., Gao, C., Grzymala-Busse, J.W. (2017). A Comparison of Four Classification Systems Using Rule Sets Induced from Incomplete Data Sets by Local Probabilistic Approximations. In: Kryszkiewicz, M., Appice, A., Ślęzak, D., Rybinski, H., Skowron, A., Raś, Z. (eds) Foundations of Intelligent Systems. ISMIS 2017. Lecture Notes in Computer Science(), vol 10352. Springer, Cham. https://doi.org/10.1007/978-3-319-60438-1_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-60438-1_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-60437-4

  • Online ISBN: 978-3-319-60438-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics