Skip to main content

Incomplete Data Decomposition for Classification

  • Conference paper
  • First Online:
Rough Sets and Current Trends in Computing (RSCTC 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2475))

Included in the following conference series:

Abstract

In this paper we present a method of data decomposition to avoid the necessity of reasoning on data with missing attribute values. The original incomplete data is decomposed into data subsets without missing values. Next, methods for classifier induction are applied to such sets. Finally, a conflict resolving method is used to combine partial answers from classifiers to obtain final classification. We provide an empirical evaluation of the decomposition method with use of various decomposition criteria.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. M. Dash and H. Liu. Feature selection for classification. Intelligent Data Analysis, 1(3), 1997.

    Google Scholar 

  2. E. Frank, L. Trigg, and M. Hall. Weka 3.3.2, Waikato Environment for Knowledge Analysis. http://www.cs.waikato.ac.nz/ml/weka, The University of Waikato, Hamilton, New Zealand, 2002.

    Google Scholar 

  3. Y. Fujikawa and T. B. Ho. Scalable algorithms for dealing with missing values. 2001.

    Google Scholar 

  4. J. W. GrzymaDla-Busse and M. Hu. A comparison of several approaches to missing attribute values in data mining. In W. Ziarko and Y. Y. Yao, editors, Proceedings of 2nd International Conference on Rough Sets and Current Trends in Computing, RSCTC-2000, volume 2005 of LNAI, pages 180–187. Springer, 2000.

    Google Scholar 

  5. J. W. Grzymała-Busse and A. Y. Wang. Modified algorithms LEM1 and LEM2 for rule induction from data with missing attribute values. In Proceedings of 5th Workshop on Rough Sets and Soft Computing (RSSC’97) at the 3rd Joint Conference on Information Sciences, pages 69–72, Research Triangle Park (NC, USA), 1997.

    Google Scholar 

  6. J. Komorowski, Z. Pawlak, L. Polkowski, and A. Skowron. Rough sets: A tutorial. In S. K. Pal and A. Skowron, editors, Rough Fuzzy Hybridization. A New Trend in Decision Making, pages 3–98, Singapore, 1999. Springer.

    Google Scholar 

  7. M. Kryszkiewicz. Properties of incomplete information systems in the framework of rough sets. In L. Polkowski and A. Skowron, editors, Rough Sets in Knowledge Discovery 1: Methodology and Applications, pages 422–450. Physica-Verlag, 1998.

    Google Scholar 

  8. M. Meyer and P. Vlachos. StatLib — Data, Software and News from the Statistics Community. http://lib.stat.cmu.edu/, Carnegie Mellon University, Pittsburgh, PA, 1998.

    Google Scholar 

  9. S. H. Nguyen. Regularity Analysis and its Application in Data Mining. PhD thesis, Warsaw University, Faculty of Mathematics, Computer Science and Mechanics, 1999.

    Google Scholar 

  10. S. H. Nguyen, A. Skowron, and P. Synak. Discovery of data patterns with applications to decomposition and classification problems. In L. Polkowski and A. Skowron, editors, Rough Sets in Knowledge Discovery, volume 2, pages 55–97, Heidelberg, 1998. Physica-Verlag.

    Google Scholar 

  11. J. R. Quinlan. Unknown attribute values in induction. In A. M. Segre, editor, Proceedings of the Sixth International Machine Learning Workshop, pages 31–37. Morgan Kaufmann, 1989.

    Google Scholar 

  12. J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufman, San Mateo, 1993.

    Google Scholar 

  13. A. Skowron and C. Rauszer. The discernibility matrices and functions in information systems. In R. SDlowiński, editor, Intelligent Decision Support. Handbook of Applications and Advances in Rough Sets Theory, pages 331–362, Dordrecht, 1992. Kluwer.

    Google Scholar 

  14. J. Stefanowski and A. Tsoukiás. Incomplete information tables and rough classification. International Journal of Computational Intelligence, 17(3):545–566, August 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Latkowski, R. (2002). Incomplete Data Decomposition for Classification. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds) Rough Sets and Current Trends in Computing. RSCTC 2002. Lecture Notes in Computer Science(), vol 2475. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45813-1_54

Download citation

  • DOI: https://doi.org/10.1007/3-540-45813-1_54

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44274-5

  • Online ISBN: 978-3-540-45813-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics