Incomplete Data Decomposition for Classification

Latkowski, Rafal

doi:10.1007/3-540-45813-1_54

Rafal Latkowski⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2475))

Included in the following conference series:

International Conference on Rough Sets and Current Trends in Computing

854 Accesses
5 Citations

Abstract

In this paper we present a method of data decomposition to avoid the necessity of reasoning on data with missing attribute values. The original incomplete data is decomposed into data subsets without missing values. Next, methods for classifier induction are applied to such sets. Finally, a conflict resolving method is used to combine partial answers from classifiers to obtain final classification. We provide an empirical evaluation of the decomposition method with use of various decomposition criteria.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

M. Dash and H. Liu. Feature selection for classification. Intelligent Data Analysis, 1(3), 1997.
Google Scholar
E. Frank, L. Trigg, and M. Hall. Weka 3.3.2, Waikato Environment for Knowledge Analysis. http://www.cs.waikato.ac.nz/ml/weka, The University of Waikato, Hamilton, New Zealand, 2002.
Google Scholar
Y. Fujikawa and T. B. Ho. Scalable algorithms for dealing with missing values. 2001.
Google Scholar
J. W. GrzymaDla-Busse and M. Hu. A comparison of several approaches to missing attribute values in data mining. In W. Ziarko and Y. Y. Yao, editors, Proceedings of 2nd International Conference on Rough Sets and Current Trends in Computing, RSCTC-2000, volume 2005 of LNAI, pages 180–187. Springer, 2000.
Google Scholar
J. W. Grzymała-Busse and A. Y. Wang. Modified algorithms LEM1 and LEM2 for rule induction from data with missing attribute values. In Proceedings of 5th Workshop on Rough Sets and Soft Computing (RSSC’97) at the 3rd Joint Conference on Information Sciences, pages 69–72, Research Triangle Park (NC, USA), 1997.
Google Scholar
J. Komorowski, Z. Pawlak, L. Polkowski, and A. Skowron. Rough sets: A tutorial. In S. K. Pal and A. Skowron, editors, Rough Fuzzy Hybridization. A New Trend in Decision Making, pages 3–98, Singapore, 1999. Springer.
Google Scholar
M. Kryszkiewicz. Properties of incomplete information systems in the framework of rough sets. In L. Polkowski and A. Skowron, editors, Rough Sets in Knowledge Discovery 1: Methodology and Applications, pages 422–450. Physica-Verlag, 1998.
Google Scholar
M. Meyer and P. Vlachos. StatLib — Data, Software and News from the Statistics Community. http://lib.stat.cmu.edu/, Carnegie Mellon University, Pittsburgh, PA, 1998.
Google Scholar
S. H. Nguyen. Regularity Analysis and its Application in Data Mining. PhD thesis, Warsaw University, Faculty of Mathematics, Computer Science and Mechanics, 1999.
Google Scholar
S. H. Nguyen, A. Skowron, and P. Synak. Discovery of data patterns with applications to decomposition and classification problems. In L. Polkowski and A. Skowron, editors, Rough Sets in Knowledge Discovery, volume 2, pages 55–97, Heidelberg, 1998. Physica-Verlag.
Google Scholar
J. R. Quinlan. Unknown attribute values in induction. In A. M. Segre, editor, Proceedings of the Sixth International Machine Learning Workshop, pages 31–37. Morgan Kaufmann, 1989.
Google Scholar
J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufman, San Mateo, 1993.
Google Scholar
A. Skowron and C. Rauszer. The discernibility matrices and functions in information systems. In R. SDlowiński, editor, Intelligent Decision Support. Handbook of Applications and Advances in Rough Sets Theory, pages 331–362, Dordrecht, 1992. Kluwer.
Google Scholar
J. Stefanowski and A. Tsoukiás. Incomplete information tables and rough classification. International Journal of Computational Intelligence, 17(3):545–566, August 2001.
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science, Warsaw University, ul. Banacha 2, 02-097, Warsaw, Poland
Rafal Latkowski

Authors

Rafal Latkowski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Penn State Great Valley School of Professional Graduate Studies, 30 East Swedesford Road, Malvern, PA, 19355, USA
James J. Alpigini
Department of Electrical and Computer Engineering, University of Manitoba, Winnipeg, MB, R3T 5V6, Canada
James F. Peters
Institute of Mathematics, Warsaw University, Banacha 2, 02-097, Warsaw, Poland
Andrzej Skowron
Department of Systems and Information Engineering, Maebashi Institute of Technology, 460-1 Kamisadori-Cho, Maebashi-City, 371-0816, Japan
Ning Zhong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Latkowski, R. (2002). Incomplete Data Decomposition for Classification. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds) Rough Sets and Current Trends in Computing. RSCTC 2002. Lecture Notes in Computer Science(), vol 2475. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45813-1_54

Download citation

DOI: https://doi.org/10.1007/3-540-45813-1_54
Published: 20 September 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44274-5
Online ISBN: 978-3-540-45813-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics