Abstract
In this paper we present a method of data decomposition to avoid the necessity of reasoning on data with missing attribute values. The original incomplete data is decomposed into data subsets without missing values. Next, methods for classifier induction are applied to such sets. Finally, a conflict resolving method is used to combine partial answers from classifiers to obtain final classification. We provide an empirical evaluation of the decomposition method with use of various decomposition criteria.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
M. Dash and H. Liu. Feature selection for classification. Intelligent Data Analysis, 1(3), 1997.
E. Frank, L. Trigg, and M. Hall. Weka 3.3.2, Waikato Environment for Knowledge Analysis. http://www.cs.waikato.ac.nz/ml/weka, The University of Waikato, Hamilton, New Zealand, 2002.
Y. Fujikawa and T. B. Ho. Scalable algorithms for dealing with missing values. 2001.
J. W. GrzymaDla-Busse and M. Hu. A comparison of several approaches to missing attribute values in data mining. In W. Ziarko and Y. Y. Yao, editors, Proceedings of 2nd International Conference on Rough Sets and Current Trends in Computing, RSCTC-2000, volume 2005 of LNAI, pages 180–187. Springer, 2000.
J. W. Grzymała-Busse and A. Y. Wang. Modified algorithms LEM1 and LEM2 for rule induction from data with missing attribute values. In Proceedings of 5th Workshop on Rough Sets and Soft Computing (RSSC’97) at the 3rd Joint Conference on Information Sciences, pages 69–72, Research Triangle Park (NC, USA), 1997.
J. Komorowski, Z. Pawlak, L. Polkowski, and A. Skowron. Rough sets: A tutorial. In S. K. Pal and A. Skowron, editors, Rough Fuzzy Hybridization. A New Trend in Decision Making, pages 3–98, Singapore, 1999. Springer.
M. Kryszkiewicz. Properties of incomplete information systems in the framework of rough sets. In L. Polkowski and A. Skowron, editors, Rough Sets in Knowledge Discovery 1: Methodology and Applications, pages 422–450. Physica-Verlag, 1998.
M. Meyer and P. Vlachos. StatLib — Data, Software and News from the Statistics Community. http://lib.stat.cmu.edu/, Carnegie Mellon University, Pittsburgh, PA, 1998.
S. H. Nguyen. Regularity Analysis and its Application in Data Mining. PhD thesis, Warsaw University, Faculty of Mathematics, Computer Science and Mechanics, 1999.
S. H. Nguyen, A. Skowron, and P. Synak. Discovery of data patterns with applications to decomposition and classification problems. In L. Polkowski and A. Skowron, editors, Rough Sets in Knowledge Discovery, volume 2, pages 55–97, Heidelberg, 1998. Physica-Verlag.
J. R. Quinlan. Unknown attribute values in induction. In A. M. Segre, editor, Proceedings of the Sixth International Machine Learning Workshop, pages 31–37. Morgan Kaufmann, 1989.
J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufman, San Mateo, 1993.
A. Skowron and C. Rauszer. The discernibility matrices and functions in information systems. In R. SDlowiński, editor, Intelligent Decision Support. Handbook of Applications and Advances in Rough Sets Theory, pages 331–362, Dordrecht, 1992. Kluwer.
J. Stefanowski and A. Tsoukiás. Incomplete information tables and rough classification. International Journal of Computational Intelligence, 17(3):545–566, August 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Latkowski, R. (2002). Incomplete Data Decomposition for Classification. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds) Rough Sets and Current Trends in Computing. RSCTC 2002. Lecture Notes in Computer Science(), vol 2475. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45813-1_54
Download citation
DOI: https://doi.org/10.1007/3-540-45813-1_54
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44274-5
Online ISBN: 978-3-540-45813-5
eBook Packages: Springer Book Archive