Abstract
Rough set based rule induction methods have been applied to knowledge discovery in databases, whose empirical results obtained show that they are very powerful and that some important knowledge has been extracted from datasets. For rule induction, lower/upper approximations and reducts play important roles and the approximations can be extended to variable precision model, using accuracy and coverage. However, the formal characteristics of accuracy and coverage for rule induction have never been discussed. In this paper, several following characteristics of accuracy and coverage are discussed: (1) accuracy and coverage measure the degree of sufficiency an necessity, respectively. Also, they measure that of lower approximation and upper approximation. (2) Coverage can be viewed as likelihood. (3) These two measures are related with statistical independence. (4) These two indices have trade-off relations. (5) When we focus on the conjunction of attribute-value pairs, coverage decreases more than accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Darst, R. B. Introduction to Linear Programming, Marcel and Dekker, 1990.
Polkowski, L. and Skowron, A. (Eds.) Rough Sets and Knowledge Discovery 1, Physica Verlag, Heidelberg, 1998.
Polkowski, L. and Skowron, A. (Eds.) Rough Sets and Knowledge Discovery 2, Physica Verlag, Heidelberg, 1998.
Pawlak, Z., Rough Sets. Kluwer Academic Publishers, Dordrecht, 1991.
Rissanen J: Stochastic Complexity in Statistical Inquiry. World Scientific, Singapore, 1989.
Skowron, A. and Grzymala-Busse, J. From rough set theory to evidence theory. In: Yager, R., Fedrizzi, M. and Kacprzyk, J.(eds.) Advances in the Dempster-Shafer Theory of Evidence, pp. 193–236, John Wiley & Sons, New York, 1994.
Tsumoto, S. Extraction of Experts’ Decision Rules from Clinical Databases using Rough Set Model Journal of Intelligent Data Analysis, 2(3), 1998.
Tsumoto, S. Knowledge discovery in clinical databases and evaluation of discovered knowledge in outpatient clinic. Information Sciences, 124, 125–137, 2000.
Tsumoto, S. Automated Discovery of Positive and Negative Knowledge in Clinical Databases based on Rough Set Model., IEEE EMB Magazine, 56–62, 2000.
Tsumoto, S. Statistical Extension of Rough Set Rule Induction Proceedings of SPIE: Data Mining and Knowledge Discovery: Theory, Tools, and Technology III, 2001.
Yao, Y.Y. and Wong, S.K.M., A decision theoretic framework for approximating concepts, International Journal of Man-machine Studies, 37, 793–809, 1992.
Yao, Y.Y. and Zhong, N., An analysis of quantitative measures associated with rules, N. Zhong and L. Zhou (Eds.), Methodologies for Knowledge Discovery and Data Mining, Proceedings of the Third Pacific-Asia Conference on Knowledge Discovery and Data Mining, LNAI 1574, Springer, Berlin, pp. 479–488, 1999.
Ziarko, W., Variable Precision Rough Set Model. Journal of Computer and System Sciences, 46, 39–59, 1993.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tsumoto, S. (2002). Accuracy and Coverage in Rough Set Rule Induction. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds) Rough Sets and Current Trends in Computing. RSCTC 2002. Lecture Notes in Computer Science(), vol 2475. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45813-1_49
Download citation
DOI: https://doi.org/10.1007/3-540-45813-1_49
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44274-5
Online ISBN: 978-3-540-45813-5
eBook Packages: Springer Book Archive