Abstract
This paper applies rough set theory to find decision rules using ORACLE RDBMS. The major steps include elimination of redundant attributes and of redundant attribute values for each tuple. In this paper three algorithms to extract high frequency decision rules from very large decision tables are presented. One algorithm uses pure SQL syntax. The other two use sorting algorithm and binary tree data structure respectively. The performances among these methods are evaluated and compared. The use of binary tree structure improves the computational time tremendously. So pure SQL results indicate some major change to query optimizer may be desirable if it will be used for data mnining.
Preview
Unable to display preview. Download preview PDF.
References
Z. Pawlak. Rough Sets: Theoretical Aspects of Reasoning about Data, Kluwer Academic, Dordrecht (1991)
T. Y. Lin, “Rough Set Theory in Very Large Databases,” Symposium on Modeling, Analysis and Simulation, IMACS Multi Conference (Computational Engineering in Systems Applications), Lille, France, July 9-12, 1996, Vol. 2 of 2, 936–941.
T. Y. Lin. “An Overview of Rough Set Theory from the Point of View of Relational Databases,” Bulletin of International Rough Set Society, vol. 1, no. 1, March 1997.
R. Agrawal, R. Srikant. “Fast Algorithms for Mining Association Rules,” in Proceeding of 20th VLDB Conference San Tiago, Chile, 1994.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Young (“T.Y.”) Lin, T., Cao, H. (2001). Searching Decision Rules in Very Large Databases Using Rough Set Theory. In: Ziarko, W., Yao, Y. (eds) Rough Sets and Current Trends in Computing. RSCTC 2000. Lecture Notes in Computer Science(), vol 2005. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45554-X_42
Download citation
DOI: https://doi.org/10.1007/3-540-45554-X_42
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43074-2
Online ISBN: 978-3-540-45554-7
eBook Packages: Springer Book Archive