Abstract
In this paper we investigate the scalability features of rough set based methods in the context of their applicability in knowledge discovery from databases (KDD) and data mining. We summarize some previously known scalable methods and present one of the latest scalable rough set classifiers. The proposed solution is based on the relationship between rough sets and association discovering methods, which has been described in our previous papers [10] [11]. In this paper, the set of decision rules satisfying the test object is generated directly from the training data set. To make it scalable, we adopted the idea of the FP-growth algorithm for frequent item-sets [7], [6]. The proposed method can be applied in construction of incremental rule-based classification system for stream data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules, pp. 307–328. American Association for Artificial Intelligence, Menlo Park (1996)
Bazan, J.G.: A comparison and non-dynamic rough set method for extracting laws decision tables. In: Polkowski, L., Skowron, A. (eds.) Rough Sets in Knowledge Discovery 1. Methodology and Applications, Studies in Fuzziness and Soft Computing, pp. 321–365. Physica-Verlag, Heidelberg (1998)
Bondi, A.B.: Characteristics of scalability and their impact on performance. In: WOSP 2000: Proceedings of the 2nd international workshop on Software and performance, pp. 195–203. ACM, New York (2000)
Fayyad, U.M., Haussler, D., Stolorz, P.E.: Mining scientific data. Commun. ACM 39(11), 51–57 (1996)
Grahne, G., Zhu, J.: High performance mining of maximal frequent itemsets. In: Proceedings of 6th International Workshop on High Performance Data Mining, HPDM 2003 (2003)
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. The Morgan Kaufmann Series in Data Management Systems. Morgan Kaufmann, San Francisco (2000)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Chen, W., Naughton, J., Bernstein, P.A. (eds.) 2000 ACM SIGMOD Intl. Conference on Management of Data, pp. 1–12. ACM Press, New York (2000)
Komorowski, H.J., Pawlak, Z., Polkowski, L.T., Skowron, A.: Rough Sets: A Tutorial, pp. 3–98. Springer, Singapore (1999)
Kwiatkowski, P.: Scalable classification method based on fp-growth algorithm (in polish). Master’s thesis, Warsaw University (2008)
Nguyen, H.S.: Scalable classification method based on rough sets. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds.) RSCTC 2002. LNCS (LNAI), vol. 2475, pp. 433–440. Springer, Heidelberg (2002)
Nguyen, H.S.: Approximate boolean reasoning: Foundations and applications in data mining. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets V. LNCS, vol. 4100, pp. 334–506. Springer, Heidelberg (2006)
Pawlak, Z.: Rough Sets. Theoretical Aspects of Reasoning about Data. Theory and decision library. D: System theory, knowledge engineering and problem solving, vol. 9. Kluwer Academic Publishers, Dordrecht (1991)
Shafer, J.C., Agrawal, R., Mehta, M.: Sprint: A scalable parallel classifier for data mining. In: Vijayaraman, T.M., Buchmann, A.P., Mohan, C., Sarda, N.L. (eds.) VLDB 1996, Proceedings of 22th International Conference on Very Large Data Bases, Mumbai (Bombay), India, September 3-6, pp. 544–555. Morgan Kaufmann, San Francisco (1996)
Skowron, A., Rauszer, C.M.: The discernibility matrices and functions in information systems, ch. 3, pp. 331–362. Kluwer Academic Publishers, Dordrecht (1992)
Stefanowski, J.: On rough set based approaches to induction of decision rules. In: Polkowski, L., Skowron, A. (eds.) Rough Sets in Knowledge Discovery 1. Methodology and Applications, Studies in Fuzziness and Soft Computing, pp. 500–529. Physica-Verlag, Heidelberg (1998)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Wroblewski, J.: Covering with reducts - a fast algorithm for rule generation. In: Polkowski, L., Skowron, A. (eds.) RSCTC 1998. LNCS (LNAI), vol. 1424, pp. 402–407. Springer, Heidelberg (1998)
Ziarko, W.: Rough sets as a methodology for data mining. In: Polkowski, L., Skowron, A. (eds.) Rough Sets in Knowledge Discovery 1. Methodology and Applications, Studies in Fuzziness and Soft Computing, pp. 554–571. Physica-Verlag, Heidelberg (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Nguyen, S.H., Nguyen, H.S. (2010). Scalable Methods in Rough Sets. In: Huynh, VN., Nakamori, Y., Lawry, J., Inuiguchi, M. (eds) Integrated Uncertainty Management and Applications. Advances in Intelligent and Soft Computing, vol 68. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11960-6_39
Download citation
DOI: https://doi.org/10.1007/978-3-642-11960-6_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11959-0
Online ISBN: 978-3-642-11960-6
eBook Packages: EngineeringEngineering (R0)