Abstract
Many present methods for dealing with the continuous data and missing values in information systems for constructing decision tree do not perform well in practical applications. In this paper, a new algorithm, Decision Tree Construction based on the Cloud Transform and Rough Set Theory under Characteristic Relation (DTCCRSCR), is proposed for mining classification knowledge from the data set. The cloud transform is applied to discretize continuous data and the attribute whose weighted mean roughness under the characteristic relation is the smallest will be selected as the current splitting node. Experimental results show the decision trees constructed by DTCCRSCR tend to have a simpler structure, much higher classification accuracy and more understandable rules than C5.0 in most cases.
This work is partially supported by NSFC (No.60074014), the Research Fund for the Doctoral Program of Higher Education (No.20060613007) and the Basic Science Foundation of Southwest Jiaotong University (No.2007B13).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dunham, M.H.: Data Mining – Introductory and Advanced Topics. Prentice Hall, Englewood Cliffs (2002)
Quinlan, J.R.: Induction of decision tree. Machine Learning 11, 80–106 (1986)
Quinlan, J.R.: C4.5: Programs for machine learning. Morgan Kaufmann, San Francisco (1993)
Grzymala-Busse, J.W., Siddhaye, S.: Rough set approaches to rule induction from incomplete data. In: Proc. of the Int. Conf. on IPMU in KBS, pp. 923–930 (2004)
Li, T., Ruan, D., Geert, W., Song, J., Xu, Y.: A rough sets based characteristic relation approach for dynamic attribute generalization in data mining. Knowledge-Based Systems 20(5), 485–494 (2007)
Grzymala-Busse, J.W.: Characteristic relations for incomplete data: A generalization of the indiscernibility relation. Transactions on Rough Sets IV. 58–68 (2005)
Wei, J., Huang, D., Wang, S., Ma, Z.: Rough Set Based Decision Tree. In: Proc. of the 4th World Congress on Intel. Cont. and Auto, pp. 426–430 (2002)
Jiang, Y., Li, Z., Qi, Z., Liu, Y.: New method for constructing decision tree based on rough set theory. Computer Application 24(8), 21–23 (2004)
Peters, J.F., Suraj, Z., Shan, S., Ramanna, S., Pedrycz, W., Pizzi, N.: Classification of meteorological volumetric radar data using rough set methods. Pattern Recognition Letters 24, 911–920 (2003)
Li, D., Meng, H., Shi, X.: Cloud model and cloud model generator. Computer Research and Development 6, 15–20 (1995)
Fan, J., Li, D.: Mining classfictions knowledge based on cloud models. In: Zhong, N., Zhou, L. (eds.) PAKDD 1999. LNCS (LNAI), vol. 1574, pp. 26–28. Springer, Heidelberg (1999)
Song, J., Li, T., Wang, Y., Qi, J.: Decision Tree Construction based on rough set theory under Characteristic Relation. In: Li, T., Xu, Y., Ruan, D. (eds.) Proc. of the 2007 ISKE, pp. 788–792. Atlantis Press, Paris (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Song, J., Li, T., Ruan, D. (2008). A New Decision Tree Construction Using the Cloud Transform and Rough Sets. In: Wang, G., Li, T., Grzymala-Busse, J.W., Miao, D., Skowron, A., Yao, Y. (eds) Rough Sets and Knowledge Technology. RSKT 2008. Lecture Notes in Computer Science(), vol 5009. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79721-0_71
Download citation
DOI: https://doi.org/10.1007/978-3-540-79721-0_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-79720-3
Online ISBN: 978-3-540-79721-0
eBook Packages: Computer ScienceComputer Science (R0)