Classification Based on Attribute Dependency

Lee, Yue-Shi; Yen, Show-Jane

doi:10.1007/978-3-540-30076-2_26

Classification Based on Attribute Dependency

Yue-Shi Lee¹⁹ &
Show-Jane Yen¹⁹

Conference paper

453 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3181))

Abstract

The decision tree learning algorithms, e.g., C5, are good at dataset classification. But those algorithms usually work with only one attribute at a time. The dependencies among attributes are not considered in those algorithms. Thus, it is very important to construct a model to discover the dependencies among attributes and to improve the accuracy of the decision tree learning algorithms. Association mining is a good choice for us to concern with the problems of attribute dependencies. Generally, these dependencies are classified into three types: categorical-type, numerical-type, and categorical- numerical-mixed dependencies. This paper proposes a CAM (Classification based on Association Mining) model to deal with such kind of dependency. The CAM model combines the association mining technologies and the traditional decision-tree learning capabilities to handle the complicated and real cases. According to the experiments on fifteen datasets from the UCI database repository, the CAM model can significantly improve both the accuracy and the rule size of C5. At the same time, the CAM model also outperforms the existing association-based classification models, i.e., ADT (Association-based Decision Tree) and CBA (Classification Based on Association).

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chen, M.S., Han, J., Yu, P.S.: Data Mining: An Overview from a Database Perspective. IEEE Transaction on Knowledge and Data Engineering 8(6), 866–882 (1996)
Article Google Scholar
Quinlan, J.R.: Improved Use of Continuous Attributes in C4. 5. Journal of Artificial Intelligence Approach 4, 77–90 (1996)
MATH Google Scholar
Lee, Y.S., Yen, S.J.: Neural-Based Approaches for Improving the Accuracy of Decision Trees. In: Proceedings of International Conference on Data Warehousing and Knowledge Discovery, pp. 114–123 (2002)
Google Scholar
Chen, M.S.: On the Evaluation of Using Multiple Attributes for Mining Classification Rules. In: Proceedings of IEEE International Conference on Tools with Artificial Intelligence, pp. 130–137 (1998)
Google Scholar
Liu, B., Hsu, W., Ma, Y.: Integrating Classification and Association Rule Mining. In: Proceedings of International Conference on Knowledge Discovery and Data Mining, pp. 80–86 (1998)
Google Scholar
Wang, K., Zhou, S.Q., He, Y.: Growing Decision Trees on Support-Less Association Rules. In: Proceedings of International Conference on Knowledge Discovery and Data Mining, pp. 265–269 (2000)
Google Scholar
Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules between Sets of Items in Large Databases. In: Proceedings of ACM SIGMOD International Conference on Management of Data, pp. 207–216 (1993)
Google Scholar
Park, J.S., Chen, M.S., Yu, P.S.: Using a Hash-Based Method with Transaction Trimming for Mining Association Rules. IEEE Transactions on Knowledge and Data Engineering 9(5), 813–825 (1997)
Article Google Scholar
Pei, J., Han, J., Lu, H., Nishio, S., Tang, S., Yang, D.: H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases. In: Proceedings of IEEE International Conference on Data Mining, pp. 441–448 (2001)
Google Scholar
Han, J., Pei, J., Yin, Y., Mao, R.: Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach. Data Mining and Knowledge Discovery 1, 53–87 (2004)
Article MathSciNet Google Scholar
Kamber, M., Han, J., Chiang, J.Y.: Metarule-Guided Mining of Multidimensional Association Rules Using Data Cubes. In: Proceedings of International Conference on Knowledge Discovery and Data Mining, pp. 207–210 (1997)
Google Scholar
Merz, C.J., Murphy, P.: UCI repository of machine learning databases (1996), http://www.cs.uci.edu/mlearn/MLRepository.html

Download references

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, Ming Chuan University, 5 The-Ming Rd., Gwei Shan District, Taoyuan County, 333, Taiwan, R.O.C.
Yue-Shi Lee & Show-Jane Yen

Authors

Yue-Shi Lee
View author publications
You can also search for this author in PubMed Google Scholar
Show-Jane Yen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, 606-8501, Sakyo, Kyoto, Japan
Yahiko Kambayashi
I.B.M. India Research Lab,, India
Mukesh Mohania
Institute for Application Oriented Knowledge Processing (FAW), Johannes Kepler University Linz, Austria
Wolfram Wöß

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lee, YS., Yen, SJ. (2004). Classification Based on Attribute Dependency. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2004. Lecture Notes in Computer Science, vol 3181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30076-2_26

Download citation

DOI: https://doi.org/10.1007/978-3-540-30076-2_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22937-7
Online ISBN: 978-3-540-30076-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics