Improvement of Decision Accuracy Using Discretization of Continuous Attributes

Wu, QingXiang; Bell, David; McGinnity, Martin; Prasad, Girijesh; Qi, Guilin; Huang, Xi

doi:10.1007/11881599_81

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4223)

Included in the following conference series: International Conference on Fuzzy Systems and Knowledge Discovery

Abstract

The naïve Bayes classifier has been widely applied to decision-making or classification. Because the naïve Bayes classifier works best with discrete values, this paper proposes a novel discretization approach to improve the classifier and enhance its decision accuracy. The approach defines a distributional index based on the statistical information used by the naïve Bayes classifier. The distributional index can be applied to find a good discretization of continuous attributes, so that the naïve Bayes classifier can reach high decision accuracy on instance information systems with continuous attributes. Experimental results on benchmark data sets show that the naïve Bayes classifier with the new discretizer can reach higher accuracy than the C5.0 tree.
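
To make the role of discretization concrete, the following is a minimal sketch of a naïve Bayes classifier that first maps continuous attributes to intervals. It does not implement the distributional index, which the abstract does not define; plain equal-width binning is swapped in as a stand-in discretizer. All names (equal_width_cuts, to_bin, DiscreteNaiveBayes, n_bins, alpha) and the toy data are hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict
import math


def equal_width_cuts(values, n_bins):
    """Return n_bins - 1 cut points splitting [min, max] into equal-width intervals.

    Stand-in discretizer (assumption): NOT the paper's distributional index.
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    return [lo + width * i for i in range(1, n_bins)]


def to_bin(value, cuts):
    """Index of the interval containing value, from 0 to len(cuts)."""
    return sum(value > c for c in cuts)


class DiscreteNaiveBayes:
    """Naive Bayes over discretized attributes with Laplace smoothing."""

    def __init__(self, n_bins=3, alpha=1.0):
        self.n_bins = n_bins  # intervals per attribute (illustrative default)
        self.alpha = alpha    # smoothing constant (illustrative default)

    def fit(self, X, y):
        n_attrs = len(X[0])
        # One list of cut points per continuous attribute.
        self.cuts = [
            equal_width_cuts([row[j] for row in X], self.n_bins)
            for j in range(n_attrs)
        ]
        self.class_counts = Counter(y)
        # (class, attribute index, interval index) -> number of training rows
        self.counts = defaultdict(int)
        for row, label in zip(X, y):
            for j, v in enumerate(row):
                self.counts[(label, j, to_bin(v, self.cuts[j]))] += 1
        return self

    def predict(self, row):
        total = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_c in self.class_counts.items():
            # log P(class) + sum over attributes of log P(interval | class)
            score = math.log(n_c / total)
            for j, v in enumerate(row):
                k = self.counts.get((label, j, to_bin(v, self.cuts[j])), 0)
                score += math.log((k + self.alpha) / (n_c + self.alpha * self.n_bins))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy data (made up for illustration): two continuous attributes, two classes.
X = [[1.0, 5.0], [1.2, 5.5], [0.8, 4.8], [3.0, 1.0], [3.2, 1.2], [2.9, 0.9]]
y = ["a", "a", "a", "b", "b", "b"]
model = DiscreteNaiveBayes(n_bins=3).fit(X, y)
print(model.predict([1.1, 5.2]), model.predict([3.1, 1.0]))
```

In this sketch the choice of cut points is the only part that a different discretizer would change: replacing equal_width_cuts with another cut-point selection rule leaves the counting and prediction steps untouched.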

Author information

Authors and Affiliations

School of Physics and OptoElectronic Technology, Fujian Normal University, Fuzhou, Fujian, China
QingXiang Wu & Xi Huang
School of Computer Science, Queen’s University, Belfast, BT7 1NN, UK
QingXiang Wu, David Bell & Guilin Qi
School of Computing and Intelligent Systems, University of Ulster at Magee, Londonderry, BT48 7JL, N. Ireland, UK
QingXiang Wu, Martin McGinnity & Girijesh Prasad

Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, Nanyang Technological University, Block S1, Nanyang Avenue, 639798, Singapore
Lipo Wang
Life Science Research Center, School of Electronic Engineering, Xidian University, 710071, Xi’an, Shaanxi, China
Licheng Jiao
School of Electrical and Electronic Engineering, Xidian University, 710071, Xi’an, China
Guanming Shi
School of Information Technology and Electrical Engineering, The University of Queensland, 4072, Brisbane, Queensland, Australia
Xue Li
College of Mathematics and Information Science, Hebei Normal University, 050016, Shijiazhuang, Hebei, P.R. China
Jing Liu

Cite this paper

Wu, Q., Bell, D., McGinnity, M., Prasad, G., Qi, G., Huang, X. (2006). Improvement of Decision Accuracy Using Discretization of Continuous Attributes. In: Wang, L., Jiao, L., Shi, G., Li, X., Liu, J. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2006. Lecture Notes in Computer Science, vol 4223. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881599_81

DOI: https://doi.org/10.1007/11881599_81
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45916-3
Online ISBN: 978-3-540-45917-0
eBook Packages: Computer Science, Computer Science (R0)
