Incremental Discretization for Naïve-Bayes Classifier

Lu, Jingli; Yang, Ying; Webb, Geoffrey I.

doi:10.1007/11811305_25

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4093)

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

Abstract

Naïve-Bayes classifiers (NB) support incremental learning. However, the lack of effective incremental discretization methods has been hindering NB's incremental learning in the face of quantitative data. This problem is further compounded by the fact that quantitative data are everywhere, from temperature readings to share prices. In this paper, we present a novel incremental discretization method for NB, incremental flexible frequency discretization (IFFD). IFFD discretizes the values of a quantitative attribute into a sequence of intervals of flexible sizes. It allows online insertion and splitting operations on intervals. Theoretical analysis and experimental tests are conducted to compare IFFD with alternative methods. Empirical evidence suggests that IFFD is efficient and effective. NB coupled with IFFD strikes a balance between high learning efficiency and high classification accuracy in the context of incremental learning.
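
The abstract names two online operations on intervals, insertion and splitting, without giving their details. The Python sketch below is only an illustration of that general idea, not the authors' IFFD algorithm: the class name `FlexibleIntervals`, the `max_size` capacity, and the rule of splitting an overfull interval near its median are all assumptions made for this example.

```python
from bisect import bisect_left, insort
from collections import Counter


class FlexibleIntervals:
    """Illustrative sorted interval list with online insertion and splitting."""

    def __init__(self, max_size=4):
        # Hypothetical capacity: the abstract does not state any size bound.
        self.max_size = max_size
        # Sorted cut points. Interval i covers (cuts[i-1], cuts[i]],
        # with the first and last intervals open-ended.
        self.cuts = []
        # One sorted list of (value, class_label) pairs per interval.
        self.buckets = [[]]

    def interval_of(self, value):
        """Return the index of the interval that contains value."""
        return bisect_left(self.cuts, value)

    def insert(self, value, label):
        """Add one training instance, splitting its interval if it overflows."""
        i = self.interval_of(value)
        insort(self.buckets[i], (value, label))
        if len(self.buckets[i]) > self.max_size:
            self._split(i)

    def _split(self, i):
        """Split interval i near its median (an assumed rule)."""
        bucket = self.buckets[i]
        cut = bucket[len(bucket) // 2 - 1][0]
        left = [item for item in bucket if item[0] <= cut]
        right = [item for item in bucket if item[0] > cut]
        if not right:
            # All values at or below the candidate cut: nothing to separate.
            return
        self.cuts.insert(i, cut)
        self.buckets[i:i + 1] = [left, right]

    def class_counts(self, i):
        """Per-class frequencies in interval i, as NB would need them."""
        return Counter(label for _, label in self.buckets[i])


if __name__ == "__main__":
    d = FlexibleIntervals(max_size=4)
    for v in [1, 2, 3, 4, 5]:
        d.insert(v, "a" if v < 3 else "b")
    print(d.cuts, [len(b) for b in d.buckets])
```

Only the interval that receives a new value is touched, so existing cut points and the class counts of all other intervals stay unchanged between updates. That locality is the property the sketch is meant to show.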

Author information

Authors and Affiliations

Clayton School of Information Technology, Monash University, VIC, 3800, Australia
Jingli Lu, Ying Yang & Geoffrey I. Webb

Editor information

Editors and Affiliations

School of Information Technology and Electronic Engineering, The University of Queensland, Queensland, Australia
Xue Li
University of Alberta, Canada
Osmar R. Zaïane
Northwest Polytechnical University, China
Zhanhuai Li

About this paper

Cite this paper

Lu, J., Yang, Y., Webb, G.I. (2006). Incremental Discretization for Naïve-Bayes Classifier. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science, vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_25

DOI: https://doi.org/10.1007/11811305_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer Science, Computer Science (R0)
