Searching for Dependencies in Bayesian Classifiers

Pazzani, Michael J.

doi:10.1007/978-1-4612-2404-4_23

Michael J. Pazzani³

Part of the book series: Lecture Notes in Statistics ((LNS,volume 112))

875 Accesses
72 Citations

Abstract

Naive Bayesian classifiers which make independence assumptions perform remarkably well on some data sets but poorly on others. We explore ways to improve the Bayesian classifier by searching for dependencies among attributes. We propose and evaluate two algorithms for detecting dependencies among attributes and show that the backward sequential elimination and joining algorithm provides the most improvement over the naive Bayesian classifier. The domains on which the most improvement occurs are those domains on which the naive Bayesian classifier is significantly less accurate than a decision tree learner. This suggests that the attributes used in some common databases are not independent conditioned on the class and that the violations of the independence assumption that affect the accuracy of the classifier can be detected from training data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Almuallim, H., and Dietterich, T. G. (1991). Learning with many irrelevant features. In Ninth National Conference on Artificial Intelligence, 547–552. MIT Press.
Google Scholar
Caruana, R., and Freitag, D. (1994). Greedy attribute selection. In Cohen, W., and Hirsh, H., eds., Machine Learning: Proceedings of the Eleventh International Conference. Morgan Kaufmann
Google Scholar
Cooper, G., and Herskovits, E. (1992). A Bayesian method for the induction of probabilistic networks from data. Machine Learning, 9, 309–347.
MATH Google Scholar
Danyluk, A., and Provost, F. (1993). Small disjuncts in action: Learning to diagnose errors in the telephone network local loop. Machine Learning Conference, pp 81–88.
Google Scholar
Duda, R., and Hart, P. (1973). Pattern classification and scene analysis. New York: John Wiley & Sons.
MATH Google Scholar
John, G. Kohavi, R., and Pfleger, K. (1994). Irrelevant Features and the subset selection problem Proceedings of the Eleventh International Conference on Machine Learning. New Brunswick, NJ.
Google Scholar
Kittler, J. (1986). Feature selection and extraction. In Young & Fu, (eds.), Handbook of pattern recognition and image processing. New York: Academic Press.
Google Scholar
Kononenko, I. (1990). Comparison of inductive and naive Bayesian learning approaches to automatic knowledge acquisition. In B. Wielinga (Eds..), Current trends in knowledge acquisition. Amsterdam: IOS Press.
Google Scholar
Kononenko, I. (1991). Semi-naive Bayesian classifier. Proceedings of the Sixth European Working Session on Learning. (pp. 206–219). Porto, Portugal: Pittman.
Google Scholar
Langley, P. (1993). Induction of recursive Bayesian classifiers. Proceedings of the 1993 European Conference on Machine Learning. (pp. 153–164). Vienna: Springer-Verlag.
Google Scholar
Langley, P., and Sage, S. (1994). Induction of selective Bayesian classifiers. Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence. Seattle, WA
Google Scholar
Moore, A. W., and Lee, M. S. (1994). Efficient algorithms for minimizing cross validation error. In Cohen, W. W., and Hirsh, H., eds., Machine Learning: Proceedings of the Eleventh International Conference. Morgan Kaufmann.
Google Scholar
Murphy, P. M., and Aha, D. W. (1995). UCI Repository of machine learning databases. Irvine: University of California, Department of Information & Computer Science. Machine-readable data repository ftp://ics.uci.edu:/pub/machine-learning-databases.
Google Scholar
Pazzani, M., Merz, C., Murphy, P., Ali, K., Hume, T., and Brunk, C. (1994). Reducing Misclassification Costs. Proceedings of the Eleventh International Conference on Machine Learning. New Brunswick, NJ.
Google Scholar
Pearl, J. (1988). Probabilistic reasoning in intelligent systems: Networks of plausible inference. San Mateo, CA: Morgan Kaufmann.
Google Scholar
Quinlan, J.R. (1986). Induction of decision trees. Machine Learning, 1, 81–106.
Google Scholar
Rachlin, Kasif, Salzberg, and Aha, (1994). Towards a better understanding of memory-based reasoning systems. Proceedings of the Eleventh International Conference on Machine Learning. New Brunswick, NJ.
Google Scholar
Ragavan, H., and Rendell, L. (1993). Lookahead feature construction for learning hard concepts. Machine Learning: Proceedings of the Tenth International Conference. Morgan Kaufmann
Google Scholar
Schlimmer, J. (1987). Incremental adjustment of representations for learning. Machine Learning: Proceedings of the Fourth International Workshop. Morgan Kaufmann
Google Scholar
Schaffer, C. (1994). A conservation law of generalization performance. Proceedings of the Eleventh International Conference on Machine Learning. New Brunswick, NJ.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information and Computer Science, University of California, Irvine, Irvine, CA, 92717, USA
Michael J. Pazzani

Authors

Michael J. Pazzani
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Vanderbilt University, Box 1679, Station B, Nashville, Tennessee, 37235, USA
Doug Fisher
Department of Economics Institute of Statistics and Econometrics, Free University of Berlin, 14185, Berlin, Garystre 21, Germany
Hans-J. Lenz

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Pazzani, M.J. (1996). Searching for Dependencies in Bayesian Classifiers. In: Fisher, D., Lenz, HJ. (eds) Learning from Data. Lecture Notes in Statistics, vol 112. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2404-4_23

Download citation

DOI: https://doi.org/10.1007/978-1-4612-2404-4_23
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-94736-5
Online ISBN: 978-1-4612-2404-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics