Naïve Bayes vs. Support Vector Machine: Resilience to Missing Data

  • Conference paper
Artificial Intelligence and Computational Intelligence (AICI 2011)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7003)

Abstract

Naïve Bayes and the support vector machine are typical generative and discriminative classification models, respectively, and both are popular classification approaches. Few studies have compared their resilience to missing data. This paper provides an experimental comparison of naïve Bayes and support vector machine classifiers with respect to their resilience to missing data on 24 UCI data sets. The experimental results show that when the missing rate is very small (e.g. 1%), the resilience of naïve Bayes classifiers to missing data is approximately the same as that of support vector machine classifiers. As the missing rate increases, however, the resilience of naïve Bayes classifiers decreases slowly, whereas that of support vector machine classifiers decreases rapidly. This indicates that naïve Bayes classifiers are more resilient to missing data than support vector machine classifiers.
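The abstract does not say how missing values were introduced into the data sets. As a minimal sketch, assuming feature values are removed completely at random at a chosen missing rate, the masking step of such an experiment might look like the following. The function names `inject_missing` and `missing_fraction`, the use of `None` as the missing marker, and the toy data are all illustrative assumptions, not taken from the paper.

```python
import random


def inject_missing(rows, rate, seed=0, missing=None):
    """Return a copy of `rows` with a fraction `rate` of cells set to `missing`.

    Assumption: cells are chosen uniformly at random (completely at random);
    the paper's actual procedure is not stated in the abstract.
    """
    if not 0.0 <= rate <= 1.0:
        raise ValueError("rate must be in [0, 1]")
    rng = random.Random(seed)  # local generator, so results are reproducible
    cells = [(i, j) for i, row in enumerate(rows) for j in range(len(row))]
    n_missing = round(rate * len(cells))
    masked = [list(row) for row in rows]  # copy; the input is left untouched
    for i, j in rng.sample(cells, n_missing):
        masked[i][j] = missing
    return masked


def missing_fraction(rows, missing=None):
    """Fraction of cells in `rows` that hold the missing marker."""
    total = sum(len(row) for row in rows)
    hits = sum(1 for row in rows for value in row if value is missing)
    return hits / total if total else 0.0


# Hypothetical toy feature matrix: 20 instances, 5 features, 100 cells.
data = [[float(i + j) for j in range(5)] for i in range(20)]
for rate in (0.01, 0.1, 0.3):
    masked = inject_missing(data, rate, seed=42)
    print(rate, missing_fraction(masked))
```

In a comparison of this kind, each masked copy would then be passed to both classifiers, and their accuracy tracked as the missing rate grows. How each classifier treats the missing marker is a separate design choice that this sketch leaves open.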

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shi, H., Liu, Y. (2011). Naïve Bayes vs. Support Vector Machine: Resilience to Missing Data. In: Deng, H., Miao, D., Lei, J., Wang, F.L. (eds) Artificial Intelligence and Computational Intelligence. AICI 2011. Lecture Notes in Computer Science, vol 7003. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23887-1_86

  • DOI: https://doi.org/10.1007/978-3-642-23887-1_86

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23886-4

  • Online ISBN: 978-3-642-23887-1

  • eBook Packages: Computer Science (R0)