Naïve Bayes vs. Support Vector Machine: Resilience to Missing Data

Shi, Hongbo; Liu, Yaqin

doi:10.1007/978-3-642-23887-1_86

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7003)

Included in the following conference series:

International Conference on Artificial Intelligence and Computational Intelligence

Abstract

Naïve Bayes and the support vector machine are typical generative and discriminative classification models, respectively, and both are popular classification approaches. Few studies have compared their resilience to missing data. This paper provides an experimental comparison of the resilience of naïve Bayes and the support vector machine to missing data on 24 UCI data sets. The experimental results show that when the missing rate is very small (e.g. 1%), the resilience of naïve Bayes classifiers to missing data is approximately the same as that of support vector machine classifiers. As the missing rate increases, however, the resilience of naïve Bayes classifiers decreases slowly, whereas that of support vector machine classifiers decreases rapidly. This demonstrates that naïve Bayes classifiers are more resilient to missing data than support vector machine classifiers.
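The abstract does not say how values were removed or how each classifier treated them, so the following is only a minimal sketch of one way such an experiment could be set up, not the authors' protocol. It assumes categorical attributes, values removed completely at random at a given missing rate, and a naïve Bayes classifier that simply skips missing values during counting and prediction. The names `inject_missing`, `NaiveBayes`, and `accuracy` and the toy data are hypothetical. A support vector machine counterpart is omitted because it would additionally need an imputation step and an external library.

```python
import math
import random
from collections import Counter, defaultdict


def inject_missing(rows, rate, seed=0):
    """Return a copy of rows with a fraction `rate` of cells set to None.

    Assumption: cells are removed completely at random (uniformly).
    """
    rng = random.Random(seed)
    cells = [(i, j) for i, row in enumerate(rows) for j in range(len(row))]
    n_missing = round(rate * len(cells))
    masked = [list(row) for row in rows]
    for i, j in rng.sample(cells, n_missing):
        masked[i][j] = None
    return masked


class NaiveBayes:
    """Categorical naive Bayes that ignores missing (None) values."""

    def fit(self, rows, labels):
        self.class_counts = Counter(labels)
        # counts[(attribute index, class)][value] -> observed occurrences
        self.counts = defaultdict(Counter)
        # values[attribute index] -> set of values seen in training
        self.values = defaultdict(set)
        for row, y in zip(rows, labels):
            for j, v in enumerate(row):
                if v is not None:
                    self.counts[(j, y)][v] += 1
                    self.values[j].add(v)
        return self

    def predict(self, row):
        total = sum(self.class_counts.values())
        best, best_score = None, -math.inf
        for y, n_y in self.class_counts.items():
            score = math.log(n_y / total)
            for j, v in enumerate(row):
                if v is None:
                    continue  # a missing attribute contributes no factor
                c = self.counts[(j, y)]
                k = len(self.values[j] | {v})  # number of distinct values
                # Laplace-smoothed estimate of P(value | class)
                score += math.log((c[v] + 1) / (sum(c.values()) + k))
            if score > best_score:
                best, best_score = y, score
        return best


def accuracy(model, rows, labels):
    """Fraction of rows whose predicted class equals the true label."""
    hits = sum(model.predict(r) == y for r, y in zip(rows, labels))
    return hits / len(labels)


# Toy data (hypothetical): the class depends only on the first attribute.
rows = [("sunny", "hot"), ("sunny", "mild"), ("rain", "hot"), ("rain", "mild")] * 5
labels = ["no", "no", "yes", "yes"] * 5

for rate in (0.0, 0.01, 0.2, 0.5):
    masked = inject_missing(rows, rate, seed=1)
    model = NaiveBayes().fit(masked, labels)
    print(f"missing rate {rate:.2f}: accuracy {accuracy(model, rows, labels):.2f}")
```

Comparing the accuracy at each missing rate with the accuracy at rate 0 gives one rough picture of how quickly a classifier degrades. The paper's own resilience measure may be defined differently.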

Author information

Authors and Affiliations

School of Information Management, Shanxi University of Finance and Economics, 030031, Taiyuan, China
Hongbo Shi & Yaqin Liu

Editor information

Editors and Affiliations

School of Business Information Technology, RMIT University, City Campus, 124 La Trobe Street, 3000, Melbourne, VIC, Australia
Hepu Deng
School of Electronics and Information, Tongji University, 201804, Shanghai, China
Duoqian Miao
School of Computer and Information Engineering, Shanghai University of Electric Power, 200090, Shanghai, China
Jingsheng Lei
Department of Business Administration, Caritas Institute of Higher Education, 18 Chui Ling Road, Tseung Kwan O, Hong Kong, China
Fu Lee Wang

About this paper

Cite this paper

Shi, H., Liu, Y. (2011). Naïve Bayes vs. Support Vector Machine: Resilience to Missing Data. In: Deng, H., Miao, D., Lei, J., Wang, F.L. (eds) Artificial Intelligence and Computational Intelligence. AICI 2011. Lecture Notes in Computer Science, vol 7003. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23887-1_86

DOI: https://doi.org/10.1007/978-3-642-23887-1_86
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23886-4
Online ISBN: 978-3-642-23887-1
eBook Packages: Computer Science, Computer Science (R0)
