Privacy-Preserving Naive Bayes Classification

Huai, Mengdi; Huang, Liusheng; Yang, Wei; Li, Lu; Qi, Mingyu

doi:10.1007/978-3-319-25159-2_57

Mengdi Huai^22,23,
Liusheng Huang^22,23,
Wei Yang^22,23,
Lu Li^22,23 &
…
Mingyu Qi^22,23

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9403))

Included in the following conference series:

International Conference on Knowledge Science, Engineering and Management

3055 Accesses
10 Citations

Abstract

In this paper, we propose differentially private protocols for Naive Bayes classification over distributed data. Compared with existing works, the privacy and security models in the proposed protocols are stronger: firstly, both the miner and parties can be arbitrarily malicious and can collude with each other to violate the remaining honest parties privacy; secondly, all communication channels between them can be assumed to be insecure. Specifically, we build a guarantee of differential privacy into the cryptographic construction so that the proposed protocols can tolerate collusions and resist eavesdropping attacks which are caused by insecure communication channels. Additionally, the proposed protocols can be implemented at lower computation and communication costs, and some extensions to our protocols (e.g. supporting parties dynamic joins or leaves) are also proposed in this paper. Both theoretical analysis and simulation results show that the proposed privacy-preserving protocols for Naive Bayes have strong security and better classification performance than the standard one.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Dwork, C., McSherry, F., Nissim, K., Smith, A.: Calibrating noise to sensitivity in private data analysis. In: Halevi, S., Rabin, T. (eds.) TCC 2006. LNCS, vol. 3876, pp. 265–284. Springer, Heidelberg (2006)
Chapter Google Scholar
Ghosh, A., Roughgarden, T., Sundararajan, M.: Universally utility-maximizing privacy mechanisms. SIAM Journal on Computing (2012)
Google Scholar
Jung, T., Li, X.-Y.: Collusion-tolerable privacy-preserving sum and product calculation without secure channel (2014)
Google Scholar
Kantarcioglu, M., Vaidya, J., Clifton, C.: Privacy preserving naive bayes classifier for horizontally partitioned data. In: IEEE ICDM Workshop on Privacy Preserving Data Mining, pp. 3–9 (2003)
Google Scholar
Kargupta, H., Datta, S., Wang, Q., Sivakumar, K.: On the privacy preserving properties of random data perturbation techniques. pp. 99–106. IEEE (2003)
Google Scholar
Keshavamurthy, B.N., Toshniwal, D.: Privacy-preserving Naïve Bayes classification using trusted third party computation over vertically partitioned distributed progressive sequential data streams. In: Nagamalai, D., Kaushik, B.K., Meghanathan, N. (eds.) CCSIT 2011 Part II. CCIS, vol. 132, pp. 444–452. Springer, Heidelberg (2011)
Chapter Google Scholar
Lichman, M.: UCI machine learning repository (2013)
Google Scholar
Mironov, I., Pandey, O., Reingold, O., Vadhan, S.: Computational differential privacy. In: Halevi, S. (ed.) CRYPTO 2009. LNCS, vol. 5677, pp. 126–142. Springer, Heidelberg (2009)
Chapter Google Scholar
Shi, E., Chan, T-.H.H., Rieffel, E.G., Chow, R., Song, D.: Privacy-preserving aggregation of time-series data. In: NDSS (2011)
Google Scholar
Vaidya, J., Kantarciouglu, M., Clifton, C.: Privacy-preserving naive bayes classification. VLDB 17(4), 879–898 (2008)
Article Google Scholar
Vaidya, J., Shafiq, B., Basu, A., Hong, Y.: Differentially private naive bayes classification. In: IEEE/WIC/ACM on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), vol. 1, pp. 571–576. IEEE (2013)
Google Scholar
Yi, X., Zhang, Y.: Privacy-preserving naive bayes classification on distributed data via semi-trusted mixers. Information Systems 34(3), 371–380 (2009)
Article Google Scholar
Zhang, P., Tong, Y., Tang, S., Yang, D.: Privacy preserving naive bayes classification. In: Li, X., Wang, S., Dong, Z.Y. (eds.) ADMA 2005. LNCS (LNAI), vol. 3584, pp. 744–752. Springer, Heidelberg (2005)
Chapter Google Scholar
Yang, Z., Zhong, S., Wright, R.N.: Privacy-preserving classification of customer data without loss of accuracy. In: SDM. SIAM (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Technology, University of Science and Technology of China, Hefei, 230026, China
Mengdi Huai, Liusheng Huang, Wei Yang, Lu Li & Mingyu Qi
Suzhou Institute for Advanced Study, University of Science and Technology of China, Suzhou, 215123, China
Mengdi Huai, Liusheng Huang, Wei Yang, Lu Li & Mingyu Qi

Authors

Mengdi Huai
View author publications
You can also search for this author in PubMed Google Scholar
Liusheng Huang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Yang
View author publications
You can also search for this author in PubMed Google Scholar
Lu Li
View author publications
You can also search for this author in PubMed Google Scholar
Mingyu Qi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mengdi Huai .

Editor information

Editors and Affiliations

Chinese Academy of Sciences, Beijing, China
Songmao Zhang
Ludwig-Maximilians-Universität München, Munich, Germany
Martin Wirsing
Southwest University, Chongqing, China
Zili Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Huai, M., Huang, L., Yang, W., Li, L., Qi, M. (2015). Privacy-Preserving Naive Bayes Classification. In: Zhang, S., Wirsing, M., Zhang, Z. (eds) Knowledge Science, Engineering and Management. KSEM 2015. Lecture Notes in Computer Science(), vol 9403. Springer, Cham. https://doi.org/10.1007/978-3-319-25159-2_57

Download citation

DOI: https://doi.org/10.1007/978-3-319-25159-2_57
Published: 03 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25158-5
Online ISBN: 978-3-319-25159-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics