skip to main content
10.1145/3358331.3358396acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiamConference Proceedingsconference-collections
research-article

Naive Bayes Classification based on Differential Privacy

Published: 17 October 2019 Publication History

Abstract

Data mining has a wide range of applications in the real world. However, it is possible to disclose the private information of users in the process of data mining. Therefore, it is of great significance to protect the users' privacy while mining the knowledge behind the data. In this paper, we propose a Naive Bayes classification method based on differential privacy. For nominal attributes, we add Laplace noise to the count. For numerical attributes, we add Laplace noise to the mean, standard deviation, and scale parameter, and then use the noisy parameters to calculate the prior probability and conditional probability. For numerical attributes, we assume that they follow Gaussian, Laplace, or lognormal distribution, and apply our algorithms to compare utilities.

References

[1]
Agrawal, R. (2000). Privacy-preserving data mining. Proceedings of the 2000 ACM SIG-MOD international conference on Management of data. Association for Computing Machinery.
[2]
Dwork, C. (2006). Differential privacy. International Colloquium on Automata, Languages, & Programming.
[3]
Dwork, C., Kenthapadi, K., Mcsherry, F., Mironov, I., & Naor, M. (2006). Our Data, Ourselves: Privacy Via Distributed Noise Generation. Advances in Cryptology - EUROCRYPT 2006, 25th Annual International Conference on the Theory and Applications of Cryptographic Techniques, St. Petersburg, Russia, May 28 - June 1, 2006, Proceedings. DBLP.
[4]
Liu, X., Liu, A., Zhang, X., Li, Z., & Zhou, X. (2017). When Differential Privacy Meets Randomized Perturbation: A Hybrid Approach for Privacy-Preserving Recommender System. International Conference on Database Systems for Advanced Applications.
[5]
Lin, C., Song, Z., Song, H., Zhou, Y., Wang, Y., & Wu, G. (2016). Differential privacy preserving in big data analytics for connected health. Journal of Medical Systems, 40(4), 97.
[6]
Xinyu, X., Fei, C., Peizhi, H., Miaomiao, T., Xiaofang, H., & Badong, C., et al. (2018). Frequent itemsets mining with differential privacy over large-scale data. IEEE Access, 1--1.
[7]
Xia, Y., Huang, Y., Zhang, X., & Bae, H. Y. (2018). Frequent itemset mining with differential privacy based on transaction truncation.
[8]
Wu, X., Wei, Y., Mao, Y., & Wang, L. (2018). A differential privacy dna motif finding method based on closed frequent patterns. Cluster Computing (21), 1--13.
[9]
Qin, Z., Yu, T., Yang, Y., Khalil, I., Xiao, X., & Ren, K. (2017). [ACM Press the 2017 ACM SIGSAC Conference - Dallas, Texas, USA (2017.10.30--2017.11.03)] Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, - CCS \"17 - Generating Synthetic Decentralized Social Graphs with Local Differential Privacy. Acm Sigsac Conference on Computer & Communications Security (pp.425--438). ACM.
[10]
Hu, J., Shi, W., Liu, H., Yan, J., Tian, Y., & Wu, Z. (2017). Preserving Friendly-Correlations in Uncertain Graphs Using Differential Privacy. 2017 International Conference on Networking and Network Applications (NaNA). IEEE Computer Society.
[11]
Zhu, & Tianqing. (2013). An Effective Differentially Private Data Releasing Algorithm for;Decision Tree. IEEE International Conference on Trust. IEEE.
[12]
Su, D., Cao, J., Li, N., Bertino, E., & Jin, H. (2016). Differentially Private K-Means Clustering. Proceedings of the Sixth ACM Conference on Data and Application Security and Privacy. ACM.
[13]
Cormode, G. (2012). Individual privacy vs population privacy: learning to attack, anonymization.
[14]
Huai, M., Huang, L., Yang, W., Li, L., & Qi, M. (2015). Privacy-Preserving Naive Bayes Classification. International Conference on Knowledge Science. Springer International Publishing.
[15]
Vaidya, J., Shafiq, B., Basu, A., & Hong, Y. (2013). Differentially Private Naive Bayes Classification. Proceedings of the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 01. ACM.
[16]
Yi, J., Su, Y., & Zhao, X. (2017). Statistics-based email communication security behavior recognition. Journal of Physics: Conference Series, 887, 012040.

Cited By

View all
  • (2024)HEaaN-NB: Non-Interactive Privacy-Preserving Naive Bayes Using CKKS for Secure Outsourced Cloud ComputingIEEE Access10.1109/ACCESS.2024.343816112(110762-110780)Online publication date: 2024
  • (2024)Targeted prevention of risky deals for improper granular data with deep learningInternational Journal of System Assurance Engineering and Management10.1007/s13198-024-02646-816:2(750-764)Online publication date: 6-Dec-2024
  • (2023)Fully Homomorphic Privacy-Preserving Naive Bayes Machine Learning and ClassificationProceedings of the 11th Workshop on Encrypted Computing & Applied Homomorphic Cryptography10.1145/3605759.3625262(91-102)Online publication date: 26-Nov-2023
  • Show More Cited By

Index Terms

  1. Naive Bayes Classification based on Differential Privacy

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AIAM 2019: Proceedings of the 2019 International Conference on Artificial Intelligence and Advanced Manufacturing
    October 2019
    418 pages
    ISBN:9781450372022
    DOI:10.1145/3358331
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Differential Privacy
    2. Naive Bayes Classification
    3. utility

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    • the Foundation of Guizhou Provincial Key Laboratory of Public Big Data
    • the Fundamental Research Funds for the Central Universities

    Conference

    AIAM 2019

    Acceptance Rates

    Overall Acceptance Rate 100 of 285 submissions, 35%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)HEaaN-NB: Non-Interactive Privacy-Preserving Naive Bayes Using CKKS for Secure Outsourced Cloud ComputingIEEE Access10.1109/ACCESS.2024.343816112(110762-110780)Online publication date: 2024
    • (2024)Targeted prevention of risky deals for improper granular data with deep learningInternational Journal of System Assurance Engineering and Management10.1007/s13198-024-02646-816:2(750-764)Online publication date: 6-Dec-2024
    • (2023)Fully Homomorphic Privacy-Preserving Naive Bayes Machine Learning and ClassificationProceedings of the 11th Workshop on Encrypted Computing & Applied Homomorphic Cryptography10.1145/3605759.3625262(91-102)Online publication date: 26-Nov-2023
    • (2023)Privacy-preserving data (stream) mining techniques and their impact on data mining accuracy: a systematic literature reviewArtificial Intelligence Review10.1007/s10462-023-10425-356:9(10427-10464)Online publication date: 22-Feb-2023
    • (2022)A Modified Naïve Bayes Classifier for Detecting Spam E-mails based on Feature Selection2022 6th International Conference on Intelligent Computing and Control Systems (ICICCS)10.1109/ICICCS53718.2022.9788340(1634-1641)Online publication date: 25-May-2022
    • (2021)Liver disease prediction using W-LR-XGB Algorithm2021 International Conference on Computer, Blockchain and Financial Development (CBFD)10.1109/CBFD52659.2021.00055(245-248)Online publication date: Apr-2021
    • (2020)Differential Privacy Preserving Naive Bayes Classification via Wavelet Transform2020 International Conference on Networking and Network Applications (NaNA)10.1109/NaNA51271.2020.00021(81-85)Online publication date: Dec-2020

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media