skip to main content
10.1145/3631461.3631955acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicdcnConference Proceedingsconference-collections
research-article
Open access

A Cost-effective Framework for Privacy Preserving Federated Learning

Published: 22 January 2024 Publication History

Abstract

Recently, Federated Learning (FL) has received significant attention in collaborative and privacy preserving model training across different application areas. FL is a part of machine learning that enables multiple data owners to collaboratively train a single model by sharing model parameters instead of their original data. However, it may introduce risks of information leakage while exchanging model parameters. To address this concern, existing FL approaches use Secure Multiparty Computation (SMC), Differential Privacy (DP), Homomorphic Encryption (HE), and various hybrid secure techniques. This increases the computational and communicational costs as well as the training time. In this paper, we propose a cost-effective and efficient federated learning model with vertical data partitioning. In our approach, we exploit the trivial distribution of dataset into private and non-private features. We use only private dataset for security enforcement whereas the complete dataset is used for training to enhance the accuracy of the model. Our approach also reduces the computational and communicational costs because it uses non-private data in the initial round and exchanges partially trained model parameters. Subsequently, it follows traditional FL approach with private data. Our framework doesn’t incorporate SMC during the initial round, whereas, it is only used during training with private data. We have evaluated the security and performance of our proposed framework and compared the same with state-of-art research.

References

[1]
Timo M Deist, Frank JWM Dankers, Priyanka Ojha, M Scott Marshall, Tomas Janssen, Corinne Faivre-Finn, Carlotta Masciocchi, Vincenzo Valentini, Jiazhou Wang, Jiayan Chen, 2020. Distributed learning on 20 000+ lung cancer patients–The Personal Health Train. Radiotherapy and Oncology 144 (2020), 189–200.
[2]
Bimal Ghimire and Danda B Rawat. 2022. Recent advances on federated learning for cybersecurity and cybersecurity for federated learning for internet of things. IEEE Internet of Things Journal 9, 11 (2022), 8229–8249.
[3]
Bart Jacobs and Jean Popma. 2019. Medical research, big data and the need for privacy by design. Big Data & Society 6, 1 (2019), 2053951718824352.
[4]
Qiongxiu Li and Mads Græsbøll Christensen. 2019. A privacy-preserving asynchronous averaging algorithm based on shamir’s secret sharing. In 2019 27th European Signal Processing Conference (EUSIPCO). IEEE, 1–5.
[5]
Tong Li, Jin Li, Zheli Liu, Ping Li, and Chunfu Jia. 2018. Differentially private Naive Bayes learning over multiple data sources. Information Sciences 444 (2018), 89–104.
[6]
Yong Li, Yipeng Zhou, Alireza Jolfaei, Dongjin Yu, Gaochao Xu, and Xi Zheng. 2020. Privacy-preserving federated learning framework based on chained secure multiparty computing. IEEE Internet of Things Journal 8, 8 (2020), 6178–6186.
[7]
Yang Liu, Yan Kang, Tianyuan Zou, Yanhong Pu, Yuanqin He, Xiaozhou Ye, Ye Ouyang, Ya-Qin Zhang, and Qiang Yang. 2022. Vertical federated learning. arXiv preprint arXiv:2211.12814 (2022).
[8]
Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Agüera y Arcas. 2017. Communication-Efficient Learning of Deep Networks from Decentralized Data. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, AISTATS 2017, 20-22 April 2017, Fort Lauderdale, FL, USA(Proceedings of Machine Learning Research, Vol. 54), Aarti Singh and Xiaojin (Jerry) Zhu (Eds.). PMLR, 1273–1282. http://proceedings.mlr.press/v54/mcmahan17a.html
[9]
Shiho Moriai. 2019. Privacy-preserving deep learning via additively homomorphic encryption. In 2019 IEEE 26th Symposium on Computer Arithmetic (ARITH). IEEE, 198–198.
[10]
Rachel Nosowsky and Thomas J Giordano. 2006. The Health Insurance Portability and Accountability Act of 1996 (HIPAA) privacy rule: implications for clinical research. Annu. Rev. Med. 57 (2006), 575–590.
[11]
Nicolas Papernot, Shuang Song, Ilya Mironov, Ananth Raghunathan, Kunal Talwar, and Úlfar Erlingsson. 2018. Scalable private learning with pate. arXiv preprint arXiv:1802.08908 (2018).
[12]
Quoc-Viet Pham, Ming Zeng, Thien Huynh-The, Zhu Han, and Won-Joo Hwang. 2022. Aerial access networks for federated learning: Applications and challenges. IEEE Network 36, 3 (2022), 159–166.
[13]
Stacey Truex, Nathalie Baracaldo, Ali Anwar, Thomas Steinke, Heiko Ludwig, Rui Zhang, and Yi Zhou. 2019. A hybrid approach to privacy-preserving federated learning. In Proceedings of the 12th ACM workshop on artificial intelligence and security. 1–11.
[14]
Runhua Xu, Nathalie Baracaldo, Yi Zhou, Ali Anwar, and Heiko Ludwig. 2019. Hybridalpha: An efficient approach for privacy-preserving federated learning. In Proceedings of the 12th ACM workshop on artificial intelligence and security. 13–23.
[15]
Qiang Yang, Yang Liu, Tianjian Chen, and Yongxin Tong. 2019. Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST) 10, 2 (2019), 1–19.
[16]
Tom Young, Devamanyu Hazarika, Soujanya Poria, and Erik Cambria. 2018. Recent trends in deep learning based natural language processing. ieee Computational intelligenCe magazine 13, 3 (2018), 55–75.
[17]
Chen Zhang, Yu Xie, Hang Bai, Bin Yu, Weihong Li, and Yuan Gao. 2021. A survey on federated learning. Knowledge-Based Systems 216 (2021), 106775.
[18]
Tuo Zhang, Lei Gao, Chaoyang He, Mi Zhang, Bhaskar Krishnamachari, and A Salman Avestimehr. 2022. Federated learning for the internet of things: Applications, challenges, and opportunities. IEEE Internet of Things Magazine 5, 1 (2022), 24–29.
[19]
Weishan Zhang, Tao Zhou, Qinghua Lu, Xiao Wang, Chunsheng Zhu, Haoyun Sun, Zhipeng Wang, Sin Kit Lo, and Fei-Yue Wang. 2021. Dynamic-fusion-based federated learning for COVID-19 detection. IEEE Internet of Things Journal 8, 21 (2021), 15884–15891.
[20]
Bo Zhao, Konda Reddy Mopuri, and Hakan Bilen. 2020. idlg: Improved deep leakage from gradients. arXiv preprint arXiv:2001.02610 (2020).
[21]
Ligeng Zhu, Zhijian Liu, and Song Han. 2019. Deep leakage from gradients. Advances in neural information processing systems 32 (2019).

Cited By

View all
  • (2024)CQUPT-HDS: Health Big Data Security Monitoring Scheme for the Whole Life Cycle2024 IEEE International Conference on Medical Artificial Intelligence (MedAI)10.1109/MedAI62885.2024.00069(476-481)Online publication date: 15-Nov-2024

Index Terms

  1. A Cost-effective Framework for Privacy Preserving Federated Learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICDCN '24: Proceedings of the 25th International Conference on Distributed Computing and Networking
    January 2024
    423 pages
    ISBN:9798400716737
    DOI:10.1145/3631461
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 January 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Data Partitioning
    2. Federated Learning
    3. Privacy Preserving federated learning

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICDCN '24

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)168
    • Downloads (Last 6 weeks)15
    Reflects downloads up to 25 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)CQUPT-HDS: Health Big Data Security Monitoring Scheme for the Whole Life Cycle2024 IEEE International Conference on Medical Artificial Intelligence (MedAI)10.1109/MedAI62885.2024.00069(476-481)Online publication date: 15-Nov-2024

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Login options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media