New Approaches to Federated XGBoost Learning for Privacy-Preserving Data Analysis

Yamamoto, Fuki; Wang, Lihua; Ozawa, Seiichi

doi:10.1007/978-3-030-63833-7_47

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 12533)

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

In this paper, we propose a new privacy-preserving machine learning algorithm called Federated-Learning XGBoost (FL-XGBoost), which introduces a federated learning scheme into XGBoost, a state-of-the-art gradient boosting decision tree model. FL-XGBoost allows different entities to jointly train a model for a sensitive task without revealing their own data. Because the entities exchange decision tree models, FL-XGBoost significantly reduces the number of communications between them. In our experiments, we compare the performance of FL-XGBoost with FATE, a different federated learning approach to XGBoost. The experimental results show that the proposed method achieves high prediction accuracy with less communication even when the number of entities increases.
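
The idea of exchanging decision tree models instead of raw data can be illustrated with a minimal sketch. It is not the FL-XGBoost protocol itself, whose details are not given in the abstract. The sketch assumes that parties take turns, that each party fits one tree to the residuals of the shared ensemble on its own local data, and that the only thing passed on is the ensemble. It uses plain squared-error boosting with depth-1 trees in place of the XGBoost objective and includes no encryption or other privacy mechanism. All names (Stump, fit_stump, federated_boost) are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class Stump:
    """A depth-1 regression tree; in this sketch, trees are the only
    objects that ever leave a party (an assumption, not the paper's spec)."""
    feature: int
    threshold: float
    left: float
    right: float

    def predict(self, x: Sequence[float]) -> float:
        return self.left if x[self.feature] <= self.threshold else self.right


def predict(model: List[Stump], x: Sequence[float], lr: float = 0.5) -> float:
    """Ensemble prediction: learning-rate-scaled sum of all tree outputs."""
    return sum(lr * tree.predict(x) for tree in model)


def fit_stump(X: List[List[float]], residuals: List[float]) -> Stump:
    """Choose the split that minimises squared error on local residuals."""
    best = None
    for f in range(len(X[0])):
        for thr in sorted({row[f] for row in X}):
            left = [r for row, r in zip(X, residuals) if row[f] <= thr]
            right = [r for row, r in zip(X, residuals) if row[f] > thr]
            if not left or not right:
                continue
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or err < best[0]:
                best = (err, Stump(f, thr, lm, rm))
    if best is None:  # no valid split: fall back to predicting the mean
        mean = sum(residuals) / len(residuals)
        return Stump(0, float("inf"), mean, mean)
    return best[1]


Party = Tuple[List[List[float]], List[float]]  # (local features, local labels)


def federated_boost(parties: List[Party], rounds: int, lr: float = 0.5):
    """Each party in turn adds one tree trained on its own data only.
    Returns the shared model and the number of model hand-offs."""
    model: List[Stump] = []
    messages = 0
    for _ in range(rounds):
        for X, y in parties:
            residuals = [yi - predict(model, xi, lr) for xi, yi in zip(X, y)]
            model.append(fit_stump(X, residuals))
            messages += 1  # one model hand-off per party per round
    return model, messages


# Toy data: two parties hold disjoint samples of the same task (y = 1 if x > 0.5).
parties = [
    ([[0.1], [0.2], [0.8], [0.9]], [0.0, 0.0, 1.0, 1.0]),
    ([[0.3], [0.4], [0.6], [0.7]], [0.0, 0.0, 1.0, 1.0]),
]
model, messages = federated_boost(parties, rounds=3)
print(messages, predict(model, [0.9]), predict(model, [0.1]))
```

In this sketch the number of messages grows with the number of trees and parties, not with the number of samples or candidate splits, which is one plausible reading of why exchanging whole tree models keeps communication low.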

Acknowledgement

We would like to thank Associate Professor Toshiaki Omori and the members of the National Institute of Information and Communications Technology (NICT) for their helpful advice and support in writing this paper. This research has been accomplished through the project “Social Implementation of Privacy-Preserving Data Analytics” (JPMJCR19F6) in the JST CREST research area “Development and Integration of Artificial Intelligence Technologies for Innovation Acceleration”.

Author information

Authors and Affiliations

Graduate School of Engineering, Kobe University, Kobe, Japan
Fuki Yamamoto & Seiichi Ozawa
National Institute of Information and Communications Technology, Tokyo, Japan
Lihua Wang
Center for Mathematical and Data Sciences, Kobe University, Kobe, Japan
Seiichi Ozawa

Corresponding author

Correspondence to Seiichi Ozawa.

Editor information

Editors and Affiliations

Department of AI, Ping An Life, Shenzhen, China
Haiqin Yang
Faculty of Information Technology, King Mongkut’s Institute of Technology Ladkrabang, Bangkok, Thailand
Kitsuchart Pasupa
City University of Hong Kong, Kowloon, China
Andrew Chi-Sing Leung
Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, Hong Kong
James T. Kwok
School of Information Technology, King Mongkut’s University of Technology Thonburi, Bangkok, Thailand
Jonathan H. Chan
The Chinese University of Hong Kong, New Territories, Hong Kong
Irwin King

About this paper

Cite this paper

Yamamoto, F., Wang, L., Ozawa, S. (2020). New Approaches to Federated XGBoost Learning for Privacy-Preserving Data Analysis. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Lecture Notes in Computer Science, vol 12533. Springer, Cham. https://doi.org/10.1007/978-3-030-63833-7_47

DOI: https://doi.org/10.1007/978-3-030-63833-7_47
Published: 20 November 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63832-0
Online ISBN: 978-3-030-63833-7
eBook Packages: Computer Science, Computer Science (R0)