research-article

Public Access

Privacy-Preserving Distributed Multi-Task Learning with Asynchronous Updates

Authors:

Inci M. Baytas,

Jiayu ZhouAuthors Info & Claims

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Pages 1195 - 1204

https://doi.org/10.1145/3097983.3098152

Published: 13 August 2017 Publication History

Abstract

Many data mining applications involve a set of related learning tasks. Multi-task learning (MTL) is a learning paradigm that improves generalization performance by transferring knowledge among those tasks. MTL has attracted so much attention in the community, and various algorithms have been successfully developed. Recently, distributed MTL has also been studied for related tasks whose data is distributed across different geographical regions. One prominent challenge of the distributed MTL frameworks is to maintain the privacy of the data. The distributed data may contain sensitive and private information such as patients' records and registers of a company. In such cases, distributed MTL frameworks are required to preserve the privacy of the data. In this paper, we propose a novel privacy-preserving distributed MTL framework to address this challenge. A privacy-preserving proximal gradient algorithm, which asynchronously updates models of the learning tasks, is introduced to solve a general class of MTL formulations. The proposed asynchronous approach is robust against network delays and provides a guaranteed differential privacy through carefully designed perturbation. Theoretical guarantees of the proposed algorithm are derived and supported by the extensive experimental results.

References

[1]

Rie Kubota Ando and Tong Zhang 2005. A framework for learning predictive structures from multiple tasks and unlabeled data. Journal of Machine Learning Research Vol. 6, Nov (2005), 1817--1853.

Digital Library

[2]

Andreas Argyriou, Theodoros Evgeniou, and Massimiliano Pontil 2008. Convex multi-task feature learning. Machine Learning, Vol. 73, 3 (2008), 243--272.

Digital Library

[3]

Richard G Baraniuk. 2007. Compressive sensing [lecture notes]. IEEE signal processing magazine Vol. 24, 4 (2007), 118--121.

[4]

Inci M. Baytas, Ming Yan, Anil K. Jain, and Jiayu Zhou. 2016. Asynchronous Multi-task Learning. In 2016 IEEE 16th International Conference on Data Mining (ICDM). 11--20.

[5]

Amir Beck and Marc Teboulle 2009. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM journal on imaging sciences Vol. 2, 1 (2009), 183--202.

Digital Library

[6]

Amos Beimel, Shiva Prasad Kasiviswanathan, and Kobbi Nissim. 2010. Bounds on the sample complexity for private learning and private data release Theory of Cryptography Conference. Springer, 437--454.

[7]

Jérôme Bolte, Shoham Sabach, and Marc Teboulle. 2014. Proximal alternating linearized minimization for nonconvex and nonsmooth problems. Mathematical Programming Vol. 146, 1--2 (2014), 459--494.

Digital Library

[8]

Rich Caruana. 1998. Multitask learning. Learning to learn. Springer, 95--133.

[9]

Kamalika Chaudhuri, Claire Monteleoni, and Anand D Sarwate. 2011. Differentially private empirical risk minimization. The Journal of Machine Learning Research Vol. 12 (2011), 1069--1109.

Digital Library

[10]

Kamalika Chaudhuri, Anand D Sarwate, and Kaushik Sinha. 2013. A near-optimal algorithm for differentially-private principal components. The Journal of Machine Learning Research Vol. 14, 1 (2013), 2905--2943.

Digital Library

[11]

Jianhui Chen, Lei Tang, Jun Liu, and Jieping Ye. 2009. A convex formulation for learning shared structures from multiple tasks Proceedings of the 26th Annual International Conference on Machine Learning. ACM, 137--144.

[12]

Jianhui Chen, Jiayu Zhou, and Jieping Ye 2011. Integrating low-rank and group-sparse structures for robust multi-task learning Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 42--50.

[13]

Damek Davis. 2016. The Asynchronous PALM Algorithm for Nonsmooth Nonconvex Problems. arXiv preprint arXiv:1604.00526 (2016).

[14]

Damek Davis, Brent Edmunds, and Madeleine Udell. 2016. The Sound of APALM Clapping: Faster Nonsmooth Nonconvex Optimization with Stochastic Asynchronous PALM. arXiv preprint arXiv:1606.02338 (2016).

[15]

Francesco Dinuzzo, Gianluigi Pillonetto, and Giuseppe De Nicolao 2011. Client--server multitask learning from distributed datasets. IEEE Transactions on Neural Networks Vol. 22, 2 (2011), 290--303.

Digital Library

[16]

Cynthia Dwork. 2006. Differential privacy. Automata, languages and programming. Springer, 1--12.

Digital Library

[17]

Cynthia Dwork. 2008. An ad omnia approach to defining and achieving private data analysis. Privacy, Security, and Trust in KDD. Springer, 1--13.

[18]

Cynthia Dwork, Frank McSherry, Kobbi Nissim, and Adam Smith 2006. Calibrating noise to sensitivity in private data analysis. Theory of Cryptography. Springer, 265--284.

Digital Library

[19]

Cynthia Dwork and Aaron Roth 2013. The algorithmic foundations of differential privacy. Theoretical Computer Science Vol. 9, 3--4 (2013), 211--407.

[20]

Cynthia Dwork, Guy N Rothblum, and Salil Vadhan. 2010. Boosting and differential privacy. In Foundations of Computer Science (FOCS), 2010 51st Annual IEEE Symposium on. IEEE, 51--60.

Digital Library

[21]

Theodoros Evgeniou and Massimiliano Pontil 2004. Regularized multi--task learning. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 109--117.

Digital Library

[22]

Arik Friedman and Assaf Schuster 2010. Data mining with differential privacy. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 493--502.

Digital Library

[23]

Gene H Golub and Charles F Van Loan 2012. Matrix computations. Vol. Vol. 3. JHU Press.

[24]

Pinghua Gong, Jieping Ye, and Changshui Zhang 2012. Robust multi-task feature learning. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 895--903.

Digital Library

[25]

Sunil Kumar Gupta, Santu Rana, and Svetha Venkatesh. 2016. Differentially Private Multi-task Learning. In Pacific-Asia Workshop on Intelligence and Security Informatics. Springer, 101--113.

Digital Library

[26]

Prateek Jain and Abhradeep Thakurta 2013. Differentially private learning with kernels. In Proceedings of the 30th International Conference on Machine Learning (ICML-13). 118--126.

[27]

Ali Jalali, Sujay Sanghavi, Chao Ruan, and Pradeep K Ravikumar 2010. A dirty model for multi-task learning. In Advances in Neural Information Processing Systems. 964--972.

[28]

Shuiwang Ji and Jieping Ye 2009. An accelerated gradient method for trace norm minimization Proceedings of the 26th annual international conference on machine learning. ACM, 457--464.

[29]

Xin Jin, Ping Luo, Fuzhen Zhuang, Jia He, and Qing He 2015. Collaborating between Local and Global Learning for Distributed Online Multiple Tasks Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, 113--122.

[30]

Shiva P Kasiviswanathan and Adam Smith 2014. On the Semantics' of Differential Privacy: A Bayesian Formulation. Journal of Privacy and Confidentiality Vol. 6, 1 (2014), 1.

[31]

Seyoung Kim and Eric P Xing 2010. Tree-guided group lasso for multi-task regression with structured sparsity. (2010).

[32]

Matt J Kusner, Jacob R Gardner, Roman Garnett, and Kilian Q Weinberger 2015. Differentially Private Bayesian Optimization. In ICML. 918--927.

[33]

Chao Li, Michael Hay, Vibhor Rastogi, Gerome Miklau, and Andrew McGregor 2010. Optimizing linear counting queries under differential privacy Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. ACM, 123--134.

[34]

Sulin Liu, Sinno Jialin Pan, and Qirong Ho 2016. Distributed Multi-task Relationship Learning. (2016).

Cited By

Hoang BPang YLiang SZhan LThompson PZhou JBaeza-Yates RBonchi F(2024)Distributed Harmonization: Federated Clustered Batch Effect Adjustment and GeneralizationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671590(5105-5115)Online publication date: 25-Aug-2024
https://dl.acm.org/doi/10.1145/3637528.3671590
Zhang XWang Q(2024)A Graph-Assisted Framework for Multiple Graph LearningIEEE Transactions on Signal and Information Processing over Networks10.1109/TSIPN.2024.335223610(162-178)Online publication date: 2024
https://doi.org/10.1109/TSIPN.2024.3352236
Wang BChen YLi FSong JLu RDuan PTian Z(2024)Privacy-Preserving Convolutional Neural Network Classification Scheme With Multiple KeysIEEE Transactions on Services Computing10.1109/TSC.2023.334929817:1(322-335)Online publication date: Jan-2024
https://doi.org/10.1109/TSC.2023.3349298
Show More Cited By

Index Terms

Privacy-Preserving Distributed Multi-Task Learning with Asynchronous Updates

Recommendations

Privacy-Preserving Distributed Multi-Task Learning against Inference Attack in Cloud Computing
Because of the powerful computing and storage capability in cloud computing, machine learning as a service (MLaaS) has recently been valued by the organizations for machine learning training over some related representative datasets. When these datasets ...
A review of privacy preserving models for multi-party data release framework
WIR '16: Proceedings of the ACM Symposium on Women in Research 2016

Nowadays, with the improvement of internet technology and advancement in distributed computing data is increasing rapidly. There is a need of information sharing between organizations. Ideally, we wish to share data from multiple private databases and ...
MetaMorphosis: Task-oriented Privacy Cognizant Feature Generation for Multi-task Learning
IoTDI '23: Proceedings of the 8th ACM/IEEE Conference on Internet of Things Design and Implementation

With the growth of computer vision applications, deep learning, and edge computing contribute to ensuring practical collaborative intelligence (CI) by distributing the workload among edge devices and the cloud. However, running separate single-task ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 2017

2240 pages

ISBN:9781450348874

DOI:10.1145/3097983

General Chairs:
Stan Matwin
Dalhousie University
,
Shipeng Yu
LinkedIn
,
Faisal Farooq
IBM

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 August 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Conference

KDD '17

Sponsor:

KDD '17: The 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 13 - 17, 2017

NS, Halifax, Canada

Acceptance Rates

KDD '17 Paper Acceptance Rate 64 of 748 submissions, 9%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Sponsor:
sigkdd
sigkdd

The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 3 - 7, 2025

Toronto , ON , Canada

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

41
Total Citations
View Citations
1,645
Total Downloads

Downloads (Last 12 months)118
Downloads (Last 6 weeks)14

Reflects downloads up to 03 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Hoang BPang YLiang SZhan LThompson PZhou JBaeza-Yates RBonchi F(2024)Distributed Harmonization: Federated Clustered Batch Effect Adjustment and GeneralizationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671590(5105-5115)Online publication date: 25-Aug-2024
https://dl.acm.org/doi/10.1145/3637528.3671590
Zhang XWang Q(2024)A Graph-Assisted Framework for Multiple Graph LearningIEEE Transactions on Signal and Information Processing over Networks10.1109/TSIPN.2024.335223610(162-178)Online publication date: 2024
https://doi.org/10.1109/TSIPN.2024.3352236
Wang BChen YLi FSong JLu RDuan PTian Z(2024)Privacy-Preserving Convolutional Neural Network Classification Scheme With Multiple KeysIEEE Transactions on Services Computing10.1109/TSC.2023.334929817:1(322-335)Online publication date: Jan-2024
https://doi.org/10.1109/TSC.2023.3349298
Xu HSeng KAng LSmith J(2024)Decentralized and Distributed Learning for AIoT: A Comprehensive Review, Emerging Challenges, and OpportunitiesIEEE Access10.1109/ACCESS.2024.342221112(101016-101052)Online publication date: 2024
https://doi.org/10.1109/ACCESS.2024.3422211
Liang JSu S(2024)DP-FedEwc: Differentially private federated elastic weight consolidation for model personalizationKnowledge-Based Systems10.1016/j.knosys.2024.112401303(112401)Online publication date: Nov-2024
https://doi.org/10.1016/j.knosys.2024.112401
Mozafari MMoattar M(2024)A Hybrid Fuzzy Deep Belief Network Extreme Learning Machine Framework With Hyperbolic Secant Activation Function for Robust Semi‐Supervised Sentiment ClassificationApplied AI Letters10.1002/ail2.1026:1Online publication date: 13-Oct-2024
https://doi.org/10.1002/ail2.102
Zhang KCao X(2023)Online Power Control for Distributed Multitask Learning Over Noisy Fading Wireless ChannelsIEEE Transactions on Signal Processing10.1109/TSP.2023.332279171(3679-3694)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.1109/TSP.2023.3322791
Ma HGuo HLau V(2023)Communication-Efficient Federated Multitask Learning Over Wireless NetworksIEEE Internet of Things Journal10.1109/JIOT.2022.320131010:1(609-624)Online publication date: 1-Jan-2023
https://doi.org/10.1109/JIOT.2022.3201310
Zhang KLuan XDing FLiu F(2023)Distributed online multi‐task sparse identification for multiple systems with asynchronous updatesInternational Journal of Robust and Nonlinear Control10.1002/rnc.694233:18(11242-11256)Online publication date: 23-Aug-2023
https://doi.org/10.1002/rnc.6942
Ouyang XXie ZZhou JXing GHuang J(2022)ClusterFL: A Clustering-based Federated Learning System for Human Activity RecognitionACM Transactions on Sensor Networks10.1145/355498019:1(1-32)Online publication date: 8-Dec-2022
https://dl.acm.org/doi/10.1145/3554980
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten