skip to main content
10.1145/3097983.3098152acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Public Access

Privacy-Preserving Distributed Multi-Task Learning with Asynchronous Updates

Published: 13 August 2017 Publication History

Abstract

Many data mining applications involve a set of related learning tasks. Multi-task learning (MTL) is a learning paradigm that improves generalization performance by transferring knowledge among those tasks. MTL has attracted so much attention in the community, and various algorithms have been successfully developed. Recently, distributed MTL has also been studied for related tasks whose data is distributed across different geographical regions. One prominent challenge of the distributed MTL frameworks is to maintain the privacy of the data. The distributed data may contain sensitive and private information such as patients' records and registers of a company. In such cases, distributed MTL frameworks are required to preserve the privacy of the data. In this paper, we propose a novel privacy-preserving distributed MTL framework to address this challenge. A privacy-preserving proximal gradient algorithm, which asynchronously updates models of the learning tasks, is introduced to solve a general class of MTL formulations. The proposed asynchronous approach is robust against network delays and provides a guaranteed differential privacy through carefully designed perturbation. Theoretical guarantees of the proposed algorithm are derived and supported by the extensive experimental results.

References

[1]
Rie Kubota Ando and Tong Zhang 2005. A framework for learning predictive structures from multiple tasks and unlabeled data. Journal of Machine Learning Research Vol. 6, Nov (2005), 1817--1853.
[2]
Andreas Argyriou, Theodoros Evgeniou, and Massimiliano Pontil 2008. Convex multi-task feature learning. Machine Learning, Vol. 73, 3 (2008), 243--272.
[3]
Richard G Baraniuk. 2007. Compressive sensing [lecture notes]. IEEE signal processing magazine Vol. 24, 4 (2007), 118--121.
[4]
Inci M. Baytas, Ming Yan, Anil K. Jain, and Jiayu Zhou. 2016. Asynchronous Multi-task Learning. In 2016 IEEE 16th International Conference on Data Mining (ICDM). 11--20.
[5]
Amir Beck and Marc Teboulle 2009. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM journal on imaging sciences Vol. 2, 1 (2009), 183--202.
[6]
Amos Beimel, Shiva Prasad Kasiviswanathan, and Kobbi Nissim. 2010. Bounds on the sample complexity for private learning and private data release Theory of Cryptography Conference. Springer, 437--454.
[7]
Jérôme Bolte, Shoham Sabach, and Marc Teboulle. 2014. Proximal alternating linearized minimization for nonconvex and nonsmooth problems. Mathematical Programming Vol. 146, 1--2 (2014), 459--494.
[8]
Rich Caruana. 1998. Multitask learning. Learning to learn. Springer, 95--133.
[9]
Kamalika Chaudhuri, Claire Monteleoni, and Anand D Sarwate. 2011. Differentially private empirical risk minimization. The Journal of Machine Learning Research Vol. 12 (2011), 1069--1109.
[10]
Kamalika Chaudhuri, Anand D Sarwate, and Kaushik Sinha. 2013. A near-optimal algorithm for differentially-private principal components. The Journal of Machine Learning Research Vol. 14, 1 (2013), 2905--2943.
[11]
Jianhui Chen, Lei Tang, Jun Liu, and Jieping Ye. 2009. A convex formulation for learning shared structures from multiple tasks Proceedings of the 26th Annual International Conference on Machine Learning. ACM, 137--144.
[12]
Jianhui Chen, Jiayu Zhou, and Jieping Ye 2011. Integrating low-rank and group-sparse structures for robust multi-task learning Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 42--50.
[13]
Damek Davis. 2016. The Asynchronous PALM Algorithm for Nonsmooth Nonconvex Problems. arXiv preprint arXiv:1604.00526 (2016).
[14]
Damek Davis, Brent Edmunds, and Madeleine Udell. 2016. The Sound of APALM Clapping: Faster Nonsmooth Nonconvex Optimization with Stochastic Asynchronous PALM. arXiv preprint arXiv:1606.02338 (2016).
[15]
Francesco Dinuzzo, Gianluigi Pillonetto, and Giuseppe De Nicolao 2011. Client--server multitask learning from distributed datasets. IEEE Transactions on Neural Networks Vol. 22, 2 (2011), 290--303.
[16]
Cynthia Dwork. 2006. Differential privacy. Automata, languages and programming. Springer, 1--12.
[17]
Cynthia Dwork. 2008. An ad omnia approach to defining and achieving private data analysis. Privacy, Security, and Trust in KDD. Springer, 1--13.
[18]
Cynthia Dwork, Frank McSherry, Kobbi Nissim, and Adam Smith 2006. Calibrating noise to sensitivity in private data analysis. Theory of Cryptography. Springer, 265--284.
[19]
Cynthia Dwork and Aaron Roth 2013. The algorithmic foundations of differential privacy. Theoretical Computer Science Vol. 9, 3--4 (2013), 211--407.
[20]
Cynthia Dwork, Guy N Rothblum, and Salil Vadhan. 2010. Boosting and differential privacy. In Foundations of Computer Science (FOCS), 2010 51st Annual IEEE Symposium on. IEEE, 51--60.
[21]
Theodoros Evgeniou and Massimiliano Pontil 2004. Regularized multi--task learning. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 109--117.
[22]
Arik Friedman and Assaf Schuster 2010. Data mining with differential privacy. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 493--502.
[23]
Gene H Golub and Charles F Van Loan 2012. Matrix computations. Vol. Vol. 3. JHU Press.
[24]
Pinghua Gong, Jieping Ye, and Changshui Zhang 2012. Robust multi-task feature learning. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 895--903.
[25]
Sunil Kumar Gupta, Santu Rana, and Svetha Venkatesh. 2016. Differentially Private Multi-task Learning. In Pacific-Asia Workshop on Intelligence and Security Informatics. Springer, 101--113.
[26]
Prateek Jain and Abhradeep Thakurta 2013. Differentially private learning with kernels. In Proceedings of the 30th International Conference on Machine Learning (ICML-13). 118--126.
[27]
Ali Jalali, Sujay Sanghavi, Chao Ruan, and Pradeep K Ravikumar 2010. A dirty model for multi-task learning. In Advances in Neural Information Processing Systems. 964--972.
[28]
Shuiwang Ji and Jieping Ye 2009. An accelerated gradient method for trace norm minimization Proceedings of the 26th annual international conference on machine learning. ACM, 457--464.
[29]
Xin Jin, Ping Luo, Fuzhen Zhuang, Jia He, and Qing He 2015. Collaborating between Local and Global Learning for Distributed Online Multiple Tasks Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, 113--122.
[30]
Shiva P Kasiviswanathan and Adam Smith 2014. On the Semantics' of Differential Privacy: A Bayesian Formulation. Journal of Privacy and Confidentiality Vol. 6, 1 (2014), 1.
[31]
Seyoung Kim and Eric P Xing 2010. Tree-guided group lasso for multi-task regression with structured sparsity. (2010).
[32]
Matt J Kusner, Jacob R Gardner, Roman Garnett, and Kilian Q Weinberger 2015. Differentially Private Bayesian Optimization. In ICML. 918--927.
[33]
Chao Li, Michael Hay, Vibhor Rastogi, Gerome Miklau, and Andrew McGregor 2010. Optimizing linear counting queries under differential privacy Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. ACM, 123--134.
[34]
Sulin Liu, Sinno Jialin Pan, and Qirong Ho 2016. Distributed Multi-task Relationship Learning. (2016).

Cited By

View all
  • (2024)Distributed Harmonization: Federated Clustered Batch Effect Adjustment and GeneralizationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671590(5105-5115)Online publication date: 25-Aug-2024
  • (2024)A Graph-Assisted Framework for Multiple Graph LearningIEEE Transactions on Signal and Information Processing over Networks10.1109/TSIPN.2024.335223610(162-178)Online publication date: 2024
  • (2024)Privacy-Preserving Convolutional Neural Network Classification Scheme With Multiple KeysIEEE Transactions on Services Computing10.1109/TSC.2023.334929817:1(322-335)Online publication date: Jan-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2017
2240 pages
ISBN:9781450348874
DOI:10.1145/3097983
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. asynchronous proximal optimization
  2. differential privacy
  3. multi-task learning

Qualifiers

  • Research-article

Funding Sources

Conference

KDD '17
Sponsor:

Acceptance Rates

KDD '17 Paper Acceptance Rate 64 of 748 submissions, 9%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)118
  • Downloads (Last 6 weeks)14
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Distributed Harmonization: Federated Clustered Batch Effect Adjustment and GeneralizationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671590(5105-5115)Online publication date: 25-Aug-2024
  • (2024)A Graph-Assisted Framework for Multiple Graph LearningIEEE Transactions on Signal and Information Processing over Networks10.1109/TSIPN.2024.335223610(162-178)Online publication date: 2024
  • (2024)Privacy-Preserving Convolutional Neural Network Classification Scheme With Multiple KeysIEEE Transactions on Services Computing10.1109/TSC.2023.334929817:1(322-335)Online publication date: Jan-2024
  • (2024)Decentralized and Distributed Learning for AIoT: A Comprehensive Review, Emerging Challenges, and OpportunitiesIEEE Access10.1109/ACCESS.2024.342221112(101016-101052)Online publication date: 2024
  • (2024)DP-FedEwc: Differentially private federated elastic weight consolidation for model personalizationKnowledge-Based Systems10.1016/j.knosys.2024.112401303(112401)Online publication date: Nov-2024
  • (2024)A Hybrid Fuzzy Deep Belief Network Extreme Learning Machine Framework With Hyperbolic Secant Activation Function for Robust Semi‐Supervised Sentiment ClassificationApplied AI Letters10.1002/ail2.1026:1Online publication date: 13-Oct-2024
  • (2023)Online Power Control for Distributed Multitask Learning Over Noisy Fading Wireless ChannelsIEEE Transactions on Signal Processing10.1109/TSP.2023.332279171(3679-3694)Online publication date: 1-Jan-2023
  • (2023)Communication-Efficient Federated Multitask Learning Over Wireless NetworksIEEE Internet of Things Journal10.1109/JIOT.2022.320131010:1(609-624)Online publication date: 1-Jan-2023
  • (2023)Distributed online multi‐task sparse identification for multiple systems with asynchronous updatesInternational Journal of Robust and Nonlinear Control10.1002/rnc.694233:18(11242-11256)Online publication date: 23-Aug-2023
  • (2022)ClusterFL: A Clustering-based Federated Learning System for Human Activity RecognitionACM Transactions on Sensor Networks10.1145/355498019:1(1-32)Online publication date: 8-Dec-2022
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media