Distributed Learning to Protect Privacy in Multi-centric Clinical Studies

Damiani, Andrea; Vallati, Mauro; Gatta, Roberto; Dinapoli, Nicola; Jochems, Arthur; Deist, Timo; van Soest, Johan; Dekker, Andre; Valentini, Vincenzo

doi:10.1007/978-3-319-19551-3_8

Distributed Learning to Protect Privacy in Multi-centric Clinical Studies

Andrea Damiani⁸,
Mauro Vallati⁹,
Roberto Gatta⁸,
Nicola Dinapoli⁸,
Arthur Jochems¹⁰,
Timo Deist¹⁰,
Johan van Soest¹⁰,
Andre Dekker¹⁰ &
…
Vincenzo Valentini⁸

Conference paper

3890 Accesses
14 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9105))

Abstract

Research in medicine has to deal with the growing amount of data about patients which are made available by modern technologies. All these data might be used to support statistical studies, and for identifying causal relations. To use these data, which are spread across hospitals, efficient merging techniques as well as policies to deal with this sensitive information are strongly needed. In this paper we introduce and empirically test a distributed learning approach, to train Support Vector Machines (SVM), that allows to overcome problems related to privacy and data being spread around. The introduced technique allows to train algorithms without sharing any patients-related information, ensuring privacy and avoids the development of merging tools. We tested this approach on a large dataset and we described results, in terms of convergence and performance; we also provide considerations about the features of an IT architecture designed to support distributed learning computations.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, J.: Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. Learn. 3(1), 1–122 (2011)
Article Google Scholar
Caragea, D.: Learning Classifiers from Distributed, Semantically Heterogeneous, Autonomous Data Sources. Ph.D. thesis (2004)
Google Scholar
Dantzig, G.B., Wolfe, P.: Decomposition principle for linear programs. Operations Research 8, 101–111 (1960)
Article MATH Google Scholar
Caragea, D., Reinoso, J., Silvescu, A., Honavar, V.: Statistics gathering for learning from distributed, heterogeneous and autonomous data sources (2003)
Google Scholar
Grant, M. C., Boyd, S. P.: Graph implementations for nonsmooth convex programs. In: Blondel, V.D., Boyd, S.P., Kimura, H. (eds.) Recent Advances in Learning and Control. LNCIS, vol. 371, pp. 95–110. Springer, Heidelberg (2008)
Chapter Google Scholar
Grant, M., Boyd, S.: Cvx: Matlab software for disciplined convex programming, version 2.1 (March 2014)
Google Scholar
Gummadi, S., Housri, N., Zimmers, T.A., Koniaris, L.G.: medical record: a balancing act of patient safety, privacy and health care delivery. The American Journal of the Medical Sciences 348(3), 238–243 (2014)
Article Google Scholar
Vaidya, J., Yu, H., Jiang, X.: Privacy-preserving SVM classification. Knowledge and Information Systems 14(2), 161–178 (2008)
Article Google Scholar
Lindell, Y., Pinkas, B.: Privacy preserving data mining. In: Bellare, M. (ed.) CRYPTO 2000. LNCS, vol. 1880, pp. 36–54. Springer, Heidelberg (2000)
Chapter Google Scholar
Nguyen, M.H., de la Torre, F.: Optimal feature selection for support vector machines. Smart Computing Review 3(43), 584–591 (2010)
Google Scholar
Mullen, K.M.: Continuous global optimization in r. Journal of Statistical Software 60(6) (2014)
Google Scholar
Nash, J.C., Varadhan, R.: Unifying optimization algorithms to aid software system users: optimx for r. Journal of Statistical Software 43(9), 1–14 (2011)
Google Scholar
Parikh, N., Boyd, S.: Proximal algorithms. Foundations and Trends in Optimization 1(3), 123–231 (2014)
Article Google Scholar
Que, J., Jiang, X., Ohno-Machado, L.: A collaborative framework for distributed privacy-preserving support vector machine learning. In: AMIA Annual Symposium Proceedings, vol. 2012, pp. 1350–1359 (2012)
Google Scholar
Koenker, R., Mizera, I.: Convex optimization in r. Journal of Statistical Software 60(5) (2014)
Google Scholar
Kumar, V., Minz, S.: Feature selection: A literature review. Smart Computing Review 4(3) (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Radiotherapy Department, Università Cattolica del Sacro Cuore, Rome, Italy
Andrea Damiani, Roberto Gatta, Nicola Dinapoli & Vincenzo Valentini
School of Computing and Engineering, University of Huddersfield, Huddersfield, UK
Mauro Vallati
Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Developmental Biology, Maastricht University Medical Centre, Maastricht, Netherlands
Arthur Jochems, Timo Deist, Johan van Soest & Andre Dekker

Authors

Andrea Damiani
View author publications
You can also search for this author in PubMed Google Scholar
Mauro Vallati
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Gatta
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Dinapoli
View author publications
You can also search for this author in PubMed Google Scholar
Arthur Jochems
View author publications
You can also search for this author in PubMed Google Scholar
Timo Deist
View author publications
You can also search for this author in PubMed Google Scholar
Johan van Soest
View author publications
You can also search for this author in PubMed Google Scholar
Andre Dekker
View author publications
You can also search for this author in PubMed Google Scholar
Vincenzo Valentini
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrea Damiani .

Editor information

Editors and Affiliations

University of Pennsylvania, Philadelphia, Pennsylvania, USA
John H. Holmes
University of Pavia, Pavia, Italy
Riccardo Bellazzi
University of Pavia, Pavia, Italy
Lucia Sacchi
University of Manchester, Manchester, United Kingdom
Niels Peek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Damiani, A. et al. (2015). Distributed Learning to Protect Privacy in Multi-centric Clinical Studies. In: Holmes, J., Bellazzi, R., Sacchi, L., Peek, N. (eds) Artificial Intelligence in Medicine. AIME 2015. Lecture Notes in Computer Science(), vol 9105. Springer, Cham. https://doi.org/10.1007/978-3-319-19551-3_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-19551-3_8
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19550-6
Online ISBN: 978-3-319-19551-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics