MP2SDA: Multi-Party Parallelized Sparse Discriminant Learning

Published: 13 March 2020


Sparse Discriminant Analysis (SDA) has been widely used to improve the performance of classical Fisher’s Linear Discriminant Analysis in supervised metric learning, feature selection, and classification. With the growing need for distributed data collection, storage, and processing, adapting sparse discriminant learning to multi-party distributed computing environments has become an emerging research topic. This article proposes a novel multi-party SDA algorithm that can learn SDA models effectively without sharing any raw data or basic statistics among machines. The proposed algorithm (1) leverages the direct estimation of SDA to derive a distributed loss function for discriminant learning, (2) parameterizes the distributed loss function with local/global estimates through bootstrapping, and (3) approximates a global estimate of the linear discriminant projection vector by optimizing the “distributed bootstrapping loss function” with gossip-based stochastic gradient descent. Experimental results on both synthetic and real-world benchmark datasets show that our algorithm achieves performance comparable to the aggregated SDA and significantly outperforms the most recent distributed SDA in terms of accuracy and F1-score.
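To make the idea of gossip-based optimization of a direct-estimation loss more concrete, here is a minimal sketch in Python. It is not the article's algorithm: it assumes a two-class problem, takes the local loss to be (1/2) b'Σb − δ'b + λ‖b‖₁ (a common direct-estimation form for sparse LDA, assumed here), uses full local proximal gradient steps instead of stochastic ones, and omits the bootstrapping step entirely. All function names, parameters, and the synthetic data helper are hypothetical. Only the projection vectors are exchanged between parties; raw data and local statistics stay local.

```python
import random

def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1, applied elementwise."""
    return [max(abs(x) - t, 0.0) * (1.0 if x > 0 else -1.0) for x in v]

def local_stats(X0, X1):
    """Pooled covariance and mean difference from one party's two-class data."""
    d = len(X0[0])
    mu0 = [sum(r[j] for r in X0) / len(X0) for j in range(d)]
    mu1 = [sum(r[j] for r in X1) / len(X1) for j in range(d)]
    n = len(X0) + len(X1)
    cov = [[0.0] * d for _ in range(d)]
    for rows, mu in ((X0, mu0), (X1, mu1)):
        for r in rows:
            c = [r[j] - mu[j] for j in range(d)]
            for a in range(d):
                for b in range(d):
                    cov[a][b] += c[a] * c[b] / (n - 2)
    delta = [mu1[j] - mu0[j] for j in range(d)]
    return cov, delta

def gossip_sparse_lda(parties, lam=0.3, lr=0.1, rounds=300, seed=0):
    """Each party refines its own vector; one random pair averages per round."""
    rng = random.Random(seed)
    stats = [local_stats(X0, X1) for X0, X1 in parties]  # never leaves the party
    d = len(stats[0][1])
    betas = [[0.0] * d for _ in parties]
    for _ in range(rounds):
        # Local proximal gradient step on the assumed loss.
        for k, (cov, delta) in enumerate(stats):
            b = betas[k]
            grad = [sum(cov[a][j] * b[j] for j in range(d)) - delta[a]
                    for a in range(d)]
            step = [b[a] - lr * grad[a] for a in range(d)]
            betas[k] = soft_threshold(step, lr * lam)
        # Gossip step: two random parties exchange and average their vectors.
        i, j = rng.sample(range(len(parties)), 2)
        avg = [(x + y) / 2 for x, y in zip(betas[i], betas[j])]
        betas[i], betas[j] = avg, list(avg)
    return betas

def make_party(rng, n=200, d=4, shift=2.0):
    """Hypothetical synthetic data: the classes differ only in feature 0."""
    X0 = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(n)]
    X1 = [[rng.gauss(shift if j == 0 else 0, 1) for j in range(d)]
          for _ in range(n)]
    return X0, X1

if __name__ == "__main__":
    rng = random.Random(1)
    parties = [make_party(rng) for _ in range(3)]
    for k, beta in enumerate(gossip_sparse_lda(parties)):
        print(k, [round(x, 3) for x in beta])
```

In this toy setup each party's vector should concentrate its weight on the first feature, and the pairwise averaging keeps the parties' estimates close to one another without any party seeing another's samples, means, or covariance.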


Published In

ACM Transactions on Knowledge Discovery from Data  Volume 14, Issue 3
June 2020
381 pages

Accepted: 01 December 2019
Revised: 01 October 2019
Received: 01 March 2019


Author Tags

  distributed
  multi-party
  parallelized


Funding Sources

  NSF: CRII: CSR: NeuroMC---Parallel Online Scheduling of Mixed-Criticality Real-Time Systems via Neural Networks


