Skip to main content

Application of Probabilistic Common Set on an Open World Set for Vertical Federated Learning

  • Conference paper
  • First Online:
Parallel and Distributed Computing, Applications and Technologies (PDCAT 2022)

Abstract

Vertical federated learning (VFL) is a distributed machine learning technology that is suitable for model building in organizations across different industries. It enables the identification of a common set of data that co-occur across organizations. However, VFL uses private set intersection (PSI) protocols, which requires making all data shareable, and satisfying the data minimization principle in the General Data Protection Regulation is difficult. To mitigate noncompliance in privacy regulations, we propose a new VFL method that uses horizontal federated learning to identify the common set instead of PSI. The method consists of two concepts: The first is to use a common data structure between organizations to avoid using PSI. The second is to identify the common set from machine learning classifiers of unseen data of a certain class. Our proposed method considers that the data labeled as the desired class is unseen data and it is not in the common set. Experimental results show that the F-measure is 0.8 or higher in 40% of the common set ratios.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bendale, A., Boult, T.E.: Towards open set deep networks. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1563–1572. IEEE Computer Society, Los Alamitos, CA, USA (2016). https://doi.org/10.1109/CVPR.2016.173

  2. Bloom, B.H.: Space/time trade-offs in hash coding with allowable errors. Commun. ACM 13(7), 422–426 (1970). https://doi.org/10.1145/362686.362692

    Article  MATH  Google Scholar 

  3. Dhamija, A.R., Günther, M., Boult, T.E.: Reducing network agnostophobia. In: NeurIPS, pp. 9175–9186 (2018). https://proceedings.neurips.cc/paper/2018/hash/48db71587df6c7c442e5b76cc723169a-Abstract.html

  4. Egert, R., Fischlin, M., Gens, D., Jacob, S., Senker, M., Tillmanns, J.: Privately computing set-union and set-intersection cardinality via bloom filters. In: Foo, E., Stebila, D. (eds.) ACISP 2015. LNCS, vol. 9144, pp. 413–430. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19962-7_24

    Chapter  MATH  Google Scholar 

  5. Jiang, J.C., Kantarci, B., Oktug, S., Soyata, T.: Federated learning in smart city sensing: challenges and opportunities. Sensors 20(21), 6230 (2020). https://doi.org/10.3390/s20216230

    Article  Google Scholar 

  6. Kairouz, P., et al.: Advances and open problems in federated learning (2019). https://doi.org/10.48550/ARXIV.1912.04977

  7. Kholod, I., et al.: Open-source federated learning frameworks for IoT: a comparative review and analysis. Sensors 21(1), 167 (2021). https://doi.org/10.3390/s21010167

    Article  Google Scholar 

  8. Matan, O., et al.: Handwritten character recognition using neural network architectures. In: the 4th USPS Advanced Technology Conference, pp. 1003–1011 (1990)

    Google Scholar 

  9. Miyaji, A., Nagao, Y.: Privacy preserving data integration protocol. In: 2020 15th Asia Joint Conference on Information Security (AsiaJCIS), pp. 89–96 (2020). https://doi.org/10.1109/AsiaJCIS50894.2020.00025

  10. OpenMined: Pysyft (2022). https://www.openmined.org/

  11. Perera, P., Oza, P., Patel, V.M.: One-class classification: a survey. arXiv preprint arXiv:2101.03064 (2021)

  12. Shu, L., Xu, H., Liu, B.: DOC: deep open classification of text documents. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 2911–2916. Association for Computational Linguistics, Copenhagen, Denmark (2017). https://doi.org/10.18653/v1/D17-1314

  13. Yang, Q., Liu, Y., Chen, T., Tong, Y.: Federated machine learning: concept and applications. ACM Trans. Intell. Syst. Technol. 10(2), 1–19 (2019). https://doi.org/10.1145/3298981

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hiroshi Someda .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Someda, H., Osada, S., Kajikawa, Y. (2023). Application of Probabilistic Common Set on an Open World Set for Vertical Federated Learning. In: Takizawa, H., Shen, H., Hanawa, T., Hyuk Park, J., Tian, H., Egawa, R. (eds) Parallel and Distributed Computing, Applications and Technologies. PDCAT 2022. Lecture Notes in Computer Science, vol 13798. Springer, Cham. https://doi.org/10.1007/978-3-031-29927-8_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-29927-8_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-29926-1

  • Online ISBN: 978-3-031-29927-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics