Abstract
Although the release and analysis of high-dimensional data bring tremendous value to people, it causes great hidden danger to participants’ privacy in the meantime. Various privacy protection methods based on differential privacy have been proposed at present. However, most of them cannot simultaneously solve the problems of high computational overhead and privacy threats from untrusted servers caused by the curse of high dimensionality. Therefore, we propose a safer and more effective high-dimensional data release algorithm based on local differential privacy, which is referred to as PU_Bpub. It effectively preserves the dimensional correlation of the original high-dimensional data and reduces the communication overhead of synthetic data. Extensive experiments on real-world datasets demonstrate that our solution substantially outperforms the state-of-the-art techniques in terms of computational overhead, and the synthetic dataset has high utility.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ye, Q.Q., Meng, X.F., Zhu, M.J., et al.: Survey on local differential privacy. J. Soft. 29, 159–183 (2018)
Dwork, C.: Differential privacy in new settings. In: Proceedings of the Twenty-First Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 174–183. SIAM (2010)
Wang, N., Xiao, X.K., Yang, Y., et al.: PrivTrie: effective frequent term discovery under local differential privacy. In: Proceedings of IEEE ICDE, Piscataway, pp. 821–832. IEEE (2018)
Zhang, X., Chen, L., Jin, K., et al.: Private high-dimensional data publication with junction tree. J. Comput. Res. Dev. 55(12), 2794–2809 (2018)
Hardt, M., Roth, A.: Beyond worst-case analysis in private singular vector computation. In: Proceedings of the Forty-fifth Annual ACM Symposium on Theory of Computing, New York, USA, pp. 331–340. ACM (2013)
Chaudhuri, K., Sarwate, A.D., Sinha, K.: A nearoptimal algorithm for differentially-private principal components. J. Mach. Learn. Res. 14(1), 2905–2943 (2013)
Peng, C., Zhao, Y., Fan, M.: A differential private data publishing algorithm via principal component analysis based on maximum information coefficient. Netinfo Secur. 20(2), 37–48 (2020)
Sun, H., Yang, J., Cheng, X., et al.: A high-dimensional numeric data collection algorithm for local difference privacy based on random projection. Big Data Res. 6(01), 3–11 (2020)
Zhang, J., Cormode, G., Procopiuc, C.M., et al.: PrivBayes: private data release via Bayesian networks. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, New York, USA, pp. 1423–1434 (2014)
Ren, X.B., Yu, C.-M., Yu, W.R., et al.: LoPub: high-dimensional crowdsourced data publication with local differential privacy. IEEE Trans. Inf. Forensic Secur. 13, 2151–2166 (2018)
Wang, T., Yang, X., Ren, X., Yu, W., Yang, S.: Local private high-dimensional crowdsourced data release based on copula functions. IEEE Trans. Serv. Comput. 15(2), 778–792 (2019)
Min, X., Bolin, D., Tianhao, W., et al.: Collecting and analyzing data jointly from multiple services under local differential privacy. Proc. VLDB Endowmen. 13(11), 2760–2772 (2020)
Acknowledgement
This paper is supported by Inner Mongolia Natural Science Foundation (Grant No. 2018MS06026) and the Science and Technology Program of Inner Mongolia Autonomous Region (Grant No. 2019GG116).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Lin, A., Ma, X. (2022). PU_Bpub: High-Dimensional Data Release Mechanism Based on Spectral Clustering with Local Differential Privacy. In: Wang, L., Segal, M., Chen, J., Qiu, T. (eds) Wireless Algorithms, Systems, and Applications. WASA 2022. Lecture Notes in Computer Science, vol 13472. Springer, Cham. https://doi.org/10.1007/978-3-031-19214-2_48
Download citation
DOI: https://doi.org/10.1007/978-3-031-19214-2_48
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-19213-5
Online ISBN: 978-3-031-19214-2
eBook Packages: Computer ScienceComputer Science (R0)