research-article

P³S²: practical secure protocol for speech data publishing

Authors:

Guanglin ZhangAuthors Info & Claims

ACM TURC '19: Proceedings of the ACM Turing Celebration Conference - China

Article No.: 33, Pages 1 - 5

https://doi.org/10.1145/3321408.3321595

Published: 17 May 2019 Publication History

Abstract

Speech data publishing discloses users' data privacy, and thus entails more privacy risks for users. Existing work sanitized the content, voice, and, voiceprint of speech data without considering the consistence among these three aspects, and therefore cannot protect users' data privacy. To this end, we propose a practical secure protocol for speech data publishing P³S², the first attempt towards taking the corrections among the three factors into consideration when it sanitizes users' speech data. To concrete, it designs a three-dimension sanitization that utilizes feature learning to capture the set of characteristics in each dimension, and then sanitizes speech data in each dimension using the learned features. As a result, the correlations among the three dimensions of the sanitized speech data are guaranteed. Furthermore, it utilizes two real world datasets, TED talks and LibriSpeech to evaluate the performance of P³S² in terms of the data privacy preservation.

References

[1]

W. Xinyu, H. Zhongzhao, F. Xinzhe, F. Luoyi, W. Xinbing, and L. Song-wu, "Social network de-anonymization with overlapping communities: Analysis, algorithm and experiments," in Proc. of IEEE INFOCOM, 2018.

[2]

W. Xudong, F. Luoyi, Y. Yuhang, F. Xinzhe, W. Xinbing, and C. Guihai, "GLP: A novel framework for group-level location promotion in geo-social networks," IEEE/ACM Transactions on Networking, vol. 26, no. 6, pp. 2870--2883, 2018.

Digital Library

[3]

"Apple admitted Siri voice data sharing," Available at https://goo.gl/rRHj4r.

[4]

"Samsung admitted voice data sharing," Available at https://goo.gl/bQPUDj.

[5]

J. Qian, F. Han, J. Hou, C. Zhang, Y. Wang, and X.-Y. Li, "Towards privacy-preserving speech data publishing," in Proc. of IEEE INFOCOM, 2018.

[6]

H. Corrigan-Gibbs and D. Boneh, "Prio: Private, robust, and scalable computation of aggregate statistics." in Proc. of NSDI, 2017.

Digital Library

[7]

"Apple's 'differential privacy' is about collecting your data-but not? Your data, " Available at https://www.wired.com/2016/06/apples-differential-privacy-collecting-data/.

[8]

D. Gillick, "Can conversational word usage be used to predict speaker demographics?" in Proc. of Eleventh Annual Conference of the International Speech Communication Association, 2010.

[9]

B. L. Brown and J. M. Bradshaw, "Towards a social psychology of voice variations," in Recent advances in language, communication, and social psychology. Routledge, 2018, pp. 144--181.

[10]

H. Zhao, Z. Yang, Z. Chen, and X. Zhang, "Automatic chinese personality recognition based on prosodic features," in Proc. of International Conference on Multimedia Modeling.

[11]

J. H. Hansen, K. Williams, and H. Boril, "Speaker height estimation from speech: Fusing spectral regression and statistical acoustic models," The Journal of the Acoustical Society of America, vol. 138, no. 2, pp. 1052--1067, 2015.

[12]

P. Patel, A. Chaudhari, R. Kale, and M. Pund, "Emotion recognition from speech with gaussian mixture models & via boosted gmm," International Journal of Research In Science & Engineering, vol. 3, 2017.

[13]

B. Schuller, S. Steidl, A. Batliner, F. Schiel, and J. Krajewski, "The interspeech 2011 speaker state challenge," in Proc. of Twelfth Annual Conference of the International Speech Communication Association, 2011.

[14]

H. Jiang, P. Zhao, and C. Wang, "RobLoP: Towards Robust Privacy Preserving against Location Dependent Attacks in Continuous LBS Queries," IEEE/ACM Transactions on Networking, vol. 26, no. 2, pp. 1018--1032, 2018.

Digital Library

[15]

P. Zhao, H. Jiang, J. C. S. Lui, C. Wang, F. Zeng, F. Xiao, and Z. Li, "P3-LOC:A Privacy-Preserving Paradigm-Driven Framework for Indoor Localization," IEEE/ACM Transactions on Networking, vol. 26, no. 6, pp. 2856--2869, 2018.

Digital Library

[16]

Y. Zhang, Q. Chen, and S. Zhong, "Privacy-preserving data aggregation in mobile phone sensing," IEEE Transactions on Information Forensics and Security, vol. 11, no. 5, pp. 980--992, 2016.

Digital Library

[17]

T. Araki, J. Furukawa, Y. Lindell, A. Nof, and K. Ohara, "High-throughput semi-honest secure three-party computation with an honest majority," in Proc. of ACM CCS, 2016.

Digital Library

[18]

E. Boyle, K.-M. Chung, and R. Pass, "Large-scale secure computation: Multi-party computation for (parallel) ram programs," in Proc. of Annual Cryptology Conference, 2015.

[19]

K. Bonawitz, V. Ivanov, B. Kreuter, A. Marcedone, H. B. McMahan, S. Patel, D. Ramage, A. Segal, and K. Seth, "Practical secure aggregation for privacy-preserving machine learning," in Proc. of ACM CCS, 2017.

Digital Library

[20]

G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, and C. Dyer, "Neural architectures for named entity recognition," 2016.

[21]

S. H. Mohammadi and A. Kain, An Overview of Voice Conversion Systems, 2017.

Digital Library

[22]

"Ted talks," Available at https://www.ted.com/.

[23]

V. Panayotov, G. Chen, D. Povey, and S. Khudanpur, "Librispeech: an asr corpus based on public domain audio books," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[24]

R. Shokri, M. Stronati, C. Song, and V. Shmatikov, "Membership inference attacks against machine learning models," in Proc. of IEEE Symposium on Security and Privacy (S&P).

P³S²: practical secure protocol for speech data publishing
1. Security and privacy
2. Social and professional topics
  1. Computing / technology policy

Recommendations

Personalised anonymity for microdata release

Individual privacy protection in the released data sets has become an important issue in recent years. The release of microdata provides a significant information resource for researchers, whereas the release of person‐specific data poses a threat to ...
k-anonymity: a model for protecting privacy

Consider a data holder, such as a hospital or a bank, that has a privately held collection of person-specific, field structured data. Suppose the data holder wants to share a version of the data with researchers. How can a data holder release a version ...
A Novel Differential Privacy Approach that Enhances Classification Accuracy
C3S2E '16: Proceedings of the Ninth International C* Conference on Computer Science & Software Engineering

In the recent past, there has been a tremendous increase of large repositories of data, examples being in healthcare data, consumer data from retailers, and airline passenger data. These data are continually being shared with interested parties, either ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ACM TURC '19: Proceedings of the ACM Turing Celebration Conference - China

May 2019

963 pages

ISBN:9781450371582

DOI:10.1145/3321408

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 May 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

ACM TURC 2019

ACM TURC 2019: ACM Turing Celebration Conference - China

May 17 - 19, 2019

Chengdu, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
64
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)1

Reflects downloads up to 14 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten