poster

A non-intrusive and adaptive speaker de-identification scheme using adversarial examples

Authors:
Meng Chen, Zhejiang University and ZJU-HIC
Li Lu, Zhejiang University
Jiadi Yu, Shanghai Jiao Tong University
Yingying Chen, Rutgers University
Zhongjie Ba, Zhejiang University
Feng Lin, Zhejiang University
Kui Ren, Zhejiang University

MobiCom '22: Proceedings of the 28th Annual International Conference on Mobile Computing And Networking, October 2022, Pages 853–855, https://doi.org/10.1145/3495243.3558260

Published: 14 October 2022

ABSTRACT

Voice data publishing exposes users to identity leakage, leaving them in a privacy-utility dilemma while they enjoy convenient voice services. Existing studies de-identify users' voices through direct modification or text-based re-synthesis, but these approaches yield inconsistent audibility for human participants and do not adapt to informed attacks. In this poster, we propose a non-intrusive and adaptive speaker de-identification scheme that balances the privacy and utility of voice services. We generate adversarial examples to conceal user identity from Automatic Speaker Identification (ASI). By learning a compact distribution with a conditional variational auto-encoder, our system enables on-demand target sampling and diverse identity transformation. We also exploit the acoustic masking effect to construct inaudible perturbations, thus preserving the speech content and perceptual quality. Experiments on 50 speakers show that our system achieves 98.2% successful de-identification on 4 mainstream ASIs, with an objective perceptual quality of 4.38 and a subjective mean opinion score of 4.56.
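
To make the core idea of the abstract concrete, the following is a minimal sketch of a targeted adversarial perturbation under a per-element amplitude cap. It is not the authors' system. The linear softmax "speaker classifier", the function name `targeted_perturbation`, the random feature vector, and the constant `bound` are all assumptions made for illustration. In particular, the target speaker is picked by a simple rule rather than sampled from a conditional variational auto-encoder, and `bound` is only a crude stand-in for a psychoacoustic masking threshold.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D logit vector.
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def targeted_perturbation(x, W, b, target, bound, step=0.01, iters=200):
    """Push a toy linear speaker classifier toward `target`.

    `bound` caps |delta| per element; it is a hypothetical stand-in
    for a masking threshold, not a real psychoacoustic model.
    """
    delta = np.zeros_like(x)
    onehot = np.zeros(W.shape[0])
    onehot[target] = 1.0
    for _ in range(iters):
        probs = softmax(W @ (x + delta) + b)
        if probs.argmax() == target:
            break  # classifier already outputs the target identity
        # Gradient of cross-entropy(target) w.r.t. the input of a linear model.
        grad = W.T @ (probs - onehot)
        # Signed gradient step, then project back inside the amplitude cap.
        delta = np.clip(delta - step * np.sign(grad), -bound, bound)
    return delta

# Toy setup (all values are illustrative assumptions).
rng = np.random.default_rng(0)
num_speakers, dim = 5, 64
W = rng.normal(size=(num_speakers, dim))
b = np.zeros(num_speakers)
x = rng.normal(size=dim)                   # stands in for voice features
true_id = int((W @ x + b).argmax())
target_id = (true_id + 1) % num_speakers   # placeholder for target sampling
bound = np.full(dim, 0.5)

delta = targeted_perturbation(x, W, b, target_id, bound)
adv_id = int((W @ (x + delta) + b).argmax())
print("original id:", true_id, "target id:", target_id, "id after perturbation:", adv_id)
```

Whether the perturbed input is classified as the target depends on how tight `bound` is relative to the classifier's decision margin, which mirrors the privacy-utility trade-off the abstract describes: a looser cap makes identity transformation easier but the perturbation more perceptible.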

Index Terms

1. Security and privacy
  1. Human and societal aspects of security and privacy
    1. Privacy protections

Published in

MobiCom '22: Proceedings of the 28th Annual International Conference on Mobile Computing And Networking
October 2022
932 pages
ISBN: 9781450391818
DOI: 10.1145/3495243

Copyright © 2022 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
adversarial example
privacy preservation
speaker de-identification
voice anonymization
Qualifiers
- poster
Acceptance Rates
Overall Acceptance Rate: 440 of 2,972 submissions, 15%