Abstract
In the scenario of the Voice Privacy challenge, anonymization is achieved by converting all utterances from a source speaker to match the same target identity; this identity being randomly selected. In this context, an attacker with maximum knowledge about the anonymization system can not infer the target identity. This article proposed to constrain the target selection to a specific identity, i.e., removing the random selection of identity, to evaluate the extreme threat under a white-box assessment (the attacker has complete knowledge about the system). Targeting a unique identity also allows us to investigate whether some target’s identities are better than others to anonymize a given speaker.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bahmaninezhad, F., Zhang, C., Hansen, J.H.L.: Convolutional neural network based speaker de-identification. In: Odyssey (2018)
Champion, P., Jouvet, D., Larcher, A.: A study of F0 modification for X-vector based speech pseudonymization across gender. In: PPAI 2021 - The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence (2021)
Fang, F., et al.: Speaker anonymization using X-vector and neural waveform models. In: 10th ISCA Speech Synthesis Workshop (2019)
Gomez-Barrero, M., Galbally, J., Rathgeb, C., Busch, C.: General framework to evaluate unlinkability in biometric template protection systems. IEEE Trans. Inf. Forensics Secur. 13(6), 1406–1420 (2018)
Magariños, C., Lopez-Otero, P., Docio-Fernandez, L., Rodriguez-Banga, E., Erro, D., Garcia-Mateo, C.: Reversible speaker de-identification using pre-trained transformation functions. Comput. Speech Lang. 46, 36–52 (2017)
Maouche, M., Srivastava, B.M.L., Vauquier, N., Bellet, A., Tommasi, M., Vincent, E.: A comparative study of speech anonymization metrics. In: Interspeech (2020)
McAdams, S.: Spectral fusion, spectral parsing and the formation of the auditory image. Ph.D. thesis, Stanford (1984)
Nautsch, A., Jasserand, C., Kindt, E., Todisco, M., Trancoso, I., Evans, N.: The GDPR & speech data: reflections of legal and technology communities, first steps towards a common understanding. In: Interspeech (2019)
Noé, P.G., Bonastre, J.F., Matrouf, D., Tomashenko, N., Nautsch, A., Evans, N.: Speech pseudonymisation assessment using voice similarity matrices. In: Interspeech (2020)
European Parliament and of the Council: Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016. Regulation on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC. General Data Protection Regulation (2016)
Povey, D., et al.: The kaldi speech recognition toolkit. In: IEEE Workshop on Automatic Speech Recognition and Understanding (2011)
Srivastava, B.M.L., Bellet, A., Tommasi, M., Vincent, E.: Privacy-preserving adversarial representation learning in ASR: reality or illusion? In: Interspeech (2019)
Srivastava, B.M.L., et al.: Privacy and utility of x-vector based speaker anonymization. Trans. Audio Speech Lang. Process. (2021)
Srivastava, B.M.L., et al.: Design choices for x-vector based speaker anonymization. In: Interspeech (2020)
Srivastava, B.M.L., Vauquier, N., Sahidullah, M., Bellet, A., Tommasi, M., Vincent, E.: Evaluating voice conversion-based privacy protection against informed attackers. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (2020)
Sun, L., Li, K., Wang, H., Kang, S., Meng, H.: Phonetic posteriorgrams for many-to-one voice conversion without parallel data training. In: IEEE International Conference on Multimedia and Expo (2016)
Tomashenko, N., et al.: Introducing the VoicePrivacy initiative. In: Interspeech (2020)
Acknowledgments
This work was supported in part by the French National Research Agency under project DEEP-PRIVACY (ANR-18-CE23-0018) and Région Grand Est. Experiments were carried out using the Grid’5000 testbed, supported by a scientific interest group hosted by Inria and including CNRS, RENATER and several Universities as well as other organizations.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Champion, P., Jouvet, D., Larcher, A. (2021). Evaluating X-Vector-Based Speaker Anonymization Under White-Box Assessment. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2021. Lecture Notes in Computer Science(), vol 12997. Springer, Cham. https://doi.org/10.1007/978-3-030-87802-3_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-87802-3_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87801-6
Online ISBN: 978-3-030-87802-3
eBook Packages: Computer ScienceComputer Science (R0)