Privacy preserving speech analysis using emotion filtering at the edge: poster abstract

Authors: Ranya Aloufi, Hamed Haddadi, David Boyle

SenSys '19: Proceedings of the 17th Conference on Embedded Networked Sensor Systems

Pages 426 - 427

https://doi.org/10.1145/3356250.3361947

Published: 10 November 2019

Abstract

Voice-controlled devices and services are commonplace in consumer IoT. Cloud-based analysis services extract information from voice input using speech recognition techniques. Service providers can therefore build detailed profiles of users' demographics, preferences, and emotional states, which may significantly compromise privacy. To address this problem, we propose a privacy-preserving intermediate layer between users and cloud services that sanitizes voice input directly on edge devices and forwards only the neutralized signal. We show that a trained model, based on CycleGAN and deployed on a Raspberry Pi, enables identification and removal of sensitive emotional state information by ~91%, with minimal loss of speech recognition accuracy.
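The intermediate layer described above can be pictured as a small wrapper that sits between the microphone and the cloud client. The following is a minimal structural sketch only, not the authors' implementation: the names `EdgeFilter`, `neutralize`, `forward`, and `placeholder_neutralizer` are hypothetical, and the placeholder transform merely clips amplitudes where a trained CycleGAN-based converter would sit. It does not remove emotional content.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Signal = List[float]  # mono audio samples; a stand-in for a real waveform type


@dataclass
class EdgeFilter:
    """Hypothetical edge-side layer: sanitize first, then forward."""

    # Assumption: any callable mapping raw samples to neutralized samples,
    # e.g. a wrapper around a trained voice-conversion model.
    neutralize: Callable[[Sequence[float]], Signal]
    # Assumption: any callable that ships samples onward,
    # e.g. an upload to a cloud speech recognition service.
    forward: Callable[[Signal], None]

    def handle(self, raw: Sequence[float]) -> Signal:
        sanitized = self.neutralize(raw)
        self.forward(sanitized)  # the raw signal is never passed to forward
        return sanitized


def placeholder_neutralizer(raw: Sequence[float]) -> Signal:
    # Toy stand-in: clip each sample to [-0.5, 0.5].
    # A real deployment would run the trained model here instead.
    return [max(-0.5, min(0.5, s)) for s in raw]


# Usage: collect what would have been sent to the cloud in a list.
sent: List[Signal] = []
edge = EdgeFilter(neutralize=placeholder_neutralizer, forward=sent.append)
edge.handle([0.9, -0.8, 0.1])
```

Keeping the transform and the forwarder as injected callables means the sanitizing step can be swapped or tested on the device without changing the code that talks to the cloud service.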

Cited By

Vargas Martin M, Hung P (2022). Privacy-Preserving Speech Recognition. Encyclopedia of Machine Learning and Data Science, 1-6. Online publication date: 8-Mar-2022. https://doi.org/10.1007/978-1-4899-7502-7_984-1

Zhu H, Zhang Y, Guo X, Li X (2021). Anti Leakage: Protecting Privacy Hidden in Our Speech. 2021 7th International Conference on Big Data Computing and Communications (BigCom), 114-120. Online publication date: Aug-2021. https://doi.org/10.1109/BigCom53800.2021.00011

Dubois D, Kolcun R, Mandalari A, Paracha M, Choffnes D, Haddadi H (2020). When Speakers Are All Ears: Characterizing Misactivations of IoT Smart Speakers. Proceedings on Privacy Enhancing Technologies 2020:4, 255-276. Online publication date: 17-Aug-2020. https://doi.org/10.2478/popets-2020-0072

Index Terms

Privacy preserving speech analysis using emotion filtering at the edge: poster abstract
1. Computer systems organization
  1. Embedded and cyber-physical systems
    1. Embedded systems
2. Security and privacy

Published In

SenSys '19: Proceedings of the 17th Conference on Embedded Networked Sensor Systems

November 2019

472 pages

ISBN:9781450369503

DOI:10.1145/3356250

General Chairs: Raghu K. Ganti (IBM T.J. Watson), Xiaofan (Fred) Jiang (Columbia University)

Program Chairs: Gian Pietro Picco (University of Trento, Italy), Xia Zhou (Dartmouth College)

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Poster

Conference

SenSys '19: The 17th ACM Conference on Embedded Networked Sensor Systems

November 10 - 13, 2019

New York, New York

Acceptance Rates

Overall Acceptance Rate 198 of 990 submissions, 20%

Article Metrics

Total Citations: 4
Total Downloads: 372
Downloads (Last 12 months): 16
Downloads (Last 6 weeks): 0

Reflects downloads up to 20 Feb 2025

Citations

Tawakuli A, Kaiser D, Engel T (2020). Synchronized Preprocessing of Sensor Data. 2020 IEEE International Conference on Big Data (Big Data), 3522-3531. Online publication date: 10-Dec-2020. https://doi.org/10.1109/BigData50022.2020.9377900
