Privacy-preserving speech analysis using emotion filtering at the edge: poster abstract

Voice-controlled devices and services are commonplace in consumer IoT. Cloud-based analysis services extract information from voice input using speech recognition techniques. Service providers can thereby build detailed profiles of users' demographics, preferences, and emotional states, and may therefore significantly compromise privacy. To address this problem, we propose a privacy-preserving intermediate layer between users and cloud services that sanitizes voice input directly on edge devices, forwarding only emotion-neutralized signals. We show that a trained model based on CycleGAN, deployed on a Raspberry Pi, enables identification and removal of sensitive emotional-state information by ~91%, with minimal loss in speech recognition accuracy.
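The edge-side sanitization pipeline described above can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: the function names are hypothetical, the framing step stands in for real spectral feature extraction, and a simple smoothing filter stands in for the trained CycleGAN generator that would map emotional speech features to emotion-neutral ones before forwarding.

```python
import numpy as np


def extract_features(audio: np.ndarray, frame_len: int = 256) -> np.ndarray:
    """Frame the waveform into fixed-length analysis windows.

    Stand-in for the spectral features (e.g. mel spectrograms) a real
    pipeline would compute on the edge device.
    """
    n_frames = len(audio) // frame_len
    return audio[: n_frames * frame_len].reshape(n_frames, frame_len)


def neutralize(features: np.ndarray) -> np.ndarray:
    """Placeholder for the trained CycleGAN generator.

    In the actual system, a generator network maps features of emotional
    speech to emotion-neutral features; here a moving-average filter is
    used purely so the sketch runs end to end.
    """
    kernel = np.ones(5) / 5.0
    return np.apply_along_axis(
        lambda frame: np.convolve(frame, kernel, mode="same"), 1, features
    )


def sanitize(audio: np.ndarray) -> np.ndarray:
    """Sanitize a voice signal on-device, then return the neutralized
    signal that would be forwarded to the cloud service."""
    features = extract_features(audio)
    neutral = neutralize(features)
    return neutral.reshape(-1)


# Example: sanitize one second of dummy audio before forwarding.
audio = np.random.default_rng(0).standard_normal(1024).astype(np.float32)
forwarded = sanitize(audio)
assert forwarded.shape == (1024,)
```

Only the output of `sanitize` ever leaves the device, so the cloud service receives a signal from which the emotion-bearing components have (in the real system) been filtered out, while the linguistic content needed for speech recognition is preserved.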