A method for preserving privacy during audio recordings by filtering speech

Continuous monitoring of sensor data from wearable devices such as smartwatches has been a hot topic recently and has many practical applications, particularly towards health monitoring. Both industry and academia have been using sensors such as accelerometers, gyroscopes and pulse oximeters, to infer and monitor, among other activities, physical activity, heart rate, sleep and walking. However, one sensor that tends to go unused, despite it's vast utility, is the microphone. The microphone has the ability to pick up potentially useful sounds such as coughing, wheezing, throat clearing, sneezing and more. The reason it often goes unused is because of the privacy concerns associated with continuously recording audio. Speech of the user and others around the user can be picked up by the microphone, which is generally considered highly invasive. In this paper, we propose a method for continuously recording audio while preserving privacy by filtering sensitive signals such as speech. We explain how this method can be applied to create classifiers and deployed for in-the-wild studies. We use cough detection as an example application and show that the proposed speech filtering method does not degrade the classification accuracy when detecting coughs.

[1]  Mani B. Srivastava,et al.  mSieve: differential behavioral privacy in time series of mobile sensor data , 2016, UbiComp.

[2]  Predrag V. Klasnja,et al.  Exploring Privacy Concerns about Personal Sensing , 2009, Pervasive.

[3]  Vigneshwaran Subbaraju,et al.  Demo: Smartwatch based Shopping Gesture Recognition , 2016, MobiSys '16 Companion.

[4]  Xiao Sun,et al.  SymDetector: detecting sound-related respiratory symptoms using smartphones , 2015, UbiComp.

[5]  Graeme Hirst,et al.  Detecting late-life depression in Alzheimer’s disease through analysis of speech and language , 2016, CLPsych@HLT-NAACL.

[6]  Daniel McDuff,et al.  Biowatch: Estimation of heart and breathing rates from wrist motions , 2015, 2015 9th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth).

[7]  Björn W. Schuller,et al.  Recent developments in openSMILE, the munich open-source multimedia feature extractor , 2013, ACM Multimedia.

[8]  John Adcock,et al.  Audio privacy: reducing speech intelligibility while preserving environmental sounds , 2008, ACM Multimedia.

[9]  Yang Liu,et al.  DBN-ivector Framework for Acoustic Emotion Recognition , 2016, INTERSPEECH.

[10]  S. J. Redmond,et al.  Sensors-Based Wearable Systems for Monitoring of Human Movement and Falls , 2012, IEEE Sensors Journal.

[11]  Rita Singh,et al.  Estimation of Children's Physical Characteristics from Their Voices , 2016, INTERSPEECH.

[12]  Ming Zeng,et al.  Sound Shredding: Privacy Preserved Audio Sensing , 2015, HotMobile.

[13]  Eric C. Larson,et al.  Accurate and privacy preserving cough sensing using a low-cost microphone , 2011, UbiComp '11.

[14]  Roland Göcke,et al.  An Investigation of Emotional Speech in Depression Classification , 2016, INTERSPEECH.