Audio-based event recognition system for smart homes

Building an acoustic-based event recognition system for smart homes is a challenging task due to the lack of high-level structures in environmental sounds. In particular, the selection of effective features is still an open problem. We make an important step toward this goal by showing that the combination of Mel-Frequency Cepstral Coefficients, Zero-Crossing Rate, and Discrete Wavelet Transform features can achieve an F1 score of 96.5% and a recognition accuracy of 97.8% with a gradient boosting classifier for ambient sounds recorded in a kitchen environment.

[1]  Saeed Vaseghi,et al.  Comparing noise compensation methods for robust prediction of acoustic speech features from MFCC vectors in noise , 2008, 2008 16th European Signal Processing Conference.

[2]  Milos Cernak,et al.  Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[3]  Lian-Wen Jin,et al.  Activity recognition from acceleration data using AR model representation and SVM , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[4]  M. Marschollek,et al.  Sensor-based Fall Risk Assessment – an Expert ‘to go’ , 2011, Methods of Information in Medicine.

[5]  Vesa T. Peltonen,et al.  Computational auditory scene recognition , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Inma Hernáez,et al.  Audio Classification Techniques in Home Environments for Elderly/Dependant People , 2010, ICCHP.

[7]  R Haux,et al.  Measurement of accelerometry-based gait parameters in people with and without dementia in the field: a technical feasibility study. , 2013, Methods of information in medicine.

[8]  George Demiris,et al.  A systematic review of telehealth tools and interventions to support family caregivers , 2015, Journal of telemedicine and telecare.

[9]  Patrick Olivier,et al.  Feature Learning for Activity Recognition in Ubiquitous Computing , 2011, IJCAI.

[10]  Francesc Alías,et al.  homeSound: Real-Time Audio Event Detection Based on High Performance Computing for Behaviour and Surveillance Remote Monitoring , 2017, Sensors.

[11]  M. Hasan,et al.  Energy efficient novel architectures for the lifting-based discrete wavelet transform , 2007 .

[12]  Jesse Hoey,et al.  Sensor-Based Activity Recognition , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[13]  L. Chiari,et al.  Quantification of Motor Impairment in Parkinson's Disease Using an Instrumented Timed Up and Go Test , 2013, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[14]  Ashutosh Datar,et al.  Speaker Independent Speech Recognition using MFCC with Cubic-Log Compression and VQ Analysis , 2014 .

[15]  James M. Rehg,et al.  A Scalable Approach to Activity Recognition based on Object Use , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[16]  Vesa T. Peltonen,et al.  Audio-based context recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  Mohamed Atibi,et al.  Comparison of MFCC and DWT features extractors applied to PCG classification , 2016, 2016 11th International Conference on Intelligent Systems: Theories and Applications (SITA).

[18]  Miguel A. Labrador,et al.  A Survey on Human Activity Recognition using Wearable Sensors , 2013, IEEE Communications Surveys & Tutorials.

[19]  Tae-Seong Kim,et al.  Accelerometer’s position independent physical activity recognition system for long-term activity monitoring in the elderly , 2010, Medical & Biological Engineering & Computing.

[20]  Jont B. Allen,et al.  Short term spectral analysis, synthesis, and modification by discrete Fourier transform , 1977 .

[21]  Suleyman Bilgin,et al.  The impact of Daubechies Wavelet performances on Ventricular Tachyarrhythmia Patients for determination of dominant frequency bands in HRV , 2009, 2009 14th National Biomedical Engineering Meeting.

[22]  J. Riekki,et al.  Auditory Context Recognition Using SVMs , 2008, 2008 The Second International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies.

[23]  David Wetherall,et al.  Recognizing daily activities with RFID-based sensors , 2009, UbiComp.

[24]  Qiang Yang,et al.  Sensor-Based Abnormal Human-Activity Detection , 2008, IEEE Transactions on Knowledge and Data Engineering.

[25]  Chris D. Nugent,et al.  An Ontology-Based Hybrid Approach to Activity Modeling for Smart Homes , 2014, IEEE Transactions on Human-Machine Systems.

[26]  Younghwan Yoo,et al.  User Activity Recognition in Smart Homes Using Pattern Clustering Applied to Temporal ANN Algorithm , 2015, Sensors.

[27]  Arthur I. Karshmer,et al.  Living assistance systems: an ambient intelligence approach , 2006, ICSE.

[28]  Tae-Seong Kim,et al.  A Triaxial Accelerometer-Based Physical-Activity Recognition via Augmented-Signal Features and a Hierarchical Recognizer , 2010, IEEE Transactions on Information Technology in Biomedicine.

[29]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[30]  Buket D. Barkana,et al.  Voiced/Unvoiced Decision for Speech Signals Based on Zero-Crossing Rate and Energy , 2008, SCSS.

[31]  Tuomas Virtanen,et al.  Voice activity detection in the presence of breathing noise using neural network and hidden Markov model , 2008, 2008 16th European Signal Processing Conference.