Acoustic monitoring of daily activities based on hidden Markov model and multidimensional scaling

Prolonging independence of seniors improves their quality of life and reduces caring costs. Monitoring health conditions and events increases the safety of the senior and is helpful for independent living. With the maturing of speech technology, it is now possible to monitor activities in daily living space via acoustic signals. The advantages include fewer disturbances and more privacy. This study aims to present an acoustic activated recognition framework to model the daily activity of seniors and provide quantitative evidence of physical functions and social interactivity for living support and the health-related quality of life assessment. Acoustic streams were recorded from designed scenarios within a living space. Fast acoustic pre-segmentation and transcription was implemented using the delta Bayesian information criterion. Hidden Markov model with a developed behavior grammar network was adopted to automatically recognize acoustic events. A Gaussian mixture model combined with multidimensional scaling was proposed for fast speaker diarization. Experimental results show high detection rates in both the recognition of acoustic events and speakers, revealing the feasibility of efficiently modeling daily activities and providing quantitative evidence of health condition and social connection. The case study also shows potential for activity monitoring in the course of caregiving and living independently.

[1]  Umi Kalsom Yusof,et al.  Hands-free messaging application (iSay-SMS): A proposed framework , 2010, 2010 International Conference on Science and Social Research (CSSR 2010).

[2]  D. Miller,et al.  Health-related quality of life , 2002, Multiple sclerosis.

[3]  Fabio Valente,et al.  Agglomerative information bottleneck for speaker diarization of meetings data , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[4]  John A. Stankovic,et al.  ALARM-NET: Wireless Sensor Networks for Assisted-Living and Residential Monitoring , 2006 .

[5]  Michel Vacher,et al.  Sound Classification in a Smart Room Environment: an Approach using GMM and HMM Methods , 2007 .

[6]  Michel Vacher,et al.  Development of Audio Sensing Technology for Ambient Assisted Living: Applications and Challenges , 2011, Int. J. E Health Medical Commun..

[7]  Christian A. Müller,et al.  A fast-match approach for robust, faster than real-time speaker diarization , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[8]  Hagai Aronowitz,et al.  A distance measure between GMMs based on the unscented transform and its application to speaker recognition , 2005, INTERSPEECH.

[9]  Ramesh A. Gopinath,et al.  Improved speaker segmentation and segments clustering using the bayesian information criterion , 1999, EUROSPEECH.

[10]  Shikha Dixit,et al.  Health-Related Quality of Life and Health Management , 2010 .

[11]  Chung-Hsien Wu,et al.  Automatic segmentation and identification of mixed-language speech using delta-BIC and LSA-based GMMs , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Hristijan Gjoreski,et al.  Three-layer Activity Recognition Combining Domain Knowledge and Meta-classification , 2013 .

[13]  Fabio Valente,et al.  Combination of agglomerative and sequential clustering for speaker diarization , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[15]  Mladen Russo,et al.  Android application for sending SMS messages with speech recognition interface , 2012, 2012 Proceedings of the 35th International Convention MIPRO.

[16]  Mike Davies,et al.  Filler models for Automatic Speech Recognition created from Hidden Markov Models using the K-Means algorithm , 2009, 2009 17th European Signal Processing Conference.

[17]  Wen-Hsien Fang,et al.  A video‐based human fall detection system for smart homes , 2010 .

[18]  A. Fleury,et al.  Sound and speech detection and classification in a Health Smart Home , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[19]  S. Chen,et al.  Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion , 1998 .

[20]  John A. Stankovic,et al.  Empath: a continuous remote emotional health monitoring system for depressive illness , 2011, Wireless Health.

[21]  C. N. Scanaill,et al.  A Review of Approaches to Mobility Telemonitoring of the Elderly in Their Living Environment , 2006, Annals of Biomedical Engineering.

[22]  Jean-Luc Gauvain,et al.  Improving Speaker Diarization , 2004 .

[23]  Douglas A. Reynolds,et al.  Approaches and applications of audio diarization , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..