Development of an automated speech recognition interface for personal emergency response systems

BackgroundDemands on long-term-care facilities are predicted to increase at an unprecedented rate as the baby boomer generation reaches retirement age. Aging-in-place (i.e. aging at home) is the desire of most seniors and is also a good option to reduce the burden on an over-stretched long-term-care system. Personal Emergency Response Systems (PERSs) help enable older adults to age-in-place by providing them with immediate access to emergency assistance. Traditionally they operate with push-button activators that connect the occupant via speaker-phone to a live emergency call-centre operator. If occupants do not wear the push button or cannot access the button, then the system is useless in the event of a fall or emergency. Additionally, a false alarm or failure to check-in at a regular interval will trigger a connection to a live operator, which can be unwanted and intrusive to the occupant. This paper describes the development and testing of an automated, hands-free, dialogue-based PERS prototype.MethodsThe prototype system was built using a ceiling mounted microphone array, an open-source automatic speech recognition engine, and a 'yes' and 'no' response dialog modelled after an existing call-centre protocol. Testing compared a single microphone versus a microphone array with nine adults in both noisy and quiet conditions. Dialogue testing was completed with four adults.Results and discussionThe microphone array demonstrated improvement over the single microphone. In all cases, dialog testing resulted in the system reaching the correct decision about the kind of assistance the user was requesting. Further testing is required with elderly voices and under different noise conditions to ensure the appropriateness of the technology. Future developments include integration of the system with an emergency detection method as well as communication enhancement using features such as barge-in capability.ConclusionThe use of an automated dialog-based PERS has the potential to provide users with more autonomy in decisions regarding their own health and more privacy in their own home.

[1]  Boaz Rafaely,et al.  Phase-mode versus delay-and-sum spherical microphone array processing , 2005, IEEE Signal Processing Letters.

[2]  Jwu-Sheng Hu,et al.  Robust Beamforming of Microphone Array Using Hinfinity Adaptive Filtering Technique , 2006, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[3]  M. Johnson,et al.  Home-screen: a short scale to measure fall risk in the home. , 2001, Public health nursing.

[4]  E. Porter Wearing and using personal emergency respone system buttons. , 2005, Journal of gerontological nursing.

[5]  Hong-Seok Kim,et al.  Performance of an HMM speech recognizer using a real-time tracking microphone array as input , 1999, IEEE Trans. Speech Audio Process..

[6]  Neil Johnson,et al.  A smart sensor to detect the falls of the elderly , 2004, IEEE Pervasive Computing.

[7]  M. Rantz,et al.  Aging in place: a new model for long-term care. , 2000, Nursing administration quarterly.

[8]  S. Rubin,et al.  Race and Sex Differences in Age‐Related Hearing Loss: The Health, Aging and Body Composition Study , 2005, Journal of the American Geriatrics Society.

[9]  A. Mihailidis,et al.  An intelligent emergency response system: Preliminary development and testing of a functional health monitoring system , 2006 .

[10]  M. Skubic,et al.  Older adults' attitudes towards and perceptions of ‘smart home’ technologies: a pilot study , 2004, Medical informatics and the Internet in medicine.

[11]  Dave Burke Voice Extensible Markup Language (VoiceXML) , 2007 .

[12]  Pascal Poupart,et al.  Partially Observable Markov Decision Processes with Continuous Observations for Dialogue Management , 2008, SIGDIAL.

[13]  Roberto Pieraccini,et al.  A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..

[14]  Alex Mihailidis,et al.  An intelligent emergency response system: preliminary development and testing of automated fall detection , 2005, Journal of telemedicine and telecare.

[15]  Norbert Noury,et al.  An Experimental Health Smart Home and Its Distributed Internet-based Information and Communication System: First Steps of a Research Project , 2001, MedInfo.

[16]  Daryle Gardner-Bonneau,et al.  Human Factors and Voice Interactive Systems , 1999 .

[17]  Luis Miguel Bergasa,et al.  A Navigation System for Assistant Robots Using Visually Augmented POMDPs , 2005, Auton. Robots.

[18]  W C Mann,et al.  Elder Acceptance of Health Monitoring Devices in the Home , 2002, Care Management Journals.

[19]  Paul Lamere,et al.  Sphinx-4: a flexible open source framework for speech recognition , 2004 .

[20]  S. Reinsch,et al.  Home Safety Intervention for the Prevention of Falls , 1994 .

[21]  M. Tinetti,et al.  Risk factors for falls among elderly persons living in the community. , 1988, The New England journal of medicine.

[22]  Tao Chen,et al.  Accent Issues in Large Vocabulary Continuous Speech Recognition , 2004, Int. J. Speech Technol..

[23]  Eileen J Porter Moments of Apprehension in the Midst of a Certainty: Some Frail Older Widows' Lives with a Personal Emergency Response System , 2003, Qualitative health research.

[24]  M. Gordon,et al.  Community care for the elderly: is it really better? , 1993, CMAJ : Canadian Medical Association journal = journal de l'Association medicale canadienne.

[25]  Kiyohiro Shikano,et al.  Elderly acoustic model for large vocabulary continuous speech recognition , 2001, INTERSPEECH.

[26]  Maurizio Omologo,et al.  Hidden Markov model training with contaminated speech material for distant-talking speech recognition , 2002, Comput. Speech Lang..

[27]  W. Mann,et al.  Use of Personal Emergency Response Systems by Older Individuals With Disabilities , 2005, Assistive technology : the official journal of RESNA.