Making Emergency Calls More Accessible to Older Adults Through a Hands-free Speech Interface in the House

Wearable personable emergency response (PER) systems are the mainstream solution for allowing frail and isolated individuals to call for help in an emergency. However, these devices are not well adapted to all users and are often not worn all the time, meaning they are not available when needed. This article presents a Voice User Interface system for emergency-call recognition. The interface is designed to permit hands-free interaction using natural language. Crucially, this allows a call for help to be registered without necessitating physical proximity to the system. The system is based on an ASR engine and is tested on a corpus collected to simulate realistic situations. The corpus contains French speech from 4 older adults and 13 younger people wearing an old-age simulator to hamper their mobility, vision, and hearing. On-line evaluation of the preliminary system showed an emergency-call error rate of 27%. Subsequent off-line experimentation improved the results (call error rate 24%), demonstrating that emergency-call recognition in the home is achievable. Another contribution of this work is the corpus, which is made available for research with the hope that it will facilitate related research and quicker development of robust methods for automatic emergency-call recognition in the home.

[1]  Michel Vacher,et al.  How affects can perturbe the automatic speech recognition of domotic interactions , 2013 .

[2]  L. Gillespie,et al.  Preventing falls in elderly people , 2004, BMJ : British Medical Journal.

[3]  Michel Vacher,et al.  SVM-Based Multimodal Classification of Activities of Daily Living in Health Smart Homes: Sensors, Algorithms, and First Experimental Results , 2010, IEEE Transactions on Information Technology in Biomedicine.

[4]  Jean-François Bonastre,et al.  ALIZE, a free toolkit for speaker recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[5]  Jeffrey M. Hausdorff,et al.  Evaluation of Accelerometer-Based Fall Detection Algorithms on Real-World Falls , 2012, PloS one.

[6]  Heidi Christensen,et al.  Perspectives on Speech and Language Interaction for Daily Assistive Technology , 2015, ACM Trans. Access. Comput..

[7]  Jon Barker,et al.  The second ‘chime’ speech separation and recognition challenge: Datasets, tasks and baselines , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Stephen R Lord,et al.  Epidemiology of Emergency Medical Service Responses to Older People Who Have Fallen: A Prospective Cohort Study , 2014, Prehospital emergency care : official journal of the National Association of EMS Physicians and the National Association of State EMS Directors.

[9]  Michel Vacher,et al.  CIRDO: Smart companion for helping elderly to live at home for longer ☆ , 2014 .

[10]  N. Noury,et al.  Evaluation under real-life conditions of a stand-alone fall detector for the elderly subjects. , 2011, Annals of physical and rehabilitation medicine.

[11]  Maurizio Omologo,et al.  On the selection of the impulse responses for distant-speech recognition based on contaminated speech training , 2014, INTERSPEECH.

[12]  Guillaume Gravier,et al.  Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News , 2004, LREC.

[13]  Nara L. Newcomer,et al.  User Centered Design , 2014, Encyclopedia of Database Systems.

[14]  Randi Stokke,et al.  The Personal Emergency Response System as a Technology Innovation in Primary Health Care Services : An Integrative Review , 2016 .

[15]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[16]  Pierre Zweigenbaum,et al.  The Quaero French Medical Corpus : A Ressource for Medical Entity Recognition and Normalization , 2014 .

[17]  Virginia Hessels Ba,et al.  Advances in Personal Emergency Response and Detection Systems , 2011 .

[18]  Michel Vacher,et al.  Information extraction from sound for medical telemonitoring , 2006, IEEE Transactions on Information Technology in Biomedicine.

[19]  C. Becker,et al.  Satisfaction and use of personal emergency response systems , 2010, Zeitschrift für Gerontologie und Geriatrie.

[20]  Marc-Eric Bobillier Chaumon,et al.  Detecting Falls at Home: User-Centered Design of a Pervasive Technology , 2016 .

[21]  L Nyberg,et al.  Comparison of real-life accidental falls in older people with experimental falls in middle-aged test subjects. , 2012, Gait & posture.

[22]  Jonathan Grudin,et al.  Design and evaluation , 1995 .

[23]  Fabio Brugnara,et al.  Towards age-independent acoustic modeling , 2009, Speech Commun..

[24]  Kallirroi Georgila,et al.  The MATCH corpus: a corpus of older and younger users’ interactions with spoken dialogue systems , 2010, Lang. Resour. Evaluation.

[25]  Michel Vacher,et al.  Speech recognition of aged voice in the AAL context: Detection of distress sentences , 2013, 2013 7th Conference on Speech Technology and Human - Computer Dialogue (SpeD).

[26]  Florence Cros,et al.  The user-centered design of an ambient technology for preventing falls at home , 2014 .

[27]  Hee-Cheol Kim,et al.  A questionnaire study for the design of smart home for the elderly , 2006, HEALTHCOM 2006 8th International Conference on e-Health Networking, Applications and Services.

[28]  Michel Vacher,et al.  Influence of Expressive Speech on ASR Performances: Application to Elderly Assistance in Smart Home , 2016, TSD.

[29]  Ramón López-Cózar,et al.  Designing smart home interfaces for the elderly , 2009, ASAC.

[30]  Patricia A McLean,et al.  Exploring Older Adults' Perceptions of the Utility and Ease of Use of Personal Emergency Response Systems. , 2017, Journal of National Black Nurses' Association : JNBNA.

[31]  Michel Vacher,et al.  SVM-Based Multi-Modal Classification of Activities of Daily Living in Health Smart Homes: Sensors, Algorithms and First Experimental Results , 2019 .

[32]  Alex Mihailidis,et al.  Development of an automated speech recognition interface for personal emergency response systems , 2009, Journal of NeuroEngineering and Rehabilitation.

[33]  Michel Vacher,et al.  Recognition of Distress Calls in Distant Speech Setting: a Preliminary Experiment in a Smart Home , 2015, SLPAT@Interspeech.

[34]  Gérard Chollet,et al.  Efficient Gaussian Mixture for Speech Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[35]  VirtanenTuomas,et al.  Detection and Classification of Acoustic Scenes and Events , 2018 .

[36]  Michel Vacher,et al.  Development of Audio Sensing Technology for Ambient Assisted Living: Applications and Challenges , 2011, Int. J. E Health Medical Commun..

[37]  Michel Vacher,et al.  Recognition of voice commands by multisource ASR and noise cancellation in a smart home environment , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[38]  Francesco Piazza,et al.  An integrated system for voice command recognition and emergency detection based on audio signals , 2015, Expert Syst. Appl..

[39]  John McDonough,et al.  Distant Speech Recognition , 2009 .

[40]  Carmen D Dirksen,et al.  Literature review on monitoring technologies and their outcomes in independently living elderly people , 2015, Disability and rehabilitation. Assistive technology.

[41]  Michel Vacher,et al.  Speech and Sound Use in a Remote Monitoring System for Health Care , 2006, TSD.

[42]  Brigitte Meillon,et al.  Evaluation of a Context-Aware Voice Interface for Ambient Assisted Living , 2015, ACM Trans. Access. Comput..

[43]  Jacqueline Laures-Gore,et al.  Acoustic-perceptual correlates of voice quality in elderly men and women. , 2006, Journal of communication disorders.

[44]  A. Mihailidis,et al.  Exploratory analysis of real personal emergency response call conversations: considerations for personal emergency response spoken dialogue systems , 2016, Journal of NeuroEngineering and Rehabilitation.

[45]  R. Halfens,et al.  Involvement of older people in the development of fall detection systems: a scoping review , 2016, BMC Geriatrics.

[46]  J. S. Stevenson,et al.  Older Adults , 1980, Suicide Prevention.

[47]  Björn W. Schuller,et al.  A Real-Time Speech Enhancement Framework in Noisy and Reverberated Acoustic Scenarios , 2012, Cognitive Computation.

[48]  Alain Colvez,et al.  Mieux prévenir les chutes chez les personnes âgées , 2016 .

[49]  J. Wei,et al.  PTCA in the Elderly: The “Young‐Old” versus the “Old‐Old” , 1992, Journal of the American Geriatrics Society.

[50]  Mari Zakrzewski,et al.  Probing a Proactive Home : Challenges in Researching and Designing Everyday Smart Environments , 2006 .

[51]  Jaco Badenhorst,et al.  The limitations of data perturbation for ASR of learner data in under-resourced languages , 2017, 2017 Pattern Recognition Association of South Africa and Robotics and Mechatronics (PRASA-RobMech).

[52]  Maxine Eskénazi,et al.  BREF, a large vocabulary spoken corpus for French , 1991, EUROSPEECH.

[53]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[54]  Phil D. Green,et al.  From WER and RIL to MER and WIL: improved evaluation measures for connected speech recognition , 2004, INTERSPEECH.

[55]  Eric Campo,et al.  A review of smart homes - Present state and future challenges , 2008, Comput. Methods Programs Biomed..

[56]  Michel Vacher,et al.  CirdoX: an on/off-line multisource speech and sound analysis software , 2016, LREC.

[57]  Heidi Christensen,et al.  Adaptive speech recognition and dialogue management for users with speech disorders , 2014, INTERSPEECH.

[58]  Brigitte Meillon,et al.  Design and evaluation of a smart home voice interface for the elderly: acceptability and objection aspects , 2011, Personal and Ubiquitous Computing.

[59]  Brigitte Meillon,et al.  The Sweet-Home speech and multimodal corpus for home automation interaction , 2014, LREC.

[60]  Ana Llena-Nozal,et al.  La prise en charge de la dépendance , 2011 .

[61]  van Jemh Annelies Bronswijk Human rights of older persons , 2011 .

[62]  Jon Barker,et al.  An analysis of environment, microphone and data simulation mismatches in robust speech recognition , 2017, Comput. Speech Lang..

[63]  Andreas Wendemuth,et al.  Towards Robust Spontaneous Speech Recognition with Emotional Speech Adapted Acoustic Models , 2012 .

[64]  M. Skubic,et al.  Older adults' attitudes towards and perceptions of ‘smart home’ technologies: a pilot study , 2004, Medical informatics and the Internet in medicine.

[65]  Alex Mihailidis,et al.  The CARES corpus: a database of older adult actor simulated emergency dialogue for developing a personal emergency response system , 2013, Int. J. Speech Technol..

[66]  Christina R. Victor,et al.  Use of personal call alarms among community-dwelling older people , 2012, Ageing and Society.

[67]  Michel Vacher,et al.  The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People , 2016, LREC.

[68]  Mark J. F. Gales,et al.  Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..

[69]  Bernadette Dorizzi,et al.  Daily sound recognition using a combination of GMM and SVM for home automation , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[70]  Blockin,et al.  Vocal Expression of Emotion , 2004 .

[71]  Kiyohiro Shikano,et al.  Acoustic models of the elderly for large‐vocabulary continuous speech recognition , 2004 .

[72]  Steve Renals,et al.  Longitudinal study of ASR performance on ageing voices , 2008, INTERSPEECH.

[73]  Bart Vanrumste,et al.  Self-taught assistive vocal interfaces: an overview of the ALADIN project , 2013, INTERSPEECH.

[74]  Siti Salwah Salim,et al.  Exploring the influence of general and specific factors on the recognition accuracy of an ASR system for dysarthric speaker , 2015, Expert Syst. Appl..

[75]  Bonifaz Kaufmann,et al.  "Oh, I Say, Jeeves!" A Calm Approach to Smart Home Input , 2013, CHI-KDD.

[76]  Björn W. Schuller,et al.  Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge , 2011, Speech Commun..

[77]  Kai Feng,et al.  The subspace Gaussian mixture model - A structured model for speech recognition , 2011, Comput. Speech Lang..

[78]  Michel Vacher,et al.  On-line human activity recognition from audio and home automation sensors: Comparison of sequential and non-sequential models in realistic Smart Homes , 2016, J. Ambient Intell. Smart Environ..

[79]  Kaisa Väänänen,et al.  Evolution towards smart home environments: empirical evaluation of three user interfaces , 2004, Personal and Ubiquitous Computing.

[80]  Michel Vacher,et al.  Distant Speech Recognition in a Smart Home: Comparison of Several Multisource ASRs in Realistic Conditions , 2011, INTERSPEECH.