Speech analysis for Ambient Assisted Living : technical and user design of a vocal order system

Evolution of ICT led to the emergence of smart home. A Smart Home consists in a home equipped with data-processing technology which anticipates the needs of its inhabitant while trying to maintain their comfort and their safety by action on the house and by implementing connections with the outside world. Therefore, smart homes equipped with ambient intelligence technology constitute a promising direction to enable the growing number of elderly to continue to live in their own homes as long as possible. However, the technological solutions requested by this part of the population have to suit their specific needs and capabilities. It is obvious that these Smart Houses tend to be equipped with devices whose interfaces are increasingly complex and become difficult to control by the user. The people the most likely to benefit from these new technologies are the people in loss of autonomy such as the disabled people or the elderly which cognitive deficiencies (Alzheimer). Moreover, these people are the less capable of using the complex interfaces due to their handicap or their lack ICT understanding. Thus, it becomes essential to facilitate the daily life and the access to the whole home automation system through the smart home. The usual tactile interfaces should be supplemented by accessible interfaces, in particular, thanks to a system reactive to the voice ; these interfaces are also useful when the person cannot move easily. Vocal orders will allow the following functionality: - To ensure an assistance by a traditional or vocal order. - To set up a indirect order regulation for a better energy management. - To reinforce the link with the relatives by the integration of interfaces dedicated and adapted to the person in loss of autonomy. - To ensure more safety by detection of distress situations and when someone is breaking in the house. This chapter will describe the different steps which are needed for the conception of an audio ambient system. The first step is related to the acceptability and the objection aspects by the end users and we will report a user evaluation assessing the acceptance and the fear of this new technology. The experience aimed at testing three important aspects of speech interaction: voice command, communication with the outside world, home automation system interrupting a person's activity. The experiment was conducted in a smart home with a voice command using a Wizard of OZ technique and gave information of great interest. The second step is related to a general presentation of the audio sensing technology for ambient assisted living. Different aspect of sound and speech processing will be developed. The applications and challenges will be presented. The third step is related to speech recognition in the home environment. Automatic Speech Recognition systems (ASR) have reached good performances with close talking microphones (e.g., head-set), but the performances decrease significantly as soon as the microphone is moved away from the mouth of the speaker (e.g., when the microphone is set in the ceiling). This deterioration is due to a broad variety of effects including reverberation and presence of undetermined background noise such as TV radio and, devices. This part will present a system of vocal order recognition in distant speech context. This system was evaluated in a dedicated flat thanks to some experiments. This chapter will then conclude with a discussion on the interest of the speech modality concerning the Ambient Assisted Living.

[1]  Douglas D. O'Shaughnessy,et al.  Blind Speech Separation in Multiple Environments Using a Frequency Oriented PCA Method for Convolutive Mixtures , 2011, INTERSPEECH.

[2]  N. Sharkey,et al.  Granny and the robots: ethical issues in robot care for the elderly , 2012, Ethics and Information Technology.

[3]  Paul Lukowicz,et al.  Design and Evaluation of a Sound Based Water Flow Measurement System , 2008, EuroSSC.

[4]  Brigitte Meillon,et al.  The sweet-home project: Audio technology in smart homes to improve well-being and reliance , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[5]  William G. Cowley,et al.  Adaptive Blocking Beamformer for Speech Separation , 2011, INTERSPEECH.

[6]  Michel Vacher,et al.  Fusion of Audio and Temporal Multimodal Data by Spreading Activation for Dweller Localisation in a Smart Home , 2011, AMI 2011.

[7]  Barbara Prazak-Aram,et al.  Requirements and Ethical Issues for Sensor-Augmented Environments in Elderly Care , 2007, HCI.

[8]  Christophe Kolski,et al.  Démarche centrée utilisateur pour la conception de SIAD basés sur un processus d'ECD, application dans le domaine de la santé , 2010 .

[9]  Klaus-Peter Engelbrecht,et al.  Study of a Speech-based Smart Home System with Older Users , 2008 .

[10]  Michel Vacher,et al.  Determining useful sensors for automatic recognition of activities of daily living in health smart home , 2009 .

[11]  Lorna Lines,et al.  Multiple voices, multiple choices: Older adults??? evaluation of speech output to support independent living , 2006 .

[12]  Haizhou Li,et al.  Sound event classification based on Feature Integration, Recursive Feature Elimination and Structured Classification , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Martina Ziefle,et al.  Technology acceptability for medical assistance , 2010, 2010 4th International Conference on Pervasive Computing Technologies for Healthcare.

[14]  Ramón López-Cózar,et al.  Designing smart home interfaces for the elderly , 2009, ASAC.

[15]  Alex Mihailidis,et al.  Development of an automated speech recognition interface for personal emergency response systems , 2009, Journal of NeuroEngineering and Rehabilitation.

[16]  Haizhou Li,et al.  Alternative Frequency Scale Cepstral Coefficient for Robust Sound Event Recognition , 2011, INTERSPEECH.

[17]  Jonathan G. Fiscus,et al.  A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER) , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[18]  Alain Dufaux Detection and Recognition of Impulsive Sound Signals , 2001 .

[19]  Marjorie Skubic,et al.  An acoustic fall detector system that uses sound height information to reduce the false alarm rate , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[20]  W. Keith Edwards,et al.  At Home with Ubiquitous Computing: Seven Challenges , 2001, UbiComp.

[21]  Iván Durán-Díaz,et al.  Generalized Method for Solving the Permutation Problem in Frequency-Domain Blind Source Separation of Convolved Speech Signals , 2011, INTERSPEECH.

[22]  Georges Linarès,et al.  Imperfect transcript driven speech recognition , 2006, INTERSPEECH.

[23]  Israel Gannot,et al.  Fall detection of elderly through floor vibrations and sound , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[24]  Michel Vacher,et al.  Speech and Sound Use in a Remote Monitoring System for Health Care , 2006, TSD.

[25]  Jacqueline Laures-Gore,et al.  Acoustic-perceptual correlates of voice quality in elderly men and women. , 2006, Journal of communication disorders.

[26]  M. Skubic,et al.  Older adults' attitudes towards and perceptions of ‘smart home’ technologies: a pilot study , 2004, Medical informatics and the Internet in medicine.

[27]  Juan Carlos Augusto,et al.  Past, Present and Future of Ambient Intelligence and Smart Environments , 2009, ICAART.

[28]  Mari Zakrzewski,et al.  Probing a Proactive Home : Challenges in Researching and Designing Everyday Smart Environments , 2006 .

[29]  Brigitte Meillon,et al.  Design and evaluation of a smart home voice interface for the elderly: acceptability and objection aspects , 2011, Personal and Ubiquitous Computing.

[30]  Michel Vacher,et al.  SVM-Based Multimodal Classification of Activities of Daily Living in Health Smart Homes: Sensors, Algorithms, and First Experimental Results , 2010, IEEE Transactions on Information Technology in Biomedicine.

[31]  Steve Renals,et al.  Ageing Voices: The Effect of Changes in Voice Parameters on ASR Performance , 2010, EURASIP J. Audio Speech Music. Process..

[32]  Björn W. Schuller,et al.  Learning New Acoustic Events in an HMM-Based System Using MAP Adaptation , 2011, INTERSPEECH.

[33]  Panos Markopoulos,et al.  Ambient intelligence, ethics and privacy , 2007 .

[34]  John McDonough,et al.  Distant Speech Recognition , 2009 .

[35]  Marjorie Skubic,et al.  TigerPlace, A State-Academic-Private Project to Revolutionize Traditional Long-Term Care , 2008, Journal of housing for the elderly.

[36]  Michael A. Cowling,et al.  Non-Speech Environmental Sound Classification System for Autonomous Surveillance , 2004 .

[37]  Michel Vacher,et al.  Information extraction from sound for medical telemonitoring , 2006, IEEE Transactions on Information Technology in Biomedicine.

[38]  Francis Jambon,et al.  Evaluation des systèmes mobiles et ubiquitaires: proposition de méthodologie et retours d'expérience , 2014 .

[39]  J. Rodin Aging and health: effects of the sense of control. , 1986, Science.

[40]  Georges Linarès,et al.  Principes et performances du décodeur parole continue Speeral , 2002 .

[41]  Kaisa Väänänen,et al.  Evolution towards smart home environments: empirical evaluation of three user interfaces , 2004, Personal and Ubiquitous Computing.

[42]  Ning Liu,et al.  Bathroom Activity Monitoring Based on Sound , 2005, Pervasive.

[43]  Steve Renals,et al.  Longitudinal study of ASR performance on ageing voices , 2008, INTERSPEECH.

[44]  Jay G. Wilpon,et al.  A study of speech recognition for children and the elderly , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[45]  Fabio Brugnara,et al.  Towards age-independent acoustic modeling , 2009, Speech Commun..

[46]  Michel Vacher,et al.  Sound Classification in a Smart Room Environment: an Approach using GMM and HMM Methods , 2007 .

[47]  Hee-Cheol Kim,et al.  A questionnaire study for the design of smart home for the elderly , 2006, HEALTHCOM 2006 8th International Conference on e-Health Networking, Applications and Services.

[48]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[49]  Michel Vacher,et al.  Development of Audio Sensing Technology for Ambient Assisted Living: Applications and Challenges , 2011, Int. J. E Health Medical Commun..

[50]  Vincent Rialle,et al.  What Do Family Caregivers of Alzheimer’s Disease Patients Desire in Smart Home Technologies? , 2009, Methods of Information in Medicine.

[51]  G. ÓLaighin,et al.  A proposal for the classification and evaluation of fall detectors Une proposition pour la classification et l'évaluation des détecteurs de chutes , 2008 .

[52]  Michel Vacher,et al.  Preliminary evaluation of speech/sound recognition for telemedicine application in a real environment , 2008, INTERSPEECH.

[53]  Richard M. Stern,et al.  The 1997 CMU Sphinx-3 English Broadcast News Transcription System , 1997 .

[54]  Michel Vacher,et al.  Sound detection and classification through transient models usingwavelet coefficient trees , 2004, 2004 12th European Signal Processing Conference.

[55]  Georges Linarès,et al.  Generalized driven decoding for speech recognition system combination , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.