Using robust audio and video processing technologies to alleviate the elderly cognitive decline

We are recently witnessing a growing interest for pervasive context-aware products and services for elderly users. This is largely due to falling fertility and rising longevity phenomena, as well as due to the proliferation of the aging population all over the world. In this paper we present a number of leading edge audio and video processing technologies, which can be exploited to build robust ambient assisted living applications for elderly groups. In particular, we discuss application requirements aiming at alleviating the cognitive decline of elderly users and present audio and video processing components that can essentially fulfill these requirements. We emphasize on technologies such as automatic speech recognition, speaker identification, face detection, person tracking, face identification, and demonstrate how mature versions of these technologies can be appropriately customized to give a significant boost to AAL applications for senior citizens. The challenges, solutions and ideas within this paper are part of the EU project HERMES, which aims at providing an integrated approach to cognitive care, based on assistive technology that reduces age-related decline of cognitive capabilities.

[1]  Bhuvana Ramabhadran,et al.  The IBM 2007 speech transcription system for European parliamentary speeches , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[2]  Ghassan O. Karame,et al.  2D and 3D face localization for complex scenes , 2007, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance.

[3]  Lawrence D. Stone,et al.  Bayesian Multiple Target Tracking , 1999 .

[4]  Vincent M. Stanford Pervasive Computing: Applications - Using Pervasive Computing to Deliver Elder Care , 2002, IEEE Distributed Syst. Online.

[5]  Aristodemos Pnevmatikakis,et al.  The AIT Multimodal Person Identification System for CLEAR 2007 , 2007, CLEAR.

[6]  John Soldatos,et al.  A breadboard architecture for pervasive context-aware services in smart spaces: middleware components and prototype applications , 2007, Personal and Ubiquitous Computing.

[7]  A. Pnevmatikakis,et al.  Robust Estimation of Background for Fixed Cameras , 2006, 2006 15th International Conference on Computing.

[8]  Douglas W. Oard Multilingual access to large spoken archives , 2003 .

[9]  Shrikanth S. Narayanan,et al.  Combining acoustic and language information for emotion recognition , 2002, INTERSPEECH.

[10]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[11]  Oswald Lanz,et al.  Approximate Bayesian multibody tracking , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[13]  James L. Crowley,et al.  Agent based middleware infrastructure for autonomous context-aware ubiquitous computing services , 2007, Comput. Commun..

[14]  Roberto Brunelli,et al.  A Generative Approach to Audio-Visual Person Tracking , 2006, CLEAR.

[15]  Gary Bradski,et al.  Computer Vision Face Tracking For Use in a Perceptual User Interface , 1998 .

[16]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[17]  N. Baumgartner A SURVEY OF UPPER ONTOLOGIES FOR SITUATION AWARENESS , 2006 .

[18]  Patrick Pérez,et al.  Data fusion for visual tracking with particles , 2004, Proceedings of the IEEE.

[19]  Aristodemos Pnevmatikakis,et al.  Combining Finite State Machines and LDA for Voice Activity Detection , 2007, AIAI.

[20]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[21]  Vincent M. Stanford,et al.  Pervasive Computing Goes to Work: Interfacing to the Enterprise , 2002, IEEE Pervasive Comput..