Multi-modal activity recognition from egocentric vision, semantic enrichment and lifelogging applications for the care of dementia

Abstract We describe a framework for lifelogging monitoring in the scope of dementia care, based on activity recognition from egocentric vision and semantic context-enrichment. As pure vision-based approaches appear to be already saturating in terms of recognition accuracy, we propose their enhancement with wearable bracelet accelerometer information. For that purpose, we design and study appropriate early and late fusion schemes to increase accuracy. The incorporation of mechanical variables, such as jerk, improves the recognition accuracy of activities that require fine motion. In addition, we describe a framework for semantic activity representation and interpretation, using Semantic Web technologies for building interoperable activity graphs. The system is personalized, as deployment-specific activity models are authored, while problems related to the disease are detected by rules. Complemented by lifelogging applications, the system is able to support interventions by clinicians, and endorse a feeling of safety and inclusion for end-users and their carers.

[1]  Liming Chen,et al.  Ontology-Enabled Activity Learning and Model Evolution in Smart Homes , 2010, UIC.

[2]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[3]  Claudio Bettini,et al.  COSAR: hybrid reasoning for context-aware activity recognition , 2011, Personal and Ubiquitous Computing.

[4]  Cordelia Schmid,et al.  Mining Visual Actions from Movies , 2009, BMVC.

[5]  Liming Chen,et al.  A Hybrid Ontological and Temporal Approach for Composite Activity Modelling , 2012, 2012 IEEE 11th International Conference on Trust, Security and Privacy in Computing and Communications.

[6]  Yong Jae Lee,et al.  Predicting Important Objects for Egocentric Video Summarization , 2015, International Journal of Computer Vision.

[7]  Jenny Benois-Pineau,et al.  Strategies for multiple feature fusion with Hierarchical HMM: Application to activity recognition from wearable audiovisual sensors , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[8]  Yongdong Zhang,et al.  Parallel deblocking filter for HEVC on many-core processor , 2014 .

[9]  Claudio Bettini,et al.  OWL 2 modeling and reasoning with complex human activities , 2011, Pervasive Mob. Comput..

[10]  Simone Calderara,et al.  Understanding social relationships in egocentric vision , 2015, Pattern Recognit..

[11]  Georgios Meditskos,et al.  Pervasive and Mobile Computing ( ) – Pervasive and Mobile Computing Semantic Web Technologies in Pervasive Computing: a Survey and Research Roadmap , 2022 .

[12]  Thanos G. Stavropoulos,et al.  DemaWare2: Integrating sensors, multimedia and semantic analysis for the ambient care of dementia , 2017, Pervasive Mob. Comput..

[13]  Deva Ramanan,et al.  Detecting activities of daily living in first-person camera views , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Sebastian Rudolph,et al.  Modeling in OWL 2 without Restrictions , 2012, OWLED.

[15]  Koutraki Maria,et al.  S-CRETA: Smart Classroom Real-Time Assistance , 2012, isami 2012.

[16]  Georges Quénot,et al.  Hierarchical Late Fusion for Concept Detection in Videos , 2014, Fusion in Computer Vision.

[17]  Boris Motik,et al.  OWL 2: The next step for OWL , 2008, J. Web Semant..

[18]  Jeff Z. Pan,et al.  Resource Description Framework , 2020, Definitions.

[19]  James M. Rehg,et al.  A Scalable Approach to Activity Recognition based on Object Use , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[20]  T. Flash,et al.  The coordination of arm movements: an experimentally confirmed mathematical model , 1985, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[21]  Matthias Rauterberg,et al.  The Evolution of First Person Vision Methods: A Survey , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[23]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[24]  Young-Koo Lee,et al.  OWL-Based User Preference and Behavior Routine Ontology for Ubiquitous System , 2005, OTM Conferences.

[25]  Alan F. Smeaton,et al.  Combining wearable sensors for location-free monitoring of gait in older people , 2012, J. Ambient Intell. Smart Environ..

[26]  Miguel A. Labrador,et al.  A Survey on Human Activity Recognition using Wearable Sensors , 2013, IEEE Communications Surveys & Tutorials.

[27]  Martin Hepp,et al.  Using SPARQL and SPIN for Data Quality Management on the Semantic Web , 2010, BIS.

[28]  Ramesh C. Jain,et al.  Objective Self , 2014, IEEE Multim..

[29]  Chris D. Nugent,et al.  A Knowledge-Driven Approach to Activity Recognition in Smart Homes , 2012, IEEE Transactions on Knowledge and Data Engineering.

[30]  Stephen J. McKenna,et al.  Combining embedded accelerometers with computer vision for recognizing food preparation activities , 2013, UbiComp.

[31]  Bernt Schiele,et al.  A tutorial on human activity recognition using body-worn inertial sensors , 2014, CSUR.

[32]  Yannick Berthoumieu,et al.  Multiple Feature Fusion Based on Co-Training Approach and Time Regularization for Place Classification in Wearable Video , 2013, Adv. Multim..

[33]  Georgios Meditskos,et al.  Semantic Event Fusion of Different Visual Modality Concepts for Activity Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Yongdong Zhang,et al.  A Highly Parallel Framework for HEVC Coding Unit Partitioning Tree Decision on Many-core Processors , 2014, IEEE Signal Processing Letters.

[35]  Georgios Meditskos,et al.  Knowledge-Driven Activity Recognition and Segmentation Using Context Connections , 2014, International Semantic Web Conference.

[36]  Ali Farhadi,et al.  Understanding egocentric activities , 2011, 2011 International Conference on Computer Vision.

[37]  Jenq-Neng Hwang,et al.  A Review on Video-Based Human Activity Recognition , 2013, Comput..

[38]  Alexandra König,et al.  Validation of an automatic video monitoring system for the detection of instrumental activities of daily living in dementia patients. , 2015, Journal of Alzheimer's disease : JAD.

[39]  Walterio W. Mayol-Cuevas,et al.  High level activity recognition using low resolution wearable vision , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[40]  Thanos G. Stavropoulos,et al.  A Novel and Intelligent Home Monitoring System for Care Support of Elders with Cognitive Impairment. , 2016, Journal of Alzheimer's disease : JAD.

[41]  Zhiwen Yu,et al.  Three-dimensional image-based human pose recovery with hypergraph regularized autoencoders , 2017, Multimedia Tools and Applications.

[42]  James M. Rehg,et al.  Learning to Recognize Daily Actions Using Gaze , 2012, ECCV.

[43]  Jean-François Dartigues,et al.  Recognition of Instrumental Activities of Daily Living in Egocentric Video for Activity Monitoring of Patients with Dementia , 2015, Health Monitoring and Personalized Feedback using Multimedia Data.

[44]  Zhenyu He,et al.  Activity recognition from acceleration data based on discrete consine transform and SVM , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[45]  Antonio Iera,et al.  The Internet of Things: A survey , 2010, Comput. Networks.

[46]  Carlo S. Regazzoni,et al.  Optimizing Superpixel Clustering for Real-Time Egocentric-Vision Applications , 2015, IEEE Signal Processing Letters.

[47]  Mitsuru Ikeda,et al.  Activity Recognition Using Context-Aware Infrastructure Ontology in Smart Home Domain , 2012, 2012 Seventh International Conference on Knowledge, Information and Creativity Support Systems.

[48]  Hui Wang,et al.  Ontology-Based Learning Framework for Activity Assistance in an Adaptive Smart Home , 2011 .

[49]  Chris D. Nugent,et al.  Ontology-based activity recognition in intelligent pervasive environments , 2009, Int. J. Web Inf. Syst..

[50]  Jenny Benois-Pineau,et al.  Modeling instrumental activities of daily living in egocentric vision as sequences of active objects and context for alzheimer disease research , 2013, MIIRH '13.

[51]  Jenny Benois-Pineau,et al.  Hierarchical Hidden Markov Model in detecting activities of daily living in wearable videos for studies of dementia , 2011, Multimedia Tools and Applications.

[52]  Véronique Malaisé,et al.  Design and use of the Simple Event Model (SEM) , 2011, J. Web Semant..

[53]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[54]  Dieter Fensel,et al.  Knowledge Engineering: Principles and Methods , 1998, Data Knowl. Eng..

[55]  Vittorio Murino,et al.  Semi-supervised multi-feature learning for person re-identification , 2013, 2013 10th IEEE International Conference on Advanced Video and Signal Based Surveillance.