Mobile activity recognition for a whole day: recognizing real nursing activities with big dataset

In this paper, we provide a real nursing data set for mobile activity recognition that can be used for supervised machine learning, and big data combined the patient medical records and sensors attempted for 2 years, and also propose a method for recognizing activities for a whole day utilizing prior knowledge about the activity segments in a day. Furthermore, we demonstrate data mining by applying our method to the bigger data with additional hospital data. In the proposed method, we 1) convert a set of segment timestamps into a prior probability of the activity segment by exploiting the concept of importance sampling, 2) obtain the likelihood of traditional recognition methods for each local time window within the segment range, and, 3) apply Bayesian estimation by marginalizing the conditional probability of estimating the activities for the segment samples. By evaluating with the dataset, the proposed method outperformed the traditional method without using the prior knowledge by 25.81% at maximum by balanced classification rate. Moreover, the proposed method significantly reduces duration errors of activity segments from 324.2 seconds of the traditional method to 74.6 seconds at maximum. We also demonstrate the data mining by applying our method to bigger data in a hospital.

[1]  Taghi M. Khoshgoftaar,et al.  Comparing Boosting and Bagging Techniques With Noisy and Imbalanced Data , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[2]  Young-Koo Lee,et al.  Semi-Markov conditional random fields for accelerometer-based activity recognition , 2010, Applied Intelligence.

[3]  M. Panella,et al.  Reducing clinical variations with clinical pathways: do pathways work? , 2003, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[4]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[5]  Gwenn Englebienne,et al.  Accurate activity recognition in a home setting , 2008, UbiComp.

[6]  James Church,et al.  Wearable sensor badge and sensor jacket for context awareness , 1999, Digest of Papers. Third International Symposium on Wearable Computers.

[7]  Hisashi Kashima,et al.  Roughly balanced bagging for imbalanced data , 2009, Stat. Anal. Data Min..

[8]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[9]  Kris Vanhaecht,et al.  Systematic review: indicators to evaluate effectiveness of clinical pathways for gastrointestinal surgery. , 2008, Journal of evaluation in clinical practice.

[10]  Mi Zhang,et al.  A feature selection-based framework for human activity recognition using wearable multimodal sensors , 2011, BODYNETS.

[11]  Taghi M. Khoshgoftaar,et al.  An Empirical Study of Learning from Imbalanced Data Using Random Forest , 2007, 19th IEEE International Conference on Tools with Artificial Intelligence(ICTAI 2007).

[12]  Futoshi Naya,et al.  Workers' Routine Activity Recognition using Body Movements and Location Information , 2006, 2006 10th IEEE International Symposium on Wearable Computers.

[13]  Nobuhiko Nishio,et al.  HASC Challenge: gathering large scale human activity corpus for the real-world activity understandings , 2011, AH '11.

[14]  Mi Zhang,et al.  Motion primitive-based human activity recognition using a bag-of-features approach , 2012, IHI '12.

[15]  Ana Franco,et al.  Chunking or not chunking? How do we find words in artificial language learning? , 2012, Advances in cognitive psychology.

[16]  Jesús Favela,et al.  Monitoring behavioral patterns in hospitals through activity-aware computing , 2008, 2008 Second International Conference on Pervasive Computing Technologies for Healthcare.

[17]  J. McQueen Segmentation of Continuous Speech Using Phonotactics , 1998 .

[18]  Roman Klinger,et al.  Classical Probabilistic Models and Conditional Random Fields , 2007 .

[19]  Kris Demuynck,et al.  A Comparison of Different Approaches to Automatic Speech Segmentation , 2002, TSD.

[20]  David J. Hand,et al.  A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems , 2001, Machine Learning.

[21]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[22]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[23]  Angelo M. Sabatini,et al.  Machine Learning Methods for Classifying Human Physical Activity from On-Body Accelerometers , 2010, Sensors.

[24]  Venet Osmani,et al.  Human activity recognition in pervasive health-care: Supporting efficient remote collaboration , 2008, J. Netw. Comput. Appl..

[25]  Chris North,et al.  Visualizing Biological Pathways: Requirements Analysis, Systems Evaluation and Research Agenda , 2005, Inf. Vis..

[26]  Tianshi Chen,et al.  Towards Maximizing the Area Under the ROC Curve for Multi-Class Classification Problems , 2011, AAAI.

[27]  Fabio Tozeto Ramos,et al.  Multi-scale Conditional Random Fields for first-person activity recognition , 2014, 2014 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[28]  Héctor Pomares,et al.  A benchmark dataset to evaluate sensor displacement in activity recognition , 2012, UbiComp.

[29]  Diane J. Cook,et al.  Human Activity Recognition and Pattern Discovery , 2010, IEEE Pervasive Computing.

[30]  Gary M. Weiss,et al.  Activity recognition using cell phone accelerometers , 2011, SKDD.

[31]  Manuela M. Veloso,et al.  Feature selection in conditional random fields for activity recognition , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[32]  Thomas Rotter,et al.  Clinical pathways: effects on professional practice, patient outcomes, length of stay and hospital costs. , 2010, The Cochrane database of systematic reviews.

[33]  Kent Larson,et al.  Real-Time Recognition of Physical Activities and Their Intensities Using Wireless Accelerometers and a Heart Rate Monitor , 2007, 2007 11th IEEE International Symposium on Wearable Computers.

[34]  Lie Lu,et al.  Speech segmentation without speech recognition , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[35]  Jesse Hoey,et al.  Sensor-Based Activity Recognition , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[36]  Abdul V. Roudsari,et al.  Computerization of workflows, guidelines, and care pathways: a review of implementation challenges for process-oriented health information systems , 2011, J. Am. Medical Informatics Assoc..

[37]  Emiliano Miluzzo,et al.  A survey of mobile phone sensing , 2010, IEEE Communications Magazine.

[38]  Thomas G. Dietterich Machine Learning for Sequential Data: A Review , 2002, SSPR/SPR.

[39]  Tom Fawcett,et al.  ROC Graphs: Notes and Practical Considerations for Researchers , 2007 .

[40]  Manuela M. Veloso,et al.  Conditional random fields for activity recognition , 2007, AAMAS '07.

[41]  Mehryar Mohri,et al.  Discriminative Topic Segmentation of Text and Speech , 2010, AISTATS.

[42]  Ricardo Chavarriaga,et al.  The Opportunity challenge: A benchmark database for on-body sensor-based activity recognition , 2013, Pattern Recognit. Lett..

[43]  Go Hirakawa,et al.  A Large Scale Gathering System for Activity Data with Mobile Sensors , 2011, 2011 15th Annual International Symposium on Wearable Computers.

[44]  Paul Lukowicz,et al.  Opportunistic human activity and context recognition , 2013, Computer.

[45]  Miguel Á. Carreira-Perpiñán,et al.  Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[46]  Xu Sun,et al.  Large Scale Real-Life Action Recognition Using Conditional Random Fields with Stochastic Training , 2011, PAKDD.

[47]  Alexander R. Pico,et al.  Finding the Right Questions: Exploratory Pathway Analysis to Enhance Biological Discovery in Large Datasets , 2010, PLoS biology.

[48]  Paul Lukowicz,et al.  Collecting complex activity datasets in highly rich networked sensor environments , 2010, 2010 Seventh International Conference on Networked Sensing Systems (INSS).

[49]  Andreas Savvides,et al.  Extracting spatiotemporal human activity patterns in assisted living using a home sensor network , 2008, PETRA '08.

[50]  Paul Lukowicz,et al.  Activity Recognition of Assembly Tasks Using Body-Worn Microphones and Accelerometers , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.