Semi-Markov kMeans Clustering and Activity Recognition from Body-Worn Sensors

Subsequence clustering aims to find patterns that appear repeatedly in time series data. We introduce a novel subsequence clustering technique that we call semi-Markov kmeans clustering. The clustering results in ideal examples of the repeating patterns and in labeled segmentations that can be used as training data for sophisticated discriminative methods like max-margin semi-Markov models. We are applying the new clustering technique to activity recognition from body-worn sensors by showing how it can enable a system to learn from data that is only annotated by an ordered list of activity types that have been undertaken. This kind of annotation, unlike a detailed segmentation of the sensor data, is easily provided by a non-expert user. We show that we can achieve equally good results using only an ordered list of activity types for training as when using a full detailed labeled segmentation.

[1]  George Nagy,et al.  In search of meaning for time series subsequence clustering: matching algorithms based on a new distance measure , 2006, CIKM '06.

[2]  R. Coifman,et al.  Local feature extraction and its applications using a library of bases , 1994 .

[3]  Michel Verleysen,et al.  Unfolding preprocessing for meaningful time series clustering , 2006, Neural Networks.

[4]  Jason R. Chen Making subsequence time series clustering meaningful , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[5]  Eamonn J. Keogh,et al.  Clustering of time-series subsequences is meaningless: implications for previous and future research , 2004, Knowledge and Information Systems.

[6]  Li Wang,et al.  Discriminative human action segmentation and recognition using semi-Markov model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Alexander J. Smola,et al.  Semi-Markov Models for Sequence Segmentation , 2007, EMNLP.

[8]  Alexander J. Smola,et al.  Wearable sensor activity analysis using semi-Markov models with a grammar , 2010, Pervasive Mob. Comput..

[9]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[10]  Stewart G Trost,et al.  Conducting accelerometer-based activity assessments in field-based research. , 2005, Medicine and science in sports and exercise.

[11]  R Williamson,et al.  Gait event detection for FES using accelerometers and supervised machine learning. , 2000, IEEE transactions on rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[12]  Zbigniew R. Struzik Time Series Rule Discovery: Tough, Not Meaningless , 2003, ISMIS.

[13]  B.T. Smith,et al.  Evaluation of force-sensing resistors for gait event detection to trigger electrical stimulation to improve walking in the child with cerebral palsy , 2002, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[14]  Thomas Hofmann,et al.  Large Scale Hidden Semi-Markov SVMs , 2007 .

[15]  William W. Cohen,et al.  Semi-Markov Conditional Random Fields for Information Extraction , 2004, NIPS.

[16]  K.A. Peker Subsequence time series (STS) clustering techniques for meaningful pattern discovery , 2005, International Conference on Integration of Knowledge Intensive Multi-Agent Systems, 2005..

[17]  Ann V Rowlands,et al.  The Measurement and Interpretation of Children's Physical Activity. , 2007, Journal of sports science & medicine.