Constrained Motif Discovery in Time Series

The goal of motif discovery algorithms is to efficiently find unknown recurring patterns. In this paper, we focus on motif discovery in time series. Most available algorithms cannot utilize domain knowledge in any way which results in quadratic or at least super-linear time and space complexity. In this paper we define the Constrained Motif Discovery problem which enables utilization of domain knowledge into the motif discovery process. The paper then provides two algorithms called MCFull and MCInc for efficiently solving the constrained motif discovery problem. We also show that most unconstrained motif discovery problems be converted into constrained ones using a change-point detection algorithm. A novel change-point detection algorithm called the Robust Singular Spectrum Transform (RSST) is then introduced and compared to traditional Singular Spectrum Transform using synthetic and real-world data sets. The results show that RSST achieves higher specificity and is more adequate for finding constraints to convert unconstrained motif discovery problems to constrained ones that can be solved using MCFull and MCInc. We then compare the combination of RSST and MCFull or MCInc with two state-of-the-art motif discovery algorithms on a large set of synthetic time series. The results show that the proposed algorithms provided four to ten folds increase in speed compared the unconstrained motif discovery algorithms studied without any loss of accuracy. RSST+MCFull is then used in a real world human-robot interaction experiment to enable the robot to learn free hand gestures, actions, and their associations by watching humans and other robots interacting.

[1]  Stephen Shaoyi Liao,et al.  Discovering original motifs with different lengths from time series , 2008, Knowl. Based Syst..

[2]  Jeremy Buhler,et al.  Finding motifs using random projections , 2001, RECOMB.

[3]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[4]  Toyoaki Nishida,et al.  Human adaptation to a miniature robot: Precursors of mutual adaptation , 2008, RO-MAN 2008 - The 17th IEEE International Symposium on Robot and Human Interactive Communication.

[5]  Jessica Lin,et al.  Finding Motifs in Time Series , 2002, KDD 2002.

[6]  Mark P. Styczynski,et al.  A generic motif discovery algorithm for sequential data. , 2006, Bioinformatics.

[7]  Michèle Basseville,et al.  Detection of abrupt changes , 1993 .

[8]  Edit Gombay Change detection in autoregressive time series , 2008 .

[9]  V. Moskvina,et al.  An Algorithm Based on Singular Spectrum Analysis for Change-Point Detection , 2003 .

[10]  Tim Oates,et al.  PERUSE: An unsupervised algorithm for finding recurring patterns in time series , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[11]  Keisuke Inoue,et al.  Knowledge Discovery from Heterogeneous Dynamic Systems using Change-Point Correlations , 2005, SDM.

[12]  Irfan A. Essa,et al.  Improving Activity Discovery with Automatic Neighborhood Estimation , 2007, IJCAI.

[13]  Irfan A. Essa,et al.  Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[14]  Shubha Kadambe,et al.  Application of the wavelet transform for pitch detection of speech signals , 1992, IEEE Trans. Inf. Theory.

[15]  Yasser F. O. Mohammad,et al.  The H3R Explanation Corpus human-human and base human-robot interaction dataset , 2008, 2008 International Conference on Intelligent Sensors, Sensor Networks and Information Processing.

[16]  Christiaan J. J. Paredis,et al.  Interactive Multimodal Robot Programming , 2005, Int. J. Robotics Res..

[17]  Futoshi Naya,et al.  B-Pack: a Bluetooth-based wearable sensing device for nursing activity recognition , 2006, 2006 1st International Symposium on Wireless Pervasive Computing.

[18]  Tom Armstrong,et al.  Discovering Patterns in Real-Valued Time Series , 2006, PKDD.

[19]  Eamonn J. Keogh,et al.  Probabilistic discovery of time series motifs , 2003, KDD '03.

[20]  Yong Xu,et al.  An experiment study of gesture-based human-robot interface , 2007, 2007 IEEE/ICME International Conference on Complex Medical Engineering.

[21]  Shusaku Tsumoto,et al.  Mining similar temporal patterns in long time-series data and its application to medicine , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..