Classification of multivariate time series via temporal abstraction and time intervals mining

Classification of multivariate time series data, often including both time points and intervals at variable frequencies, is a challenging task. We introduce the KarmaLegoSification (KLS) framework for classification of multivariate time series analysis, which implements three phases: (1) application of a temporal abstraction process that transforms a series of raw time-stamped data points into a series of symbolic time intervals; (2) mining these symbolic time intervals to discover frequent time-interval-related patterns (TIRPs), using Allen’s temporal relations; and (3) using the TIRPs as features to induce a classifier. To efficiently detect multiple TIRPs (features) in a single entity to be classified, we introduce a new algorithm, SingleKarmaLego, which can be shown to be superior for that purpose over a Sequential TIRPs Detection algorithm. We evaluated the KLS framework on datasets in the domains of diabetes, intensive care, and infectious hepatitis, assessing the effects of the various settings of the KLS framework. Discretization using Symbolic Aggregate approXimation (SAX) led to better performance than using the equal-width discretization (EWD); knowledge-based cut-off definitions when available were superior to both. Using three abstract temporal relations was superior to using the seven core temporal relations. Using an epsilon value larger than zero tended to result in a slightly better accuracy when using the SAX discretization method, but resulted in a reduced accuracy when using EWD, and overall, does not seem beneficial. No feature selection method we tried proved useful. Regarding feature (TIRP) representation, mean duration performed better than horizontal support, which in turn performed better than the default Binary (existence) representation method.

[1]  Yuval Shahar,et al.  A Framework for Knowledge-Based Temporal Abstraction , 1997, Artif. Intell..

[2]  Dmitriy Fradkin,et al.  Robust Mining of Time Intervals with Semi-interval Partial Order Patterns , 2010, SDM.

[3]  Umeshwar Dayal,et al.  PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth , 2001, ICDE 2001.

[4]  Christian Freksa,et al.  Temporal Reasoning Based on Semi-Intervals , 1992, Artif. Intell..

[5]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[6]  Diane J. Cook,et al.  Detecting Anomalous Sensor Events in Smart Home Data for Enhancing the Living Experience , 2011, Artificial Intelligence and Smarter Living.

[7]  Yuval Shahar Knowledge-based temporal interpolation , 1999, J. Exp. Theor. Artif. Intell..

[8]  Carlo Combi,et al.  Data mining with Temporal Abstractions: learning rules from time series , 2007, Data Mining and Knowledge Discovery.

[9]  Eamonn J. Keogh,et al.  Time Series Classification under More Realistic Assumptions , 2013, SDM.

[10]  Fabian Mörchen,et al.  Optimizing time series discretization for knowledge discovery , 2005, KDD '05.

[11]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[12]  John F. Roddick,et al.  A Survey of Temporal Knowledge Discovery Paradigms and Methods , 2002, IEEE Trans. Knowl. Data Eng..

[13]  Milos Hauskrecht,et al.  A temporal pattern mining approach for classifying electronic health record data , 2013, ACM Trans. Intell. Syst. Technol..

[14]  Fabian Mörchen,et al.  Algorithms for time series knowledge mining , 2006, KDD '06.

[15]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[16]  Yuval Shahar,et al.  Medical Temporal-Knowledge Discovery via Temporal Abstraction , 2009, AMIA.

[17]  Evert de Jonge,et al.  Temporal Discretization of medical time series - A comparative study , 2007 .

[18]  Yuval Shahar,et al.  Dynamic temporal interpretation contexts for temporal abstraction , 1998, Annals of Mathematics and Artificial Intelligence.

[19]  Evert de Jonge,et al.  Temporal abstraction for feature extraction: A comparative case study in prediction from intensive care monitoring data , 2007, Artif. Intell. Medicine.

[20]  Yuval Shahar,et al.  Improving Worm Detection with Artificial Neural Networks through Feature Selection and Temporal Analysis Techniques , 2008 .

[21]  Dimitrios Gunopulos,et al.  Mining frequent arrangements of temporal intervals , 2009, Knowledge and Information Systems.

[22]  Frank Höppner,et al.  Time Series Abstraction Methods - A Survey , 2002, GI Jahrestagung.

[23]  Yuval Shahar,et al.  Detection of Unknown Computer Worms Activity Based on Computer Behavior using Data Mining , 2007, CISDA.

[24]  Blaz Zupan,et al.  Intelligent Data Analysis in Medicine , 2000 .

[25]  Yuval Shahar,et al.  Vaidurya - A Concept-Based, Context-Sensitive Search Engine For Clinical Guidelines , 2004, MedInfo.

[26]  Yuval Shahar,et al.  Fast time intervals mining using the transitivity of temporal relations , 2013, Knowledge and Information Systems.

[27]  Yuval Shahar,et al.  Original Investigation: Semi-automated Entry of Clinical Temporal-abstraction Knowledge , 1999, J. Am. Medical Informatics Assoc..

[28]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[29]  Milos Hauskrecht,et al.  Mining recent temporal patterns for event detection in multivariate time series data , 2012, KDD.

[30]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[31]  John F. Roddick,et al.  ARMADA - An algorithm for discovering richer relative temporal association rules from interval-based data , 2007, Data Knowl. Eng..

[32]  F. Höppner Learning Temporal Rules from State Sequences , 2001 .

[33]  Yuval Shahar,et al.  Semiautomated Acquisition of Clinical Temporal-abstraction Knowledge , 1998 .

[34]  Yuval Shahar,et al.  Vaidurya: A multiple-ontology, concept-based, context-sensitive clinical-guideline search engine , 2009, J. Biomed. Informatics.

[35]  Kien A. Hua,et al.  Knowledge Discovery from Series of Interval Events , 2000, Journal of Intelligent Information Systems.

[36]  Eamonn J. Keogh,et al.  Three Myths about Dynamic Time Warping Data Mining , 2005, SDM.

[37]  Mong-Li Lee,et al.  Mining relationships among interval-based events for classification , 2008, SIGMOD Conference.

[38]  Ada Wai-Chee Fu,et al.  Discovering Temporal Patterns for Interval-Based Events , 2000, DaWaK.

[39]  Evert de Jonge,et al.  Analysis of ICU Patients Using the Time Series Knowledge Mining Method , 2007 .

[40]  Yen-Liang Chen,et al.  Mining Nonambiguous Temporal Patterns for Interval-Based Events , 2007, IEEE Transactions on Knowledge and Data Engineering.

[41]  F. Mörchen A better tool than Allen's relations for expressing temporal knowledge in interval data , 2006 .

[42]  Yuval Shahar,et al.  Application of Artificial Neural Networks Techniques to Computer Worm Detection , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.