Distance measure for querying sequences of temporal intervals

Time series representations are not always rich enough to describe the temporal activity, for instance, when the context and the relations of the observed elements are of interest. Sequences of temporal intervals use such intervals as primitives in their representation, and allow focusing on the temporal relations of these elements. This is a useful representation of data across many domains. Searching, indexing, and mining such sequences is essential for domain experts in order to discover useful information out of them. In this paper, we formulate the problem of comparing sequences of temporal intervals and propose a novel distance measure. We discuss the properties of the measure and study its robustness in the domain of sign language. Experiments on real data show that the measure is robust in terms of retrieval accuracy even for high levels of artificially introduced distortion.

[1]  John F. Roddick,et al.  A Survey of Temporal Knowledge Discovery Paradigms and Methods , 2002, IEEE Trans. Knowl. Data Eng..

[2]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[3]  Frank Höppner Discovery of Temporal Patterns. Learning Rules about the Qualitative Behaviour of Time Series , 2001, PKDD.

[4]  Hongjun Lu,et al.  Stock movement prediction and N-dimensional inter-transaction association rules , 1998, SIGMOD 1998.

[5]  P. S. Sastry,et al.  Discovering Frequent Generalized Episodes When Events Persist for Different Durations , 2007, IEEE Transactions on Knowledge and Data Engineering.

[6]  Charlotte Baker-Shenk,et al.  A Microanalysis of the Nonmanual Components of Questions in American Sign Language , 1983 .

[7]  Panagiotis Papapetrou,et al.  Discovering Frequent Poly-Regions in DNA Sequences , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[8]  Geoffrey Restall Coulter,et al.  American sign language typology , 1979 .

[9]  Frank Klawonn,et al.  Finding informative rules in interval sequences , 2001, Intell. Data Anal..

[10]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[11]  Carol Neidle,et al.  The Syntax of American Sign Language: Functional Categories and Hierarchical Structure , 1999 .

[12]  Jonathan D. Cryer,et al.  Time Series Analysis , 1986 .

[13]  Ada Wai-Chee Fu,et al.  Discovering Temporal Patterns for Interval-Based Events , 2000, DaWaK.

[14]  Scott K. Liddell American Sign Language Syntax , 1981 .

[15]  Yen-Liang Chen,et al.  Mining Nonambiguous Temporal Patterns for Interval-Based Events , 2007, IEEE Transactions on Knowledge and Data Engineering.

[16]  Gustavo Rossi,et al.  An approach to discovering temporal association rules , 2000, SAC '00.

[17]  James F. Allen,et al.  Actions and Events in Interval Temporal Logic , 1994 .

[18]  Kien A. Hua,et al.  Knowledge Discovery from Series of Interval Events , 2000, Journal of Intelligent Information Systems.

[19]  Dimitrios Gunopulos,et al.  Efficient Mining of Spatiotemporal Patterns , 2001, SSTD.

[20]  Simon Parsons,et al.  Principles of Data Mining by David J. Hand, Heikki Mannila and Padhraic Smyth, MIT Press, 546 pp., £34.50, ISBN 0-262-08290-X , 2004, The Knowledge Engineering Review.

[21]  John F. Roddick,et al.  ARMADA - An algorithm for discovering richer relative temporal association rules from interval-based data , 2007, Data Knowl. Eng..

[22]  John F. Roddick,et al.  Incremental Meta-Mining from Large Temporal Data Sets , 1998, ER Workshops.

[23]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[24]  Dmitriy Fradkin,et al.  Robust Mining of Time Intervals with Semi-interval Partial Order Patterns , 2010, SDM.

[25]  Chih-Ping Wei,et al.  Discovery of temporal patterns from process instances , 2004, Comput. Ind..

[26]  John F. Roddick,et al.  Mining Relationships Between Interacting Episodes , 2004, SDM.

[27]  Dimitrios Gunopulos,et al.  Discovering frequent arrangements of temporal intervals , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[28]  Xiaodong Chen,et al.  Mining Temporal Features in Association Rules , 1999, PKDD.

[29]  Dino Pedreschi,et al.  Efficient Mining of Temporally Annotated Sequences , 2006, SDM.

[30]  Jun-Lin Lin Mining maximal frequent intervals , 2003, SAC '03.

[31]  Dimitrios Gunopulos,et al.  Mining frequent arrangements of temporal intervals , 2009, Knowledge and Information Systems.