Discovering Frequent Event Patterns with Multiple Granularities in Time Sequences

An important usage of time sequences is to discover temporal patterns. The discovery process usually starts with a user specified skeleton, called an event structure, which consists of a number of variables representing events and temporal constraints among these variables; the goal of the discovery is to find temporal patterns, i.e., instantiations of the variables in the structure that appear frequently in the time sequence. The paper introduces event structures that have temporal constraints with multiple granularities, defines the pattern discovery problem with these structures, and studies effective algorithms to solve it. The basic components of the algorithms include timed automata with granularities (TAGs) and a number of heuristics. The TAGs are for testing whether a specific temporal pattern, called a candidate complex event type, appears frequently in a time sequence. Since there are often a huge number of candidate event types for a usual event structure, heuristics are presented aiming at reducing the number of candidate event types and reducing the time spent by the TAGs testing whether a candidate type does appear frequently in the sequence. These heuristics exploit the information provided by explicit and implicit temporal constraints with granularity in the given event structure. The paper also gives the results of an experiment to show the effectiveness of the heuristics on a real data set.

[1]  Michael Stonebraker,et al.  Implementing calendars and temporal rules in next generation databases , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[2]  Rina Dechter,et al.  Temporal Constraint Networks , 1989, Artif. Intell..

[3]  Angelika Kotz Dittrich,et al.  Research perspectives for time series management systems , 1994, SGMD.

[4]  Heikki Mannila,et al.  Discovering Frequent Episodes in Sequences , 1995, KDD.

[5]  Tomasz Imielinski,et al.  Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..

[6]  Philip Laird,et al.  Identifying and Using Patterns in Sequential Data , 1993, ALT.

[7]  Sushil Jajodia,et al.  Testing complex temporal relationships involving multiple granularities and its application to data mining (extended abstract) , 1996, PODS.

[8]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[9]  Thomas G. Dietterich,et al.  Discovering Patterns in Sequences of Events , 1985, Artif. Intell..

[10]  David Forster,et al.  A Representation for Collections of Temporal Intervals , 1986, AAAI.

[11]  Sushil Jajodia,et al.  Logical design for temporal databases with multiple granularities , 1997, TODS.

[12]  Rajeev Alur,et al.  A Theory of Timed Automata , 1994, Theor. Comput. Sci..

[13]  Alfred V. Aho,et al.  The Design and Analysis of Computer Algorithms , 1974 .

[14]  Kaizhong Zhang,et al.  Combinatorial pattern discovery for scientific data: some preliminary results , 1994, SIGMOD '94.

[15]  Heikki Mannila,et al.  Discovering frequent episodes in sequences extended abstract , 1995, KDD 1995.