Online Top-k Similar Time-Lagged Pattern Pair Search in Multiple Time Series

We extract the relation among multiple time series in which a characteristic pattern in a time series follows a similar pattern in another time series. We call this a ‘post-hoc-relation’. For extracting many post-hoc-relations from a large number of time series, we investigated the problem of reducing the cost of online searching for the top-k similar time-lagged pattern pairs in multiple time series, where k is the query size. We propose an online top-k similar time-lagged pattern pair search method that manages the candidate cache in preparation for the top-k pair update and defines the upper bound distance for each arrival time of pattern pairs. Our method also prunes dissimilar pattern pairs by using an index and the upper bound distance. Experimental results show that our method successfully reduces the number of distance computations for a top-k similar pattern update.

[1]  Beng Chin Ooi,et al.  iDistance: An adaptive B+-tree based indexing method for nearest neighbor search , 2005, TODS.

[2]  Jeffrey K. Uhlmann,et al.  Satisfying General Proximity/Similarity Queries with Metric Trees , 1991, Inf. Process. Lett..

[3]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[4]  Paulo J. Azevedo,et al.  Multiresolution Motif Discovery in Time Series , 2010, SDM.

[5]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[6]  Eamonn J. Keogh,et al.  Detecting time series motifs under uniform scaling , 2007, KDD '07.

[7]  Ge Yu,et al.  Boolean representation based data-adaptive correlation analysis over time series streams , 2007, CIKM '07.

[8]  Eamonn J. Keogh,et al.  Mining motifs in massive time series databases , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[9]  Jessica Lin,et al.  Finding Motifs in Time Series , 2002, KDD 2002.

[10]  Gonzalo Navarro,et al.  Dynamic spatial approximation trees , 2008, JEAL.

[11]  Yunhao Liu,et al.  Indexable PLA for Efficient Similarity Search , 2007, VLDB.

[12]  Gonzalo Navarro,et al.  A compact space decomposition for effective metric indexing , 2005, Pattern Recognit. Lett..

[13]  Pavel Zezula,et al.  A cost model for similarity queries in metric spaces , 1998, PODS '98.

[14]  Toon Calders,et al.  Online Discovery of Top-k Similar Motifs in Time Series Data , 2011, SDM.

[15]  Eamonn J. Keogh,et al.  Online discovery and maintenance of time series motifs , 2010, KDD.

[16]  Jie Liu,et al.  Fast approximate correlation for massive time-series data , 2010, SIGMOD Conference.

[17]  Pavel Zezula,et al.  Similarity Search: The Metric Space Approach (Advances in Database Systems) , 2005 .

[18]  Jie Liu,et al.  GAMPS: compressing multi sensor data by grouping and amplitude scaling , 2009, SIGMOD Conference.

[19]  Ira Assent,et al.  The TS-tree: efficient time series search and retrieval , 2008, EDBT '08.