Efficient Query Processing in Time Series

With the rapid development over the last decade, time series data become one of the most frequently used data in real world applications (e.g., finance analysis, medical diagnosis, environmental monitoring, etc.). As expected, the volume of the time series data will even grow larger in near future. It is important to design efficient and effective algorithm and index to handle various tasks for these data. Thereby, my PhD study focuses on how to extract meaningful time series patterns from large volume of data efficiently. Specifically, two types of extraction queries are discussed in this work, including longest-lasting correlated subsequence query and time series motif query. The applications and solutions of these two queries are thoroughly introduced and discussed in this paper. Moreover, some potential pattern extraction queries will also be discussed in this paper.

[1]  Christos Faloutsos,et al.  Similarity search without tears: the OMNI-family of all-purpose access methods , 2001, Proceedings 17th International Conference on Data Engineering.

[2]  Eamonn J. Keogh,et al.  Accelerating Dynamic Time Warping Subsequence Search with GPUs and FPGAs , 2010, 2010 IEEE International Conference on Data Mining.

[3]  Eamonn J. Keogh,et al.  Exact Discovery of Time Series Motifs , 2009, SDM.

[4]  Ira Assent,et al.  The TS-tree: efficient time series search and retrieval , 2008, EDBT '08.

[5]  Jessica Lin,et al.  Finding Motifs in Time Series , 2002, KDD 2002.

[6]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[7]  Paulo J. Azevedo,et al.  Multiresolution Motif Discovery in Time Series , 2010, SDM.

[8]  Eamonn J. Keogh,et al.  iSAX 2.0: Indexing and Mining One Billion Time Series , 2010, 2010 IEEE International Conference on Data Mining.

[9]  Eamonn J. Keogh,et al.  Logical-shapelets: an expressive primitive for time series classification , 2011, KDD.

[10]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[11]  Eamonn J. Keogh,et al.  Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping , 2012, KDD.

[12]  Man Lung Yiu,et al.  Quick-motif: An efficient and scalable framework for exact motif discovery , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[13]  Vipin Kumar,et al.  Comparative Evaluation of Anomaly Detection Techniques for Sequence Data , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[14]  Eamonn J. Keogh,et al.  Time Series Epenthesis: Clustering Time Series Streams Requires Ignoring Some Data , 2011, 2011 IEEE 11th International Conference on Data Mining.

[15]  Dimitrios Gunopulos,et al.  Approximate embedding-based subsequence matching of time series , 2008, SIGMOD Conference.

[16]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[17]  Kuniaki Uehara,et al.  Discovery of Time-Series Motif from Multi-Dimensional Data Based on MDL Principle , 2005, Machine Learning.

[18]  Yong Yu,et al.  Prominent streak discovery in sequence data , 2011, KDD.

[19]  Eamonn J. Keogh,et al.  Online discovery and maintenance of time series motifs , 2010, KDD.

[20]  Clu-istos Foutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[21]  Eamonn J. Keogh,et al.  Finding Time Series Motifs in Disk-Resident Data , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[22]  Eamonn Keogh Exact Indexing of Dynamic Time Warping , 2002, VLDB.

[23]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[24]  Dimitrios Gunopulos,et al.  Embedding-based subsequence matching in time-series databases , 2011, TODS.

[25]  Abdullah Mueen,et al.  Enumeration of time series motifs of all lengths , 2013, 2013 IEEE 13th International Conference on Data Mining.

[26]  Tim Oates,et al.  Visualizing Variable-Length Time Series Motifs , 2012, SDM.

[27]  Ambuj K. Singh,et al.  Optimizing similarity search for arbitrary length time series queries , 2004, IEEE Transactions on Knowledge and Data Engineering.

[28]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[29]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[30]  Man Lung Yiu,et al.  Discovering Longest-lasting Correlation in Sequence Databases , 2013, Proc. VLDB Endow..