Accelerating Time Series Shapelets Discovery with Key Points

Shapelets are discriminative subsequences in a time series dataset, which provide good interpretability for time series classification results. For this reason, time series shapelets have attracted great interest in time series data mining community. Although time series shapelets have satisfactory performance on many time series datasets, how to fast discover them is still a challenge because any subsequence in a time series may be a shapelet candidate. There are several methods to speed up shapelets discovery in recent years. However, these methods are still time-consuming when dealing with the large datasets or long time series. In this paper, we propose a preprocessing step with time series key points for shapelets discovery which make full use of the prior knowledge of shapelets. Combining with shapelets discovery method based on SAX(Fast-Shaplets), we can find shapelets quickly on all benchmark datasets of UCR archives, while the classification accuracy is almost the same as the current methods.

[1]  Jason Lines,et al.  A shapelet transform for time series classification , 2012, KDD.

[2]  Li Wei,et al.  Fast time series classification using numerosity reduction , 2006, ICML.

[3]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[4]  Eamonn J. Keogh,et al.  Scalable Clustering of Time Series with U-Shapelets , 2015, SDM.

[5]  Hui Ding,et al.  Querying and mining of time series data: experimental comparison of representations and distance measures , 2008, Proc. VLDB Endow..

[6]  Norbert Link,et al.  Prototype Optimization for Temporarily and Spatially Distorted Time Series , 2010, AAAI Spring Symposium: It's All in the Timing.

[7]  Lior Rokach,et al.  Fast Randomized Model Generation for Shapelet-Based Time Series Classification , 2012, ArXiv.

[8]  Zhen Wang,et al.  uWave: Accelerometer-based Personalized Gesture Recognition and Its Applications , 2009, PerCom.

[9]  Eamonn J. Keogh,et al.  Fast Shapelets: A Scalable Algorithm for Discovering Time Series Shapelets , 2013, SDM.

[10]  Eamonn J. Keogh,et al.  Time series shapelets: a new primitive for data mining , 2009, KDD.

[11]  Dan Roth,et al.  Efficient Pattern-Based Time Series Classification on GPU , 2012, 2012 IEEE 12th International Conference on Data Mining.

[12]  Didier Stricker,et al.  Exploring and extending the boundaries of physical activity recognition , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[13]  Eamonn J. Keogh,et al.  Logical-shapelets: an expressive primitive for time series classification , 2011, KDD.