Mining Time Series Data: A Selective Survey

Time series prediction and control may involve the study of massive data archive and require some kind of data mining techniques. In order to make the comparison of time series meaningful, one important question is to decide what similarity means and what features have to be extracted from a time series. This question leads to the fundamental dichotomy: (a) similarity can be based solely on time series shape; (b) similarity can be measured by looking at time series structure. This article discusses the main dissimilarity indices proposed in literature for time series data mining.

[1]  Eamonn J. Keogh,et al.  Iterative Deepening Dynamic Time Warping for Time Series , 2002, SDM.

[2]  Dimitrios Gunopulos,et al.  Indexing Multidimensional Time-Series , 2004, The VLDB Journal.

[3]  Maurizio Vichi New Developments in Classification and Data Analysis , 2005 .

[4]  John E. Markel,et al.  Linear Prediction of Speech , 1976, Communication and Cybernetics.

[5]  A. Gray,et al.  Distance measures for speech processing , 1976 .

[6]  Isabella Morlini On the Dynamic Time Warping for Computing the Dissimilarity Between Curves , 2005 .

[7]  Yoh'ichi Tohkura,et al.  A weighted cepstral distance measure for speech recognition , 1987, IEEE Trans. Acoust. Speech Signal Process..

[8]  Shenghuo Zhu,et al.  A survey on wavelet applications in data mining , 2002, SKDD.

[9]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[10]  Box Ge,et al.  Time series analysis: forecasting and control rev. ed. , 1976 .

[11]  T. Kuo,et al.  The application of cepstral coefficients and maximum likelihood method in EMG pattern recognition. , 1995, IEEE transactions on bio-medical engineering.

[12]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[13]  Gareth J. Janacek,et al.  Clustering time series from ARMA models with clipped data , 2004, KDD.

[14]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[15]  Changzhou Wang,et al.  Supporting content-based searches on time series via approximation , 2000, Proceedings. 12th International Conference on Scientific and Statistica Database Management.

[16]  P. Papantoni-Kazakos,et al.  Spectral distance measures between Gaussian processes , 1980, ICASSP.

[17]  P. D. Souza,et al.  15 Speech recognition using LPC distance measures , 1985 .

[18]  Konstantinos Kalpakis,et al.  Distance measures for effective clustering of ARIMA time-series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[19]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[20]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[21]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[22]  Dit-Yan Yeung,et al.  Time series clustering with ARMA mixtures , 2004, Pattern Recognit..

[23]  Marcella Corduas,et al.  Time series clustering and classification by the autoregressive metric , 2008, Comput. Stat. Data Anal..

[24]  Elvira Romano,et al.  Integrating time alignment and Self Organizing Maps for Classifying Curves , 2006 .

[25]  D. Piccolo A DISTANCE MEASURE FOR CLASSIFYING ARIMA MODELS , 1990 .

[26]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[27]  Philip S. Yu,et al.  Structural Periodic Measures for Time-Series Data , 2005, Data Mining and Knowledge Discovery.

[28]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[29]  Gareth J. Janacek,et al.  Clustering Time Series with Clipped Data , 2005, Machine Learning.

[30]  M. Hunt,et al.  Distance measures for speech recognition , 1989 .

[31]  R. H. Shumway,et al.  1 Discriminant analysis for time series , 1982, Classification, Pattern Recognition and Reduction of Dimensionality.

[32]  Eamonn Keogh Exact Indexing of Dynamic Time Warping , 2002, VLDB.

[33]  Roberto Baragona,et al.  Statistical and Numerical Algorithms for Time Series Classification , 2010 .

[34]  Jorge Caiado,et al.  A periodogram-based metric for time series classification , 2006, Comput. Stat. Data Anal..

[35]  Zbigniew R. Struzik,et al.  The Haar Wavelet Transform in the Time Series Similarity Paradigm , 1999, PKDD.

[36]  Eamonn J. Keogh,et al.  Making Time-Series Classification More Accurate Using Learned Constraints , 2004, SDM.