Discrete wavelet transform-based time series analysis and mining

A time series is a sequence of values recorded over a period of time for a phenomenon of interest, such as stock prices, household incomes, or patient heart rates. Time series data mining focuses on discovering interesting patterns in such data. This article introduces wavelet-based time series analysis. It provides a systematic survey of analysis techniques that use the discrete wavelet transform (DWT) in time series data mining, and outlines the benefits of this approach as demonstrated by previous studies in diverse application domains, including image classification, multimedia retrieval, and computer network anomaly detection.
