Time Series Classification with Discrete Wavelet Transformed Data

Time series mining has become essential for extracting knowledge from the abundant data that flows out from many application domains. To overcome storage and processing challenges in time series mining, compression techniques are being used. In this paper, we investigate the loss/gain of performance of time series classification approaches when fed with lossy-compressed data. This extended empirical study is essential for reassuring practitioners, but also for providing more insights on how compression techniques can even be effective in smoothing and reducing noise in time series data. From a knowledge engineering perspective, we show that time series may be compressed by 90% using discrete wavelet transforms and still achieve remarkable classification accuracy, and that residual details left by popular wavelet compression techniques can sometimes even help to achieve higher classification accuracy than the raw time series data, as they better capture essential local features.

[1]  Eamonn J. Keogh,et al.  Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping , 2012, KDD.

[2]  J.B.T.M. Roerdink,et al.  A review of wavelet denoising in MRI and ultrasound brain imaging , 2006 .

[3]  Qiang Wang,et al.  A dimensionality reduction technique for efficient time series similarity analysis , 2008, Inf. Syst..

[4]  Pascal Bouvry,et al.  Management of an academic HPC cluster: The UL experience , 2014, 2014 International Conference on High Performance Computing & Simulation (HPCS).

[5]  Yun Zhang,et al.  Wavelet based image fusion techniques — An introduction, review and comparison , 2007 .

[6]  Geok Soon Hong,et al.  Wavelet analysis of sensor signals for tool condition monitoring: A review and some new results , 2009 .

[7]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[8]  Walid G. Morsi,et al.  Application of wavelet-based classification in non-intrusive load monitoring , 2015, 2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE).

[9]  Philip Chan,et al.  Toward accurate dynamic time warping in linear time and space , 2007, Intell. Data Anal..

[10]  Eamonn J. Keogh,et al.  A Complexity-Invariant Distance Measure for Time Series , 2011, SDM.

[11]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[12]  George C. Runger,et al.  A Bag-of-Features Framework to Classify Time Series , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Jacques Klein,et al.  Time Series Classification with Discrete Wavelet Transformed Data: Insights from an Empirical Study , 2016, SEKE.

[14]  Clement T. Yu,et al.  Haar Wavelets for Efficient Similarity Search of Time-Series: With and Without Time Warping , 2003, IEEE Trans. Knowl. Data Eng..

[15]  Jacques Klein,et al.  DSCo: A Language Modeling Approach for Time Series Classification , 2016, MLDM.

[16]  Jacques Klein,et al.  DSCo-NG: A Practical Language Modeling Approach for Time Series Classification , 2016, IDA.

[17]  I. Daubechies,et al.  Wavelets on the Interval and Fast Wavelet Transforms , 1993 .

[18]  I. Daubechies Orthonormal bases of compactly supported wavelets II: variations on a theme , 1993 .

[19]  Anthony Rowe,et al.  BLUED : A Fully Labeled Public Dataset for Event-Based Non-Intrusive Load Monitoring Research , 2012 .

[20]  Eamonn J. Keogh,et al.  Time series shapelets: a new primitive for data mining , 2009, KDD.

[21]  Joan Serrà,et al.  An empirical evaluation of similarity measures for time series classification , 2014, Knowl. Based Syst..

[22]  Jacques Klein,et al.  Profiling household appliance electricity usage with N-gram language modeling , 2016, 2016 IEEE International Conference on Industrial Technology (ICIT).

[23]  C. Duarte,et al.  Non-intrusive load monitoring based on switching voltage transients and wavelet transforms , 2012, 2012 Future of Instrumentation International Workshop (FIIW) Proceedings.

[24]  Michael W. Marcellin,et al.  JPEG2000: standard for interactive imaging , 2002, Proc. IEEE.

[25]  R. E. Lee,et al.  Distribution-free multiple comparisons between successive treatments , 1995 .

[26]  Eamonn J. Keogh,et al.  Fast Shapelets: A Scalable Algorithm for Discovering Time Series Shapelets , 2013, SDM.

[27]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[28]  Junbin Gao,et al.  A review on applications of wavelet transform techniques in chemical analysis: 1989–1997 , 1998 .

[29]  Paul S Addison,et al.  Wavelet transforms and the ECG: a review , 2005, Physiological measurement.