A 2D Transform Based Distance Function for Time Series Classification

Along with the arrival of Industry 4.0 era, time series classification (TSC) has attracted a lot of attention in the last decade. The high dimensionality, high feature correlation and typically high levels of noise that found in time series bring great challenges to TSC. Among TSC algorithms, the 1NN classifier has been shown as effective and difficult to beat. The core of the 1NN classifier is the distance function. The large majority of TSC have concentrated on alternative distance functions. In this paper, a two-dimensional (2D) transform based distance (2DTbD) function is proposed. There are three steps in 2DTbD. Firstly, we convert time series to 2D space by turning time series around the coordinate origin. Then we calculate distances of each dimension. Finally, we ensemble distances in 2D space to get the final time series distance. Our distance function raises the accuracy rate through the fusion of 2D information. Experimental results demonstrate that the classification accuracy can be improved by 2DTbD.

[1]  Shijun Liu,et al.  A Fast Shapelet Discovery Algorithm Based on Important Data Points , 2017, Int. J. Web Serv. Res..

[2]  Philip Chan,et al.  Toward accurate dynamic time warping in linear time and space , 2007, Intell. Data Anal..

[3]  Tomasz Górecki,et al.  Using derivatives in time series classification , 2012, Data Mining and Knowledge Discovery.

[4]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[5]  Eamonn J. Keogh,et al.  Experimental comparison of representation methods and distance measures for time series data , 2010, Data Mining and Knowledge Discovery.

[6]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[7]  Shijun Liu,et al.  A Shapelet Selection Algorithm for Time Series Classification: New Directions , 2017, IIKI.

[8]  Eamonn J. Keogh,et al.  A Complexity-Invariant Distance Measure for Time Series , 2011, SDM.

[9]  Christos Faloutsos,et al.  FTW: fast similarity search under the time warping distance , 2005, PODS.

[10]  Giandomenico Spezzano,et al.  A single pass algorithm for clustering evolving data streams based on swarm intelligence , 2011, Data Mining and Knowledge Discovery.

[11]  Tomasz Górecki,et al.  Non-isometric transforms in time series classification using DTW , 2014, Knowl. Based Syst..

[12]  Eamonn J. Keogh,et al.  Exact indexing of dynamic time warping , 2002, Knowledge and Information Systems.

[13]  Eamonn J. Keogh,et al.  DTW-D: time series semi-supervised learning from a single example , 2013, KDD.

[14]  Jacques Klein,et al.  DSCo-NG: A Practical Language Modeling Approach for Time Series Classification , 2016, IDA.

[15]  Ying Zhang,et al.  Time Series Classification by Modeling the Principal Shapes , 2017, WISE.

[16]  Shijun Liu,et al.  A Self-Evolving Method of Data Model for Cloud-Based Machine Data Ingestion , 2016, 2016 IEEE 9th International Conference on Cloud Computing (CLOUD).

[17]  Eamonn J. Keogh,et al.  The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances , 2016, Data Mining and Knowledge Discovery.

[18]  Eamonn J. Keogh,et al.  LB_Keogh supports exact indexing of shapes under rotation invariance with arbitrary representations and distance measures , 2006, VLDB.

[19]  Dimitrios Gunopulos,et al.  Finding Similar Time Series , 1997, PKDD.

[20]  Pierre-François Marteau,et al.  Time Warp Edit Distance with Stiffness Adjustment for Time Series Matching , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Christos Faloutsos,et al.  Efficient retrieval of similar time sequences under time warping , 1998, Proceedings 14th International Conference on Data Engineering.

[22]  Dacheng Tao,et al.  Segment-Based Features for Time Series Classification , 2012, Comput. J..

[23]  Hui Ding,et al.  Querying and mining of time series data: experimental comparison of representations and distance measures , 2008, Proc. VLDB Endow..

[24]  Christos Faloutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[25]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[26]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[27]  Olufemi A. Omitaomu,et al.  Weighted dynamic time warping for time series classification , 2011, Pattern Recognit..

[28]  Wesley W. Chu,et al.  An index-based approach for similarity search supporting time warping in large sequence databases , 2001, Proceedings 17th International Conference on Data Engineering.

[29]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[30]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[31]  Eamonn J. Keogh,et al.  CID: an efficient complexity-invariant distance for time series , 2013, Data Mining and Knowledge Discovery.

[32]  Juan José Rodríguez Diez,et al.  Stacking for multivariate time series classification , 2015, Pattern Analysis and Applications.

[33]  Lei Chen,et al.  On The Marriage of Lp-norms and Edit Distance , 2004, VLDB.

[34]  Roberto Costa,et al.  Implementation and Empirical Assessment of a Web Application Cloud Deployment Tool , 2013, CloudCom 2013.

[35]  Carlos Agón,et al.  Time-series data mining , 2012, CSUR.

[36]  Raymond K. Wong,et al.  Adaptive Distance Measurement for Time Series Databases , 2007, DASFAA.

[37]  Gautam Das,et al.  The Move-Split-Merge Metric for Time Series , 2013, IEEE Transactions on Knowledge and Data Engineering.