The UEA multivariate time series classification archive, 2018

In 2002, the UCR time series classification archive was first released with sixteen datasets. It gradually expanded, until 2015 when it increased in size from 45 datasets to 85 datasets. In October 2018 more datasets were added, bringing the total to 128. The new archive contains a wide range of problems, including variable length series, but it still only contains univariate time series classification problems. One of the motivations for introducing the archive was to encourage researchers to perform a more rigorous evaluation of newly proposed time series classification (TSC) algorithms. It has worked: most recent research into TSC uses all 85 datasets to evaluate algorithmic advances. Research into multivariate time series classification, where more than one series are associated with each class label, is in a position where univariate TSC research was a decade ago. Algorithms are evaluated using very few datasets and claims of improvement are not based on statistical comparisons. We aim to address this problem by forming the first iteration of the MTSC archive, to be hosted at the website this http URL. Like the univariate archive, this formulation was a collaborative effort between researchers at the University of East Anglia (UEA) and the University of California, Riverside (UCR). The 2018 vintage consists of 30 datasets with a wide range of cases, dimensions and series lengths. For this first iteration of the archive we format all data to be of equal length, include no series with missing data and provide train/test splits.

[1]  B. Prabhakaran,et al.  Word Recognition from Continuous Articulatory Movement Time-series Data using Symbolic Representations , 2013, SLPAT.

[2]  Hossein Hamooni,et al.  Dual-Domain Hierarchical Classification of Phonetic Time Series , 2014, 2014 IEEE International Conference on Data Mining.

[3]  Laura J. Grundy,et al.  A dictionary of behavioral motifs reveals clusters of genes affecting Caenorhabditis elegans locomotion , 2012, Proceedings of the National Academy of Sciences.

[4]  Ethem Alpaydin,et al.  Combining Multiple Representations for Pen-based Handwritten Digit Recognition , 2001 .

[5]  James Large,et al.  Detecting Forged Alcohol Non-invasively Through Vibrational Spectroscopy and Machine Learning , 2018, PAKDD.

[6]  Víctor M. González Suárez,et al.  Generalized Models for the Classification of Abnormal Movements in Daily Life and its Applicability to Epilepsy Convulsion Recognition , 2016, Int. J. Neural Syst..

[7]  Jun Wang,et al.  Generalizing DTW to the multi-dimensional case requires an adaptive approach , 2016, Data Mining and Knowledge Discovery.

[8]  Jun Wang,et al.  On the Non-Trivial Generalization of Dynamic Time Warping to the Multi-Dimensional Case , 2015, SDM.

[9]  Patrick Chiang,et al.  Rate-adaptive compressed-sensing and sparsity variance of biomedical signals , 2015, 2015 IEEE 12th International Conference on Wearable and Implantable Body Sensor Networks (BSN).

[10]  Eamonn J. Keogh,et al.  Flying Insect Classification with Inexpensive Sensors , 2014, Journal of Insect Behavior.

[11]  Marc Toussaint,et al.  Modelling motion primitives and their timing in biologically executed movements , 2007, NIPS.

[12]  Sahin Albayrak,et al.  eRing: multiple finger gesture recognition with one ring using an electric field , 2015, iWOAR.

[13]  Zhen Wang,et al.  uWave: Accelerometer-based Personalized Gesture Recognition and Its Applications , 2009, PerCom.

[14]  S. Venkatesh,et al.  Online Context Recognition in Multisensor Systems using Dynamic Time Warping , 2005, 2005 International Conference on Intelligent Sensors, Sensor Networks and Information Processing.

[15]  Marco Cuturi,et al.  Fast Global Alignment Kernels , 2011, ICML.

[16]  Klaus-Robert Müller,et al.  Classifying Single Trial EEG: Towards Brain Computer Interfacing , 2001, NIPS.

[17]  G. Moody,et al.  Spontaneous termination of atrial fibrillation: a challenge from physionet and computers in cardiology 2004 , 2004, Computers in Cardiology, 2004.

[18]  Eamonn J. Keogh,et al.  The UCR time series archive , 2018, IEEE/CAA Journal of Automatica Sinica.

[19]  Laura J. Grundy,et al.  A database of C. elegans behavioral phenotypes , 2013, Nature Methods.

[20]  H. Flor,et al.  A spelling device for the paralysed , 1999, Nature.

[21]  Mineichi Kudo,et al.  Multidimensional curve classification using passing-through regions , 1999, Pattern Recognit. Lett..

[22]  Eamonn J. Keogh,et al.  The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances , 2016, Data Mining and Knowledge Discovery.

[23]  Nacereddine Hammami,et al.  Tree distribution classifier for automatic spoken Arabic digit recognition , 2009, 2009 International Conference for Internet Technology and Secured Transactions, (ICITST).

[24]  Pierre-François Marteau,et al.  Continuous pattern detection and recognition in stream - a benchmark for online gesture recognition , 2017, Int. J. Appl. Pattern Recognit..

[25]  Bernhard Schölkopf,et al.  Methods Towards Invasive Human Brain Computer Interfaces , 2004, NIPS.