Self-labeling techniques for semi-supervised time series classification: an empirical study

An increasing amount of unlabeled time series data available render the semi-supervised paradigm a suitable approach to tackle classification problems with a reduced quantity of labeled data. Self-labeled techniques stand out from semi-supervised classification methods due to their simplicity and the lack of strong assumptions about the distribution of the labeled and unlabeled data. This paper addresses the relevance of these techniques in the time series classification context by means of an empirical study that compares successful self-labeled methods in conjunction with various learning schemes and dissimilarity measures. Our experiments involve 35 time series datasets with different ratios of labeled data, aiming to measure the transductive and inductive classification capabilities of the self-labeled methods studied. The results show that the nearest-neighbor rule is a robust choice for the base classifier. In addition, the amending and multi-classifier self-labeled-based approaches reveal a promising attempt to perform semi-supervised classification in the time series context.

[1]  Eamonn J. Keogh,et al.  Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping , 2012, KDD.

[2]  Zhongsheng Hua,et al.  Semi-supervised learning based on nearest neighbor rule and cut edges , 2010, Knowl. Based Syst..

[3]  Pradipta Kishore Dash,et al.  Power quality time series data mining using S-transform and fuzzy expert system , 2010, Appl. Soft Comput..

[4]  David Madigan,et al.  Decision Trees for Functional Variables , 2006, Sixth International Conference on Data Mining (ICDM'06).

[5]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[6]  Richard J. Povinelli,et al.  Time series classification using Gaussian mixture models of reconstructed phase spaces , 2004, IEEE Transactions on Knowledge and Data Engineering.

[7]  Eamonn J. Keogh,et al.  Experimental comparison of representation methods and distance measures for time series data , 2010, Data Mining and Knowledge Discovery.

[8]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[9]  Lei Chen,et al.  Robust and fast similarity search for moving object trajectories , 2005, SIGMOD '05.

[10]  Henrik Boström,et al.  Learning First Order Logic Time Series Classifiers: Rules and Boosting , 2000, PKDD.

[11]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[12]  Nick S. Jones,et al.  Highly Comparative Feature-Based Time-Series Classification , 2014, IEEE Transactions on Knowledge and Data Engineering.

[13]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[14]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[15]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[16]  Zhi-Hua Zhou,et al.  SETRED: Self-training with Editing , 2005, PAKDD.

[17]  Fabrice Muhlenbach,et al.  Separability Index in Supervised Learning , 2002, PKDD.

[18]  Li Wei,et al.  Semi-supervised time series classification , 2006, KDD '06.

[19]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[20]  David Zhang,et al.  Time Series Classification Using Support Vector Machine with Gaussian Elastic Metric Kernel , 2010, 2010 20th International Conference on Pattern Recognition.

[21]  Shigeki Sagayama,et al.  Dynamic Time-Alignment Kernel in Support Vector Machine , 2001, NIPS.

[22]  Einoshin Suzuki,et al.  Decision-tree Induction from Time-series Data Based on a Standard-example Split Test , 2003, ICML.

[23]  Raja Jayaraman,et al.  Support vector-based algorithms with weighted dynamic time warping kernel function for time series classification , 2015, Knowl. Based Syst..

[24]  Jason Lines,et al.  Time-Series Classification with COTE: The Collective of Transformation-Based Ensembles , 2015, IEEE Trans. Knowl. Data Eng..

[25]  Mirjana Ivanovic,et al.  The Influence of Global Constraints on Similarity Measures for Time-Series Databases , 2011, Knowl. Based Syst..

[26]  Yan Zhou,et al.  Enhancing Supervised Learning with Unlabeled Data , 2000, ICML.

[27]  James M. W. Brownjohn,et al.  ARMA modelled time-series classification for structural health monitoring of civil infrastructure , 2008 .

[28]  Yasnitsky Leonid Advances in Intelligent Systems and Computing , 2019 .

[29]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[30]  Shie Mannor,et al.  Time Series Analysis Using Geometric Template Matching , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Francisco Herrera,et al.  Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power , 2010, Inf. Sci..

[32]  Geoffrey I. Webb,et al.  Faster and more accurate classification of time series by exploiting a novel dynamic time warping averaging algorithm , 2015, Knowledge and Information Systems.

[33]  Yosef Hochberg,et al.  Extensions of multiple testing procedures based on Simes' test , 1995 .

[34]  Philip S. Yu,et al.  Early classification on time series , 2012, Knowledge and Information Systems.

[35]  Francisco Herrera,et al.  Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study , 2015, Knowledge and Information Systems.

[36]  Eamonn J. Keogh,et al.  Towards Automatic Classification on Flying Insects Using Inexpensive Sensors , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[37]  Joan Serrà,et al.  An empirical evaluation of similarity measures for time series classification , 2014, Knowl. Based Syst..

[38]  Arie Ben-David,et al.  A lot of randomness is hiding in accuracy , 2007, Eng. Appl. Artif. Intell..

[39]  Juan José Rodríguez Diez,et al.  Interval and dynamic time warping-based decision trees , 2004, SAC '04.

[40]  Eamonn J. Keogh,et al.  A Minimum Description Length Technique for Semi-Supervised Time Series Classification , 2013, IRI.

[41]  Hüseyin Kaya,et al.  A distance based time series classification framework , 2015, Inf. Syst..

[42]  Tapio Elomaa,et al.  Principles of data mining and knowledge discovery : 6th European Conference, PKDD 2002, Helsinki, Finland, August 19-23, 2002 : proceedings , 2002 .

[43]  Dechawut Wanichsan,et al.  Stopping Criterion Selection for Efficient Semi-supervised Time Series Classification , 2008, Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing.

[44]  Bernhard Schölkopf,et al.  Semi-Supervised Learning (Adaptive Computation and Machine Learning) , 2006 .

[45]  Pradipta Kishore Dash,et al.  Time sequence data mining using time-frequency analysis and soft computing techniques , 2008, Appl. Soft Comput..

[46]  Eamonn J. Keogh,et al.  DTW-D: time series semi-supervised learning from a single example , 2013, KDD.

[47]  Sylvie Gibet,et al.  On Recursive Edit Distance Kernels With Application to Time Series Classification , 2010, IEEE Transactions on Neural Networks and Learning Systems.

[48]  Mirjana Ivanovic,et al.  Comparison of different weighting schemes for the kNN classifier on time-series data , 2016, Knowledge and Information Systems.

[49]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[50]  Cécile Amblard,et al.  Classification trees for time series , 2012, Pattern Recognit..

[51]  Jason Lines,et al.  Time-Series Classification with COTE: The Collective of Transformation-Based Ensembles , 2015, IEEE Transactions on Knowledge and Data Engineering.

[52]  Celso André R. de Sousa,et al.  Time Series Transductive Classification on Imbalanced Data Sets: An Experimental Study , 2014, 2014 22nd International Conference on Pattern Recognition.

[53]  Liu Xiao-ying Fast Subsequence Matching in Time-series Database , 2008 .

[54]  Yan Zhou,et al.  Democratic co-learning , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[55]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[56]  Gareth J. Janacek,et al.  Clustering time series from ARMA models with clipped data , 2004, KDD.

[57]  Jason Lines,et al.  Time series classification with ensembles of elastic distance measures , 2015, Data Mining and Knowledge Discovery.

[58]  Paul Lukowicz,et al.  On general purpose time series similarity measures and their use as kernel functions in support vector machines , 2014, Inf. Sci..

[59]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[60]  Li Wei,et al.  Fast time series classification using numerosity reduction , 2006, ICML.

[61]  Bing-Yu Sun,et al.  A Study on the Dynamic Time Warping in Kernel Machines , 2007, 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System.

[62]  José Manuel Benítez,et al.  On the stopping criteria for k-Nearest Neighbor in positive unlabeled time series classification problems , 2016, Inf. Sci..

[63]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[64]  Celso André R. de Sousa,et al.  An experimental analysis on time series transductive classification on graphs , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[65]  Minyoung Kim,et al.  Semi-supervised learning of hidden conditional random fields for time-series classification , 2013, Neurocomputing.

[66]  Kristóf Marussy,et al.  SUCCESS: A New Approach for Semi-supervised Classification of Time-Series , 2013, ICAISC.

[67]  Jun Meng,et al.  Granulation-based symbolic representation of time series and semi-supervised classification , 2011, Comput. Math. Appl..

[68]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[69]  J. L. Hodges,et al.  Rank Methods for Combination of Independent Experiments in Analysis of Variance , 1962 .

[70]  Pierre-François Marteau,et al.  Time Warp Edit Distance with Stiffness Adjustment for Time Series Matching , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[71]  Christos Faloutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[72]  Cauligi S. Raghavendra,et al.  Semi-supervised Failure Prediction for Oil Production Wells , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[73]  Elio Masciari,et al.  Exploiting structural similarity for effective Web information extraction , 2007, Data Knowl. Eng..

[74]  Junyi Shen,et al.  Classification of multivariate time series using two-dimensional singular value decomposition , 2008, Knowl. Based Syst..

[75]  Marco Cuturi,et al.  Fast Global Alignment Kernels , 2011, ICML.