SUCCESS: A New Approach for Semi-supervised Classification of Time-Series

The growing interest in time-series classification can be attributed to the intensively increasing amount of temporal data collected by widespread sensors. Often, human experts may only review a small portion of all the available data. Therefore, the available labeled data may not be representative enough and semi-supervised techniques may be necessary. In order to construct accurate classifiers, semi-supervised techniques learn both from labeled and unlabeled data. In this paper, we introduce a novel semi-supervised time-series classifier based on constrained hierarchical clustering and dynamic time warping. We discuss our approach in the framework of graph theory and evaluate it on 44 publicly available real-world time-series datasets from various domains. Our results show that our approach substantially outperforms the state-of-the-art semi-supervised time-series classifier. The results are also justified by statistical significance tests.

[1]  Krisztian Buza,et al.  SOHAC: Efficient Storage of Tick Data That Supports Search and Analysis , 2012, ICDM.

[2]  Krisztian Buza,et al.  Fusion Methods for Time-Series Classification , 2011 .

[3]  Dechawut Wanichsan,et al.  Stopping Criterion Selection for Efficient Semi-supervised Time Series Classification , 2008, Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing.

[4]  B. Malek,et al.  Novel Shoulder-Surfing Resistant Haptic-based Graphical Password , 2006 .

[5]  Li Wei,et al.  Semi-supervised time series classification , 2006, KDD '06.

[6]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[7]  Ayhan Demiriz,et al.  Semi-Supervised Clustering Using Genetic Algorithms , 1999 .

[8]  Stefan C. Kremer,et al.  Clustering unlabeled data with SOMs improves classification of labeled real-world data , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[9]  Li Wei,et al.  Fast time series classification using numerosity reduction , 2006, ICML.

[10]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[11]  See-Kiong Ng,et al.  Positive Unlabeled Leaning for Time Series Classification , 2011, IJCAI.

[12]  Shi Zhong,et al.  Semi-Supervised Sequence Classification With Hmms , 2005, Int. J. Pattern Recognit. Artif. Intell..

[13]  Lars Schmidt-Thieme,et al.  Fast Classification of Electrocardiograph Signals via Instance Selection , 2011, 2011 IEEE First International Conference on Healthcare Informatics, Imaging and Systems Biology.

[14]  Matthias Seeger,et al.  Learning from Labeled and Unlabeled Data , 2010, Encyclopedia of Machine Learning.

[15]  R. Manmatha,et al.  Word image matching using dynamic time warping , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[16]  Alexandros Nanopoulos,et al.  Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data , 2010, J. Mach. Learn. Res..

[17]  Michael H. F. Wilkinson,et al.  Automatic diatom identification using contour analysis by morphological curvature scale spaces , 2005, Machine Vision and Applications.

[18]  Sadaaki Miyamoto,et al.  Semi-supervised agglomerative hierarchical clustering algorithms with pairwise constraints , 2010, International Conference on Fuzzy Systems.

[19]  Petra Perner,et al.  Advances in Data Mining , 2002, Lecture Notes in Computer Science.

[20]  Bernhard Sick,et al.  Signature Verification with Dynamic RBF Networks and Time Series Motifs , 2006 .

[21]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[22]  Hui Ding,et al.  Querying and mining of time series data: experimental comparison of representations and distance measures , 2008, Proc. VLDB Endow..

[23]  Günther Palm,et al.  On the Effects of Constraints in Semi-supervised Hierarchical Clustering , 2006, ANNPR.

[24]  Mohan Kumar,et al.  Using dynamic time warping for online temporal fusion in multisensor systems , 2008, Inf. Fusion.

[25]  David Yarowsky,et al.  Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[26]  Alexandros Nanopoulos,et al.  Time-Series Classification in Many Intrinsic Dimensions , 2010, SDM.