View-Adaptive Weighted Deep Transfer Learning for Distributed Time-Series Classification

In this paper, we propose an effective, multi-view, deep, transfer learning framework for multivariate time-series data. Though widely used for tasks such as computer vision, the application of transfer learning to time-series classification problems (e.g., classification of light curves) is underexplored. The proposed framework makes several important contributions to facilitate knowledge sharing, while simultaneously ensuring an effective solution for domain specific fine-level categorizations. First, in contrast to the traditional approaches, the proposed framework describes pairwise view similarity by identifying a smaller subset of source-view samples that closely resemble the target data patterns. Second, by means of two-phase learning, a generic baseline model is learned on a larger source data collection and later fine-tuned on a smaller target data collection, precisely approximating the target data patterns. Third, an effective view-adaptive timestamp weighting scheme evaluates the relative importance of each timestamp in a more data-driven manner, which enables a more flexible yet discriminative feature representation scheme in the presence of evolving data characteristics. As shown by experiments, compared to the existing approaches, our proposed deep transfer learning framework improves classification performance by around 2-3% in the UCI multi-view activity recognition dataset, while also showing a robust, generalized representation capacity in classifying several large-scale multi-view light curve collections.

[1]  Jason Lines,et al.  Time series classification with ensembles of elastic distance measures , 2015, Data Mining and Knowledge Discovery.

[2]  Michael I. Jordan,et al.  Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.

[3]  Amy Loutfi,et al.  A review of unsupervised feature learning and deep learning for time-series modeling , 2014, Pattern Recognit. Lett..

[4]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[5]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[6]  A. J. Drake,et al.  The Catalina Surveys Southern periodic variable star catalogue , 2017 .

[7]  Billur Barshan,et al.  Recognizing Daily and Sports Activities in Two Open Source Machine Learning Environments Using Body-Worn Sensor Units , 2014, Comput. J..

[8]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[9]  Gabriela Csurka,et al.  Domain Adaptation for Visual Applications: A Comprehensive Survey , 2017, ArXiv.

[10]  Ashit Talukder,et al.  Active learning based news veracity detection with feature weighting and deep-shallow fusion , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[11]  Ying Wah Teh,et al.  Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: State of the art and research challenges , 2018, Expert Syst. Appl..

[12]  Chotirat Ratanamahatana,et al.  Robust and Accurate Anomaly Detection in ECG Artifacts Using Time Series Motif Discovery , 2015, Comput. Math. Methods Medicine.

[13]  Yu Cheng,et al.  Boosting Deep Learning Risk Prediction with Generative Adversarial Networks for Electronic Health Records , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[14]  Jorma Rissanen,et al.  The Minimum Description Length Principle in Coding and Modeling , 1998, IEEE Trans. Inf. Theory.

[15]  Luis Gravano,et al.  k-Shape: Efficient and Accurate Clustering of Time Series , 2015, SIGMOD Conference.

[16]  Arno Siebes,et al.  StreamKrimp: Detecting Change in Data Streams , 2008, ECML/PKDD.

[17]  P. Grünwald The Minimum Description Length Principle (Adaptive Computation and Machine Learning) , 2007 .

[18]  Junsong Yuan,et al.  Context-aware graph-based analysis for detecting anomalous activities , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[19]  Cyrus Shahabi,et al.  Feature subset selection and feature ranking for multivariate time series , 2005, IEEE Transactions on Knowledge and Data Engineering.

[20]  T. A. Lister,et al.  Gaia Data Release 2. Summary of the contents and survey properties , 2018, 1804.09365.

[21]  William J. Tolone,et al.  Identifying malicious social media contents using multi-view Context-Aware active learning , 2019, Future Gener. Comput. Syst..

[22]  Wenyu Zhang,et al.  Multivariate Time Series Classification Using Dynamic Time Warping Template Selection for Human Activity Recognition , 2015, 2015 IEEE Symposium Series on Computational Intelligence.

[23]  Alexandros Karatzoglou,et al.  Towards a universal neural network encoder for time series , 2018, CCIA.

[24]  Isaac Cho,et al.  VIFI: Virtual information fabric infrastructure for data-driven discoveries from distributed earth science data , 2017, 2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI).

[25]  Bernhard Pfahringer,et al.  Compression-Based Discretization of Continuous Attributes , 1995, ICML.

[26]  Shiguang Shan,et al.  Multi-View Discriminant Analysis , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Razvan Pascanu,et al.  How to Construct Deep Recurrent Neural Networks , 2013, ICLR.

[28]  Akshay Pai,et al.  Deep-learnt classification of light curves , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[29]  C. Shahabi,et al.  CLeVer : a Feature Subset Selection Technique for Multivariate Time Series ? ( Full Version ) , 2005 .

[30]  Tim Oates,et al.  Time series classification from scratch with deep neural networks: A strong baseline , 2016, 2017 International Joint Conference on Neural Networks (IJCNN).

[31]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[32]  Ashit Talukder,et al.  Graph clustering for weapon discharge event detection and tracking in infrared imagery using deep features , 2017, Defense + Security.

[33]  Tom Drummond,et al.  Learning Factorized Representations for Open-set Domain Adaptation , 2018, ICLR.

[34]  Sylvie Gibet,et al.  On Recursive Edit Distance Kernels With Application to Time Series Classification , 2010, IEEE Transactions on Neural Networks and Learning Systems.

[35]  Ludovic Denoyer,et al.  Spatio-Temporal Neural Networks for Space-Time Series Forecasting and Relations Discovery , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[36]  Ashit Talukder,et al.  Identifying extremism in social media with multi-view context-aware subset optimization , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[37]  Yun Fu,et al.  Multi-View Time Series Classification: A Discriminative Bilinear Projection Approach , 2016, CIKM.

[38]  Ernest E. Croner,et al.  The Palomar Transient Factory: System Overview, Performance, and First Results , 2009, 0906.5350.

[39]  Germain Forestier,et al.  Data augmentation using synthetic data for time series classification with deep residual networks , 2018, ArXiv.