Explainable Unsupervised Change-point Detection via Graph Neural Networks

Change-point detection (CPD) aims at detecting the abrupt property changes lying behind time series data. The property changes in a multivariate time series often result from highly entangled reasons, ranging from independent changes of variables to correlation changes between variables. Learning to uncover the reasons behind the changes in an unsupervised setting is a new and challenging task. Previous CPD methods usually detect change-points by a divergence estimation of statistical features, without delving into the reasons behind the detected changes. In this paper, we propose a correlation-aware dynamics model which separately predicts the correlation change and independent change by incorporating graph neural networks into the encoder-decoder framework. Through experiments on synthetic and real-world datasets, we demonstrate the enhanced performance of our model on the CPD tasks as well as its ability to interpret the nature and degree of the predicted changes.

[1]  Nathaniel Virgo,et al.  Permutation-equivariant neural networks applied to dynamics prediction , 2016, ArXiv.

[2]  Yann LeCun,et al.  GLoMo: Unsupervised Learning of Transferable Relational Graphs , 2018, NeurIPS.

[3]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[4]  Rob Fergus,et al.  Learning Multiagent Communication with Backpropagation , 2016, NIPS.

[5]  Deborah Estrin,et al.  Using mobile phones to determine transportation modes , 2010, TOSN.

[6]  Kristian Kersting,et al.  Stochastic Online Anomaly Analysis for Streaming Time Series , 2017, IJCAI.

[7]  Chris D. Nugent,et al.  Evaluation of Prompted Annotation of Activity Data Recorded from a Smart Phone , 2014, Sensors.

[8]  Ken-ichi Kawarabayashi,et al.  What Can Neural Networks Reason About? , 2019, ICLR.

[9]  J. Hartigan,et al.  A Bayesian Analysis for Change Point Problems , 1993 .

[10]  Taehoon Kim,et al.  Reading Documents for Bayesian Online Change Point Detection , 2015, EMNLP.

[11]  Bulent Yener,et al.  Deep density ratio estimation for change point detection , 2019, ArXiv.

[12]  Wenhu Chen,et al.  Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting , 2019, NeurIPS.

[13]  Yisong Yue,et al.  Long-term Forecasting using Tensor-Train RNNs , 2017, ArXiv.

[14]  Ryan P. Adams,et al.  Bayesian Online Changepoint Detection , 2007, 0710.3742.

[15]  Carl E. Rasmussen,et al.  Gaussian Process Change Point Models , 2010, ICML.

[16]  Masashi Sugiyama,et al.  Change-Point Detection in Time-Series Data by Relative Density-Ratio Estimation , 2011 .

[17]  Didier Stricker,et al.  Introducing a New Benchmarked Dataset for Activity Monitoring , 2012, 2012 16th International Symposium on Wearable Computers.

[18]  Kevin P. Murphy,et al.  Modeling changing dependency structure in multivariate time series , 2007, ICML '07.

[19]  Xing Xie,et al.  Understanding transportation modes based on GPS data for web applications , 2010, TWEB.

[20]  Le Song,et al.  M-Statistic for Kernel Change-Point Detection , 2015, NIPS.

[21]  Yiming Yang,et al.  Kernel Change-point Detection with Auxiliary Deep Generative Models , 2019, ICLR.

[22]  Ben Poole,et al.  Categorical Reparameterization with Gumbel-Softmax , 2016, ICLR.

[23]  Edit Gombay,et al.  ESTIMATORS AND TESTS FOR CHANGE IN VARIANCES , 1996 .

[24]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[25]  Max Welling,et al.  Attention, Learn to Solve Routing Problems! , 2018, ICLR.

[26]  Min Zheng,et al.  Pyramid Recurrent Neural Networks for Multi-Scale Change-Point Detection , 2018 .

[27]  Bernhard Schölkopf,et al.  A Kernel Two-Sample Test , 2012, J. Mach. Learn. Res..

[28]  Steven Reece,et al.  Sequential Bayesian Prediction in the Presence of Changepoints and Faults , 2010, Comput. J..

[29]  Yoshinobu Kawahara,et al.  Change-Point Detection in Time-Series Data Based on Subspace Identification , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[30]  V. Moskvina,et al.  Application of the singular spectrum analysis for change-point detection in time series , 2006 .

[31]  Melvin J. Hinich,et al.  Time Series Analysis by State Space Methods , 2001 .

[32]  Jure Leskovec,et al.  How Powerful are Graph Neural Networks? , 2018, ICLR.

[33]  Xing Xie,et al.  Learning transportation mode from raw gps data for geographic applications on the web , 2008, WWW.

[34]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[35]  R. Zemel,et al.  Neural Relational Inference for Interacting Systems , 2018, ICML.

[36]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[37]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.