Online common change-point detection in a set of nonstationary categorical time series

Abstract Categorical sequences are widely used in various domains to describe the evolutionary state of the process under study. This article addresses the problem of behavioral change detection for multiple categorical time series. Relying on the sequential likelihood ratio test, an online change detection method is proposed based on the joint modeling of all the categorical sequences. To model the joint probability density, a nonhomogeneous Markov model is used. It allows modeling the transition dynamics over time and considering their dependence on some exogenous factors that may influence the behavior changes. An adaptive threshold is learned using Monte Carlo simulations to detect different changes and reduce false alarms. The performance of the proposed method is evaluated using two real-world and four synthetic datasets. It is compared with two state-of-the-art change detection methods, namely logistic regression and homogeneous Markov model. The experimentation using synthetic datasets highlights the proposed method’s effectiveness in terms of both the detection precision and the detection delay. The real-world data are issued from a water network and school-to-work transition. The analysis of the model estimated parameters allows us to characterize the detected changes in a real-world context.

[1]  Taskin Koçak,et al.  Smart Grid Technologies: Communication Technologies and Standards , 2011, IEEE Transactions on Industrial Informatics.

[2]  P. Holland,et al.  Robust regression using iteratively reweighted least-squares , 1977 .

[3]  A. Scott,et al.  A Cluster Analysis Method for Grouping Means in the Analysis of Variance , 1974 .

[4]  Vipin Kumar,et al.  Contextual Time Series Change Detection , 2013, SDM.

[5]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[6]  Ammar Belatreche,et al.  Adaptive Hidden Markov Model With Anomaly States for Price Manipulation Detection , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[7]  N. Cheifetz Détection et classification de signatures temporelles CAN pour l’aide à la maintenance de sous-systèmes d’un véhicule de transport collectif , 2013 .

[8]  G. Moustakides Optimal stopping times for detecting changes in distributions , 1986 .

[9]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[10]  Douglas C. Montgomery,et al.  Process monitoring for multiple count data using generalized linear model-based control charts , 2003 .

[11]  Masashi Sugiyama,et al.  Sequential change‐point detection based on direct density‐ratio estimation , 2012, Stat. Anal. Data Min..

[12]  Tao Chen,et al.  Change detection of electric customer behavior based on AMR measurements , 2015, 2015 IEEE Eindhoven PowerTech.

[13]  Emmanuel Yashchin,et al.  On Detection of Changes in Categorical Data , 2012 .

[14]  Jian Li,et al.  Directional change‐point detection for process control with multivariate categorical data , 2013 .

[15]  Michael Höhle,et al.  Online Change-Point Detection in Categorical Time Series , 2010 .

[16]  Niall M. Adams,et al.  Multiple changepoint detection in categorical data streams , 2019, Stat. Comput..

[17]  Etienne Côme,et al.  Analyzing year-to-year changes in public transport passenger behaviour using smart card data , 2017 .

[18]  Hua Zhang,et al.  A change detection framework by fusing threshold and clustering methods for optical medium resolution remote sensing images , 2019 .

[19]  A. Willsky,et al.  A generalized likelihood ratio approach to the detection and estimation of jumps in linear systems , 1976 .

[20]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[21]  William H. Woodall,et al.  Control Charts for Poisson Count Data with Varying Sample Sizes , 2010 .

[22]  Michael Anyadike-Danes,et al.  Predicting successful and unsuccessful transitions from school to work by using sequence methods , 2002 .

[23]  Haniza Yazid,et al.  Performance analysis of image thresholding: Otsu technique , 2018 .

[24]  M. Rosenblatt Remarks on Some Nonparametric Estimates of a Density Function , 1956 .

[25]  Jean-Philippe Vial,et al.  Theory and algorithms for linear optimization - an interior point approach , 1998, Wiley-Interscience series in discrete mathematics and optimization.

[26]  Hao Yu,et al.  Retrospective change detection in categorical time series , 2017 .

[27]  Lorenzo Bruzzone,et al.  Image thresholding based on the EM algorithm and the generalized Gaussian distribution , 2007, Pattern Recognit..

[28]  Zhen Liu,et al.  Efficient Bayesian analysis of multiple changepoint models with dependence across segments , 2009, Stat. Comput..

[29]  Michèle Basseville,et al.  Detection of abrupt changes , 1993 .

[30]  G. Lorden PROCEDURES FOR REACTING TO A CHANGE IN DISTRIBUTION , 1971 .

[31]  Dino Ienco,et al.  Change detection in categorical evolving data streams , 2014, SAC.

[32]  William H. Woodall,et al.  The Use of Control Charts in Health-Care and Public-Health Surveillance , 2006 .

[33]  Gilbert Ritschard,et al.  Analyzing and Visualizing State Sequences in R with TraMineR , 2011 .

[34]  Haeran Cho,et al.  Change-point detection in panel data via double CUSUM statistic , 2016, 1611.08631.