Compound Sequential Change Point Detection in Multiple Data Streams

We consider sequential change point detection in multiple data streams, where each stream has its own change point. Once a change point is detected for a data stream, this stream is deactivated permanently. The goal is to maximize the normal operation of the pre-change streams, while controlling the proportion of post-change streams among the active streams at all time points. This problem has wide applications in science, social science, and engineering. Taking a Bayesian formulation, we develop a compound sequential decision theory framework for this problem. Under this framework, an oracle procedure is proposed that is optimal among all sequential procedures which control the expected proportion of post-change streams at each time point. We also investigate the asymptotic behavior of the proposed method when the number of data streams grows large. Several non-standard technical tools involving partially ordered spaces and monotone coupling of stochastic processes are developed for proving the optimality result. Numerical examples are provided to illustrate the use and performance of the proposed method.

[1]  A. R. Crathorne,et al.  Economic Control of Quality of Manufactured Product. , 1933 .

[2]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[3]  H. Robbins An Empirical Bayes Approach to Statistics , 1956 .

[4]  T. E. Harris A lower bound for the critical probability in a certain percolation process , 1960, Mathematical Proceedings of the Cambridge Philosophical Society.

[5]  A. Shiryaev On Optimum Methods in Quickest Detection Problems , 1963 .

[6]  R. Bellman Dynamic programming. , 1957, Science.

[7]  S. W. Roberts A Comparison of Some Control Chart Procedures , 1966 .

[8]  G. Lorden PROCEDURES FOR REACTING TO A CHANGE IN DISTRIBUTION , 1971 .

[9]  T. Kamae,et al.  Stochastic Inequalities on Partially Ordered Spaces , 1977 .

[10]  D. A. Edwards On the existence of probability measures with given marginals , 1978 .

[11]  M. Pollak Optimal Detection of a Change in Distribution , 1985 .

[12]  H. Robbins Asymptotically Subminimax Solutions of Compound Statistical Decision Problems , 1985 .

[13]  G. Moustakides Optimal stopping times for detecting changes in distributions , 1986 .

[14]  J. Wellner,et al.  Empirical Processes with Applications to Statistics , 2009 .

[15]  M. Pollak Average Run Lengths of an Optimal Method of Detecting a Change in Distribution. , 1987 .

[16]  Jon A. Wellner,et al.  Empirical Processes with Applications to Statistics. , 1988 .

[17]  T. Lindvall Lectures on the Coupling Method , 1992 .

[18]  Nick Andrews,et al.  A Statistical Algorithm for the Early Detection of Outbreaks of Infectious Disease , 1996 .

[19]  T. Lindvall ON STRASSEN'S THEOREM ON STOCHASTIC DOMINATION , 1999 .

[20]  H. Thorisson Coupling, stationarity, and regeneration , 2000 .

[21]  Wim J. J. Veerkamp,et al.  Detection of Known Items in Adaptive Testing with a Statistical Quality Control Method , 2000 .

[22]  John D. Storey,et al.  Empirical Bayes Analysis of a Microarray Experiment , 2001 .

[23]  B. Efron Large-Scale Simultaneous Hypothesis Testing , 2004 .

[24]  Cun-Hui Zhang,et al.  Compound decision theory and empirical bayes methods , 2003 .

[25]  Bradley Efron,et al.  Microarrays, Empirical Bayes and the Two-Groups Model. Rejoinder. , 2008, 0808.0572.

[26]  M. Newton Large-Scale Simultaneous Hypothesis Testing: The Choice of a Null Hypothesis , 2008 .

[27]  V. Veeravalli,et al.  Asymptotically Optimal Quickest Change Detection in Distributed Sensor Systems , 2008 .

[28]  Ž. Pedišić,et al.  Handbook for Quantitative Methods , 2010 .

[29]  Y. Mei Efficient scalable schemes for monitoring a large number of data streams , 2010 .

[30]  Christian P. Robert,et al.  Large-scale inference , 2010 .

[31]  Aleksey S. Polunchenko,et al.  State-of-the-Art in Sequential Change-Point Detection , 2011, 1109.2938.

[32]  Hao Chen,et al.  Graph-based change-point detection , 2012, 1209.1625.

[33]  David Siegmund,et al.  Sequential multi-sensor change-point detection , 2013, 2013 Information Theory and Applications Workshop (ITA).

[34]  Jay Bartroff,et al.  Multiple Hypothesis Tests Controlling Generalized Error Rates for Sequential Data , 2014, 1406.5933.

[35]  Hock Peng Chan,et al.  Optimal sequential detection in multi-stream data , 2015, 1506.08504.

[36]  W. D. Linden,et al.  Bayesian Checks on Cheating on Tests , 2015, Psychometrika.

[37]  Xiaodong Wang,et al.  Quickest Detection of False Data Injection Attack in Wide-Area Smart Grids , 2015, IEEE Transactions on Smart Grid.

[38]  Gregory J. Cizek,et al.  Handbook of Quantitative Methods for Detecting Cheating on Tests , 2016 .

[39]  Hua-Hua Chang,et al.  Sequential Detection of Compromised Items Using Response Times in Computerized Adaptive Testing , 2018, Psychometrika.

[40]  Hao Chen,et al.  Sequential change-point detection based on nearest neighbors , 2016, The Annals of Statistics.

[41]  Georgios Fellouris,et al.  Sequential multiple testing with generalized error control: An asymptotic optimality theory , 2016, The Annals of Statistics.