E-detectors: a nonparametric framework for online changepoint detection

Sequential changepoint detection is a classical problem with a variety of applications. However, the majority of prior work has been parametric, for example, focusing on exponential families. We develop a fundamentally new and general framework for changepoint detection when the preand post-change distributions are nonparametrically specified (and thus composite). Our procedures come with clean, nonasymptotic bounds on the average run length (frequency of false alarms). In certain nonparametric cases (like sub-Gaussian or sub-exponential), we also provide near-optimal bounds on the detection delay following a changepoint. The primary technical tool that we introduce is called an e-detector, which is composed of sums of e-processes—a fundamental generalization of nonnegative supermartingales— that are started at consecutive times. We first introduce simple Shiryaev-Roberts and CUSUM-style e-detectors, and then show how to design their mixtures in order to achieve both statistical and computational efficiency. We demonstrate their efficacy in detecting changes in the mean of a bounded random variable without any i.i.d. assumptions, with an application to tracking the performance of a sports team over multiple seasons.

[1]  Wouter M. Koolen,et al.  Testing exchangeability: Fork-convexity, supermartingales and e-processes , 2021, Int. J. Approx. Reason..

[2]  Alessandro Rinaldo,et al.  Nonparametric Iterated-Logarithm Extensions of the Sequential Generalized Likelihood Ratio Test , 2020, IEEE Journal on Selected Areas in Information Theory.

[3]  Wouter M. Koolen,et al.  Admissible anytime-valid sequential inference must rely on nonnegative martingales. , 2020, 2009.03167.

[4]  Yudong Chen,et al.  High‐dimensional, multiscale online changepoint detection , 2020, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[5]  L. Wasserman,et al.  Universal inference , 2019, Proceedings of the National Academy of Sciences.

[6]  V. Vovk Testing Randomness Online , 2019, Statistical Science.

[7]  Vladimir Vovk,et al.  Game‐Theoretic Foundations for Probability and Finance , 2019, Wiley Series in Probability and Statistics.

[8]  Jon D. McAuliffe,et al.  Time-uniform, nonparametric, nonasymptotic confidence sequences , 2018, The Annals of Statistics.

[9]  Jon D. McAuliffe,et al.  Time-uniform Chernoff bounds via nonnegative supermartingales , 2018, Probability Surveys.

[10]  Donald F. Towsley,et al.  Quickest Detection of Dynamic Events in Networks , 2018, IEEE Transactions on Information Theory.

[11]  Le Song,et al.  M-Statistic for Kernel Change-Point Detection , 2015, NIPS.

[12]  A. Tartakovsky Nearly optimal sequential tests of composite hypotheses revisited , 2014 .

[13]  M. Basseville,et al.  Sequential Analysis: Hypothesis Testing and Changepoint Detection , 2014 .

[14]  Xiequan Fan,et al.  Exponential inequalities for martingales with applications , 2013, 1311.6273.

[15]  D. Siegmund,et al.  Sequential multi-sensor change-point detection , 2012, 2013 Information Theory and Applications Workshop (ITA).

[16]  Y. Mei Efficient scalable schemes for monitoring a large number of data streams , 2010 .

[17]  Moshe Pollak,et al.  Third-order Asymptotic Optimality of the Generalized Shiryaev-Roberts Changepoint Detection Procedures , 2010, 1005.1129.

[18]  Alexander G. Tartakovsky,et al.  On optimality of the Shiryaev-Roberts procedure for detecting a change in distribution , 2009, 0904.3370.

[19]  H. Vincent Poor,et al.  One Shot Schemes for Decentralized Quickest Change Detection , 2008, IEEE Transactions on Information Theory.

[20]  G. Moustakides SEQUENTIAL CHANGE DETECTION REVISITED , 2008, 0804.0741.

[21]  M. Davy,et al.  An online kernel change detection algorithm , 2005, IEEE Transactions on Signal Processing.

[22]  Tze Leung Lai,et al.  Information Bounds and Quick Detection of Parameter Changes in Stochastic Systems , 1998, IEEE Trans. Inf. Theory.

[23]  T. Lai Sequential changepoint detection in quality control and dynamical systems , 1995 .

[24]  D. Siegmund,et al.  Using the Generalized Likelihood Ratio Statistic for Sequential Detection of a Change-Point , 1995 .

[25]  R. Durrett Probability: Theory and Examples , 1993 .

[26]  Y. Ritov Decision Theoretic Optimality of the Cusum Procedure , 1990 .

[27]  G. Moustakides Optimal stopping times for detecting changes in distributions , 1986 .

[28]  M. Pollak Optimal Detection of a Change in Distribution , 1985 .

[29]  A. Willsky,et al.  A generalized likelihood ratio approach to the detection and estimation of jumps in linear systems , 1976 .

[30]  D. Siegmund,et al.  APPROXIMATIONS TO THE EXPECTED SAMPLE SIZE OF CERTAIN SEQUENTIAL TESTS , 1975 .

[31]  G. Lorden PROCEDURES FOR REACTING TO A CHANGE IN DISTRIBUTION , 1971 .

[32]  G. Lorden On Excess Over the Boundary , 1970 .

[33]  S. W. Roberts A Comparison of Some Control Chart Procedures , 1966 .

[34]  G. Barnard Control Charts and Stochastic Processes , 1959 .

[35]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[36]  Akimichi Takemura,et al.  Non-asymptotic analysis of a new bandit algorithm for semi-bounded rewards , 2015, J. Mach. Learn. Res..

[37]  Y. Mei Efficient scalable schemes for monitoring a large number of data streams , 2010 .

[38]  A. Takemura,et al.  An Asymptotically Optimal Bandit Algorithm for Bounded Support Models. , 2010, COLT 2010.

[39]  W. Gasarch,et al.  The Book Review Column 1 Coverage Untyped Systems Simple Types Recursive Types Higher-order Systems General Impression 3 Organization, and Contents of the Book , 2022 .

[40]  A. Shiryaev On Optimum Methods in Quickest Detection Problems , 1963 .

[41]  Jean-Luc Ville Étude critique de la notion de collectif , 1939 .