Single and Multiple Change-Point Detection with Differential Privacy

The change-point detection problem seeks to identify a distributional change at an unknown change-point k∗ in a stream of data. This problem appears in many important practical settings involving personal data, including biosurveillance, fault detection, finance, signal detection, and security systems. The field of differential privacy offers data analysis tools that provide strong worst-case privacy guarantees. We study the statistical problem of change-point detection through the lens of differential privacy. We give private algorithms for both online and offline change-point detection, analyze these algorithms theoretically, and validate the results empirically.
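As a rough illustration of what private offline change-point detection can look like, the sketch below adds Laplace noise to the log-likelihood ratio of every candidate change-point and reports only the index of the largest noisy score (the "report noisy max" pattern). It is a minimal sketch under stated assumptions and is not claimed to be the algorithm analyzed in this work: the Bernoulli pre- and post-change distributions, the function names `laplace` and `private_change_point`, and the conservative noise scale of 2·sensitivity/ε are all choices made for this example.

```python
import math
import random


def laplace(scale, rng):
    # The difference of two i.i.d. Exp(1) draws is Laplace(0, 1);
    # multiplying by `scale` gives Laplace(0, scale).
    return scale * (rng.expovariate(1.0) - rng.expovariate(1.0))


def private_change_point(xs, p0, p1, epsilon, rng):
    """Hypothetical sketch: noisy-argmax change-point estimate for 0/1 data.

    Assumes xs[i] ~ Bernoulli(p0) before the change and Bernoulli(p1) after,
    with p0 and p1 known. Returns the 0-based index of the first
    post-change sample according to the noisy scores.
    """
    # Per-sample log-likelihood ratios log(P1(x) / P0(x)) for x = 1 and x = 0.
    llr_one = math.log(p1 / p0)
    llr_zero = math.log((1.0 - p1) / (1.0 - p0))

    # Changing one record changes each suffix sum by at most this amount.
    sensitivity = abs(llr_one - llr_zero)

    # Assumption: a conservative noise scale of 2 * sensitivity / epsilon.
    scale = 2.0 * sensitivity / epsilon

    best_k, best_score = None, -math.inf
    suffix = 0.0
    # Walk backwards so suffix holds the sum of ratios over xs[k:].
    for k in range(len(xs) - 1, -1, -1):
        suffix += llr_one if xs[k] == 1 else llr_zero
        noisy = suffix + laplace(scale, rng)
        if noisy > best_score:
            best_k, best_score = k, noisy

    # Only the index is released, never the noisy scores themselves.
    return best_k


if __name__ == "__main__":
    rng = random.Random(0)
    data = [0] * 200 + [1] * 200  # toy stream with a change at index 200
    print(private_change_point(data, p0=0.1, p1=0.9, epsilon=5.0, rng=rng))
```

Smaller values of ε increase the noise scale, so the estimate can land farther from the true change-point; the toy data here is deliberately well separated to keep that effect small.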
