Optimal detection of a jump in the intensity of a Poisson process or in a density with likelihood ratio statistics

We consider the problem of detecting a ‘bump’ in the intensity of a Poisson process or in a density. We analyze two types of likelihood ratio-based statistics, which allow for exact finite sample inference and asymptotically optimal detection: The maximum of the penalized square root of log likelihood ratios (‘penalized scan’) evaluated over a certain sparse set of intervals and a certain average of log likelihood ratios (‘condensed average likelihood ratio’). We show that penalizing the square root of the log likelihood ratio — rather than the log likelihood ratio itself — leads to a simple penalty term that yields optimal power. The thus derived penalty may prove useful for other problems that involve a Brownian bridge in the limit. The second key tool is an approximating set of intervals that is rich enough to allow for optimal detection, but which is also sparse enough to allow justifying the validity of the penalization scheme simply via the union bound. This results in a considerable simplification in the theoretical treatment compared with the usual approach for this type of penalization technique, which requires establishing an exponential inequality for the variation of the test statistic. Another advantage of using the sparse approximating set is that it allows fast computation in nearly linear time. We present a simulation study that illustrates the superior performance of the penalized scan and of the condensed average likelihood ratio compared with the standard scan statistic.

[1]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[2]  Hock Peng Chan,et al.  Detection of spatial clustering with average likelihood ratio test statistics , 2009, 0911.3769.

[3]  Daniel B. Neill,et al.  Expectation-based scan statistics for monitoring spatial time series data , 2009 .

[4]  G. Sawitzki,et al.  Excess Mass Estimates and Tests for Multimodality , 1991 .

[5]  C. Loader Large-deviation approximations to the distribution of scan statistics , 1991, Advances in Applied Probability.

[6]  J. Wellner,et al.  Empirical Processes with Applications to Statistics , 2009 .

[7]  Lutz Dümbgen,et al.  New goodness-of-fit tests and their application to nonparametric confidence sets , 1998 .

[8]  Guenther Walther,et al.  The Block Criterion for Multiscale Inference About a Density, With Applications to Other Multiscale Problems , 2010 .

[9]  G. Walther Optimal and fast detection of spatial clusters with scan statistics , 2010, 1002.4770.

[10]  Pemetaan Jumlah Balita,et al.  Spatial Scan Statistic , 2014, Encyclopedia of Social Network Analysis and Mining.

[11]  Andrew W. Moore,et al.  A Fast Multi-Resolution Method for Detection of Significant Spatial Disease Clusters , 2003, NIPS.

[12]  Daniel B Neill,et al.  An empirical comparison of spatial scan statistics for outbreak detection , 2009, International journal of health geographics.

[13]  D. W. Scott,et al.  The Mode Tree: A Tool for Visualization of Nonparametric Density Features , 1993 .

[14]  W. Polonik Measuring Mass Concentrations and Estimating Density Contour Clusters-An Excess Mass Approach , 1995 .

[15]  I. Good,et al.  Density Estimation and Bump-Hunting by the Penalized Likelihood Method Exemplified by Scattering and Meteorite Data , 1980 .

[16]  H. Chan,et al.  Detection with the scan and the average likelihood ratio , 2011, 1107.4344.

[17]  Xiaoming Huo,et al.  Near-optimal detection of geometric objects by fast multiscale methods , 2005, IEEE Transactions on Information Theory.

[18]  J. Hartigan,et al.  The Dip Test of Unimodality , 1985 .

[19]  S. R. Jammalamadaka,et al.  Scan Statistics and Applications , 2000 .

[20]  V. Spokoiny,et al.  Multiscale testing of qualitative hypotheses , 2001 .

[21]  W. Verstraeten,et al.  Relating increasing hantavirus incidences to the changing climate: the mast connection , 2009, International journal of health geographics.

[22]  L. Duembgen,et al.  Multiscale inference about a density , 2007, 0706.3968.

[23]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .