Causality in Scale Space as an Approach to Change Detection

Kernel density estimation and kernel regression are useful ways to visualize and assess the structure of data. Using these techniques we define a temporal scale space as the vector space spanned by bandwidth and a temporal variable. In this space significance regions that reflect a significant derivative in the kernel smooth similar to those of SiZer (Significant Zero-crossings of derivatives) are indicated. Significance regions are established by hypothesis tests for significant gradient at every point in scale space. Causality is imposed onto the space by restricting to kernels with left-bounded or finite support and shifting kernels forward. We show that these adjustments to the methodology enable early detection of changes in time series constituting live surveillance systems of either count data or unevenly sampled measurements. Warning delays are comparable to standard techniques though comparison shows that other techniques may be better suited for single-scale problems. Our method reliably detects change points even with little to no knowledge about the relevant scale of the problem. Hence the technique will be applicable for a large variety of sources without tailoring. Furthermore this technique enables us to obtain a retrospective reliable interval estimate of the time of a change point rather than a point estimate. We apply the technique to disease outbreak detection based on laboratory confirmed cases for pertussis and influenza as well as blood glucose concentration obtained from patients with diabetes type 1.

[1]  Jie Chen,et al.  Change-point analysis as a tool to detect abrupt climate variations , 2012, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[2]  Peter J. Diggle,et al.  Point process methodology for on‐line spatio‐temporal disease surveillance , 2005 .

[3]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[4]  M. Cule,et al.  Maximum likelihood estimation of a multi‐dimensional log‐concave density , 2008, 0804.3989.

[5]  J BELLIKA,et al.  ropagation of program control : A tool for istributed disease surveillance ohan , 2007 .

[6]  M. C. Jones,et al.  A reliable data-based bandwidth selection method for kernel density estimation , 1991 .

[7]  Chandra Erdman,et al.  bcp: An R Package for Performing a Bayesian Analysis of Change Point Problems , 2007 .

[8]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[9]  Mark C. W. van Rossum,et al.  A Novel Spike Distance , 2001, Neural Computation.

[10]  Leonhard Held,et al.  Spatio‐temporal disease mapping using INLA , 2011 .

[11]  J. Marron,et al.  SCALE SPACE VIEW OF CURVE ESTIMATION , 2000 .

[12]  J. Marron,et al.  SiZer for Exploration of Structures in Curves , 1999 .

[13]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[14]  S. Panchapakesan,et al.  Inference about the Change-Point in a Sequence of Random Variables: A Selection Approach , 1988 .

[15]  James Stephen Marron,et al.  Advanced Distribution Theory for SiZer , 2006 .

[16]  Gunnar Hartvigsen,et al.  Mobile phone-based pattern recognition and data analysis for patients with type 1 diabetes. , 2012, Diabetes technology & therapeutics.

[17]  T. Geisel,et al.  Natural human mobility patterns and spatial spread of infectious diseases , 2011, 1103.6224.

[18]  Daniel B. Neill,et al.  Fast subset scan for spatial pattern detection , 2012 .

[19]  Andrew P. Witkin,et al.  Uniqueness of the Gaussian Kernel for Scale-Space Filtering , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[21]  I. S. Gradshteyn,et al.  Table of Integrals, Series, and Products , 1976 .

[22]  Y. Mei Sequential change-point detection when unknown parameters are present in the pre-change distribution , 2006, math/0605322.

[23]  T. Lai Sequential changepoint detection in quality control and dynamical systems , 1995 .

[24]  J. Hartigan,et al.  A Bayesian Analysis for Change Point Problems , 1993 .

[25]  Matthew P. Wand,et al.  Kernel Smoothing , 1995 .

[26]  P. Fearnhead,et al.  On‐line inference for multiple changepoint problems , 2007 .

[27]  David V. Hinkley,et al.  Inference about the change-point in a sequence of binomial variables , 1970 .

[28]  W. A. Shewhart,et al.  The Application of Statistics as an Aid in Maintaining Quality of a Manufactured Product , 1925 .

[29]  B. Roehr Whooping cough outbreak hits several US states , 2010, BMJ : British Medical Journal.

[30]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[31]  Stefan Edlund,et al.  Comparing three basic models for seasonal influenza. , 2011, Epidemics.

[32]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[33]  Mark Yuying An,et al.  Logconcavity versus Logconvexity: A Complete Characterization , 1998 .

[34]  Hongjoong Kim,et al.  A novel approach to detection of intrusions in computer networks via adaptive sequential and batch-sequential change-point detection methods , 2006, IEEE Transactions on Signal Processing.