Developments in the roles, features, and evaluation of alerting algorithms for disease outbreak monitoring

utomated systems for public health surveillance have evolved over the past several years as national and local institutions have been learning the most effective ways to share and apply population health information for outbreak investigation and tracking. The changes have included developments in algorithmic alerting methodology. This article presents research efforts at The Johns Hopkins University Applied Physics Laboratory for advancing this methodology. The analytic methods presented cover outcome variable selection, background estimation, determination of anomalies for alerting, and practical evaluation of detection performance. The methods and measures are adapted from information theory, signal processing, financial forecasting, and radar engineering for effective use in the biosurveillance data environment. Examples are restricted to univariate algorithms for daily time series of syndromic data, with discussion of future generalization and enhancement.

[1]  Alyson G. Wilson,et al.  Statistical methods in counterterrorism : game theory, modeling, syndromic surveillance, and biometric authentication , 2006 .

[2]  A. Kraskov,et al.  Estimating mutual information. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Chris Chatfield,et al.  Holt‐Winters Forecasting: Some Practical Issues , 1988 .

[4]  Averill M. Law,et al.  Simulation Modeling and Analysis , 1982 .

[5]  Amir-Homayoon Najmi,et al.  An adaptive prediction and detection algorithm for multistream syndromic surveillance , 2005, BMC Medical Informatics Decis. Mak..

[6]  Mutual information and electron correlation in momentum space. , 2006, The Journal of chemical physics.

[7]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[8]  C. Müller,et al.  Large-scale clustering of cDNA-fingerprinting data. , 1999, Genome research.

[9]  H. Burkom Development, adaptation, and assessment of alerting algorithms for biosurveillance , 2003 .

[10]  A. Dobson,et al.  The application of statistical process control charts to the detection and monitoring of hospital-acquired infections. , 2001, Journal of quality in clinical practice.

[11]  David L. Buckeridge,et al.  Obtaining the Data , 2007 .

[12]  Carsten O. Daub,et al.  Estimating mutual information using B-spline functions – an improved similarity measure for analysing gene expression data , 2004, BMC Bioinformatics.

[13]  L M Pecora,et al.  Assessing spatial coupling in complex population dynamics using mutual prediction and continuity statistics. , 2005, Theoretical population biology.

[14]  David L. Buckeridge,et al.  Alerting Algorithms for Biosurveillance , 2007 .

[15]  Kathy J Hurt-Mullen,et al.  Syndromic surveillance on the epidemiologist's desktop: making sense of much data. , 2005, MMWR supplements.

[16]  Oded Maimon,et al.  Evaluation of gene-expression clustering via mutual information distance measure , 2007, BMC Bioinformatics.

[17]  Russ P Lopez Disease Surveillance: A Public Health Informatics Approach , 2007 .

[18]  Tom Burr,et al.  Modeling emergency department visit patterns for infectious disease complaints: results and application to disease surveillance , 2005, BMC Medical Informatics Decis. Mak..

[19]  L. Hutwagner,et al.  The bioterrorism preparedness and response Early Aberration Reporting System (EARS) , 2003, Journal of Urban Health.

[20]  S. Magruder,et al.  Progress in understanding and using over-the-counter pharmaceuticals for syndromic surveillance. , 2004, MMWR supplements.

[21]  Minitab Statistical Methods for Quality Improvement , 2001 .

[22]  Anne B. Koehler,et al.  Forecasting models and prediction intervals for the multiplicative Holt-Winters method , 2001 .

[23]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[24]  Bassem Mahafza,et al.  Radar Systems Analysis and Design Using MATLAB , 2000 .

[25]  D. W. Scott On optimal and data based histograms , 1979 .

[26]  Namhee Kim,et al.  Modeling and detection of respiratory-related outbreak signatures , 2007, BMC Medical Informatics Decis. Mak..

[27]  Galit Shmueli,et al.  Automated time series forecasting for biosurveillance , 2007, Statistics in medicine.

[28]  Herbert A. Sturges,et al.  The Choice of a Class Interval , 1926 .

[29]  G S Michaels,et al.  Cluster analysis and data visualization of large-scale gene expression data. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[30]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[31]  C. Chatfield,et al.  Prediction intervals for multiplicative Holt-Winters , 1991 .

[32]  T. Moon,et al.  Mathematical Methods and Algorithms for Signal Processing , 1999 .

[33]  Peter R. Winters,et al.  Forecasting Sales by Exponentially Weighted Moving Averages , 1960 .

[34]  Joseph S. Lombardo,et al.  Understanding the Data: Health Indicators in Disease Surveillance , 2006 .

[35]  Don McNicol,et al.  A Primer of Signal Detection Theory , 1976 .

[36]  Fraser,et al.  Independent coordinates for strange attractors from mutual information. , 1986, Physical review. A, General physics.

[37]  Ian Witten,et al.  Data Mining , 2000 .

[38]  Moon,et al.  Estimation of mutual information using kernel density estimators. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[39]  Carsten O. Daub,et al.  The mutual information: Detecting and evaluating dependencies between variables , 2002, ECCB.

[40]  Stephen E. Fienberg,et al.  Current and Potential Statistical Methods for Monitoring Multiple Data Streams for Biosurveillance , 2006 .

[41]  I. Kohane,et al.  An Epidemiological Network Model for Disease Outbreak Detection , 2007, PLoS medicine.

[42]  Colleen A Bradley,et al.  BioSense: implementation of a National Early Event Detection and Situational Awareness System. , 2005, MMWR supplements.