Benchmark data and power calculations for evaluating disease outbreak detection methods.

INTRODUCTION Early detection of disease outbreaks enables public health officials to implement immediate disease control and prevention measures. Computer-based syndromic surveillance systems are being implemented to complement reporting by physicians and other health-care professionals to improve the timeliness of disease-outbreak detection. Space-time disease-surveillance methods have been proposed as a supplement to purely temporal statistical methods for outbreak detection to detect localized outbreaks before they spread to larger regions. OBJECTIVE The aims of this study were twofold: 1) to design and make available benchmark data sets for evaluating the statistical power of space-time early detection methods and 2) to evaluate the power of the prospective purely temporal and space-time scan statistics by applying them to the benchmark data sets at different parameter settings. METHODS Simulated data sets based on the geography and population of New York City were created, including effects of outbreaks of varying size and location. Data sets with no outbreak effects were also created. Scan statistics were then run on these data sets, and the resulting power performances were analyzed and compared. RESULTS The prospective space-time scan statistic performs well for a spectrum of outbreak models. By comparison, the prospective purely temporal scan statistic has higher power for detecting citywide outbreaks but lower power for detecting geographically localized outbreaks. CONCLUSIONS The benchmark data sets created for this study can be used successfully for formal statistical power evaluations and comparisons. If an anomaly caused by an outbreak is local, purely temporal surveillance methods might be unable to detect it, in which case space-time methods would be necessary for early detection.

[1]  Peter A. Rogerson,et al.  Monitoring point patterns for the development of space–time clusters , 2001 .

[2]  M. Kulldorff,et al.  Dead Bird Clusters as an Early Warning System for West Nile Virus Activity , 2003, Emerging infectious diseases.

[3]  F. Mostashari,et al.  Monitoring over-the-counter medication sales for early detection of disease outbreaks--New York City. , 2005, MMWR supplements.

[4]  P A Rogerson,et al.  Surveillance systems for monitoring the development of spatial patterns. , 1997, Statistics in medicine.

[5]  Andrew W. Moore,et al.  Data, network, and application: technical description of the Utah RODS Winter Olympic Biosurveillance System , 2002, AMIA.

[6]  S Hashimoto,et al.  Detection of epidemics in their early stage through infectious disease surveillance. , 2000, International journal of epidemiology.

[7]  Y. Elbert,et al.  Disease outbreak detection system using syndromic data in the greater Washington DC area. , 2002, American journal of preventive medicine.

[8]  A Nelson,et al.  National Bioterrorism Syndromic Surveillance Demonstration Program. , 2004, MMWR supplements.

[9]  Constandinos N. Theophilides,et al.  Identifying West Nile virus risk areas: the Dynamic Continuous-Area Space-Time system. , 2003, American journal of epidemiology.

[10]  E. Järpe Surveillance of the interaction parameter of the ising model , 1999 .

[11]  Martin Kulldorff,et al.  Prospective time periodic geographical disease surveillance using a scan statistic , 2001 .

[12]  Galit Shmueli,et al.  Early statistical detection of anthrax outbreaks by tracking over-the-counter medication sales , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Farzad Mostashari,et al.  Clinical evaluation of the Emergency Medical Services (EMS) ambulance dispatch-based syndromic surveillance system, New York City , 2003, Journal of Urban Health.

[14]  M. Dwass Modified Randomization Tests for Nonparametric Hypotheses , 1957 .

[15]  David Bock,et al.  A review and discussion of prospective statistical surveillance in public health , 2003 .

[16]  M. Kulldorff A spatial scan statistic , 1997 .

[17]  R. Platt,et al.  Using automated medical records for rapid identification of illness syndromes (syndromic surveillance): the example of lower respiratory infection , 2001, BMC public health.

[18]  M. Kulldorff,et al.  Syndromic surveillance in public health practice, New York City. , 2004, Emerging infectious diseases.

[19]  Peter J. Park,et al.  Power comparisons for disease clustering tests , 2003, Comput. Stat. Data Anal..