Factors affecting automated syndromic surveillance

OBJECTIVE The increased threat of bioterroristic attacks and epidemic events requires the development of accurate and timely outbreak detection systems for early identification of anomalies in public health data. MATERIAL AND METHODS We propose an automated outbreak detection system based on syndromic data. This system uses an autoregressive model with seasonal components to monitor, online, the daily counts of chief complaints for respiratory syndromes at the emergency department of two major metropolitan hospitals. We evaluate this system by estimating the false positive rate in real data under the assumption that there were no outbreaks of disease, and the true positive rate in real baseline data in which we injected stochastically simulated outbreaks of different shape and size. We then use directed graphical models to account for the effect of exogenous factors on the detection performance of the system. RESULTS Our study shows that for a week-long outbreak, our model has an overall 84.8% true detection accuracy across all shapes of outbreaks, while the outbreak size influences the earliness to detection. The false and true positive rates are also associated with the exogenous factors and knowledge about these factors can help to improve the detection accuracy. CONCLUSION This study suggests that the integration of multiple data sources can significantly improve the detection accuracy of syndromic surveillance systems.

[1]  Farzad Mostashari,et al.  Clinical evaluation of the Emergency Medical Services (EMS) ambulance dispatch-based syndromic surveillance system, New York City , 2003, Journal of Urban Health.

[2]  Paul R. Cohen,et al.  Multiple Comparisons in Induction Algorithms , 2000, Machine Learning.

[3]  Yelena Yesha,et al.  Data Mining: Next Generation Challenges and Future Directions , 2004 .

[4]  Paul R. Cohen,et al.  Very Predictive Ngrams for Space-Limited Probabilistic Models , 2003, IDA.

[5]  Michael I. Jordan,et al.  Probabilistic Networks and Expert Systems , 1999 .

[6]  Paola Sebastiani,et al.  Automated Detection of Influenza Epidemics with Hidden Markov Models , 2003, IDA.

[7]  Marcello Pagano,et al.  Using temporal context to improve biosurveillance , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  R. G. Darling,et al.  Threats in bioterrorism. I: CDC category A agents. , 2002, Emergency medicine clinics of North America.

[9]  Joseph S. Lombardo,et al.  A systems overview of the Electronic Surveillance System for the Early Notification of Community-Based Epidemics (ESSENCE II) , 2003, Journal of Urban Health.

[10]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[11]  Paola Sebastiani,et al.  Bayesian methods , 2003 .

[12]  Paola Sebastiani,et al.  Chapter 3 Biosurveillance and Outbreak Detection , 2003 .

[13]  M. Hugh-jones,et al.  The Sverdlovsk anthrax outbreak of 1979. , 1994, Science.

[14]  Lisa J. Trigg,et al.  Roundtable on Bioterrorism Detection , 2002 .

[15]  George E. P. Box,et al.  Time Series Analysis: Forecasting and Control , 1977 .

[16]  David J. Hand,et al.  Intelligent Data Analysis: An Introduction , 2005 .

[17]  Kenneth D. Mandl,et al.  Time series modeling for syndromic surveillance , 2003, BMC Medical Informatics Decis. Mak..

[18]  Galit Shmueli,et al.  Early statistical detection of anthrax outbreaks by tracking over-the-counter medication sales , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Richard Platt,et al.  Use of Automated Ambulatory-Care Encounter Records for Detection of Acute Illness Clusters, Including Potential Bioterrorism Events , 2002, Emerging infectious diseases.

[20]  P E SARTWELL,et al.  The distribution of incubation periods of infectious disease. , 1950, American journal of hygiene.

[21]  Y. Elbert,et al.  Disease outbreak detection system using syndromic data in the greater Washington DC area. , 2002, American journal of preventive medicine.

[22]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[23]  Robert H. Shumway,et al.  Time Series Analysis and Its Applications (Springer Texts in Statistics) , 2005 .

[24]  Isaac S. Kohane,et al.  Brief Review: The Contributions of Biomedical Informatics to the Fight Against Bioterrorism , 2002, J. Am. Medical Informatics Assoc..

[25]  William B. Lober,et al.  Review Paper: Implementing Syndromic Surveillance: A Practical Guide Informed by the Early Experience , 2003, J. Am. Medical Informatics Assoc..

[26]  David S. Stoffer,et al.  Time series analysis and its applications , 2000 .

[27]  Michael M. Wagner,et al.  Technical Description of RODS: A Real-time Public Health Surveillance System , 2003, Journal of the American Medical Informatics Association.

[28]  R. Serfling Methods for current statistical analysis of excess pneumonia-influenza deaths. , 1963, Public health reports.