Bayesian Biosurveillance of Disease Outbreaks

Early, reliable detection of disease outbreaks is a critical problem today. This paper reports an investigation of the use of causal Bayesian networks to model spatio-temporal patterns of a non-contagious disease (respiratory anthrax infection) in a population of people. The number of parameters in such a network can become enormous, if not carefully managed. Also, inference needs to be performed in real time as population data stream in. We describe techniques we have applied to address both the modeling and inference challenges. A key contribution of this paper is the explication of assumptions and techniques that are sufficient to allow the scaling of Bayesian network modeling and inference to millions of nodes for real-time surveillance applications. The results reported here provide a proof-of-concept that Bayesian networks can serve as the foundation of a system that effectively performs Bayesian biosurveillance of disease outbreaks.

[1]  M. Kulldor,et al.  Prospective time-periodic geographical disease surveillance using a scan statistic , 2001 .

[2]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[3]  James D. Hamilton Time Series Analysis , 1994 .

[4]  M. Meltzer,et al.  The economic impact of a bioterrorist attack: are prevention and postattack intervention programs justifiable? , 1997, Emerging infectious diseases.

[5]  E. Carlstein Nonparametric Change-Point Estimation , 1988 .

[6]  S. Greenland Causal Analysis in the Health Sciences , 2000 .

[7]  Andrew W. Moore,et al.  Bayesian Network Anomaly Pattern Detection for Disease Outbreaks , 2003, ICML.

[8]  Finn Verner Jensen,et al.  Inference in Multiply Sectioned Bayesian Networks with Extended Shafer-Shenoy and Lazy Propagation , 1999, UAI.

[9]  Andrew W. Moore,et al.  Algorithms for rapid outbreak detection: a research synthesis , 2005, J. Biomed. Informatics.

[10]  Martin Kulldorff,et al.  Prospective time periodic geographical disease surveillance using a scan statistic , 2001 .

[11]  Tom Fawcett,et al.  Activity monitoring: noticing interesting changes in behavior , 1999, KDD '99.

[12]  Avi Pfeffer,et al.  Object-Oriented Bayesian Networks , 1997, UAI.

[13]  R. Serfling Methods for current statistical analysis of excess pneumonia-influenza deaths. , 1963, Public health reports.

[14]  G. D. Williamson,et al.  A monitoring system for detecting aberrations in public health surveillance reports. , 1999, Statistics in medicine.

[15]  L. Hutwagner,et al.  The bioterrorism preparedness and response Early Aberration Reporting System (EARS) , 2003, Journal of Urban Health.

[16]  Sampath Srinivas,et al.  A Probabilistic Approach to Hierarchical Model-based Diagnosis , 1994, UAI.

[17]  Kenneth D. Mandl,et al.  Time series modeling for syndromic surveillance , 2003, BMC Medical Informatics Decis. Mak..

[18]  Michael M. Wagner,et al.  Value of ICD-9-Coded Chief Complaints for Detection of Epidemics , 2002, J. Am. Medical Informatics Assoc..

[19]  Dean F. Sittig,et al.  The emerging science of very early detection of disease outbreaks. , 2001, Journal of public health management and practice : JPHMP.

[20]  Andrew W. Moore,et al.  Summary of Biosurveillance-relevant statistical and data mining technologies , 2002 .

[21]  Andrew W. Moore,et al.  Data mining for early disease outbreak detection , 2004 .

[22]  M. Kulldorff A spatial scan statistic , 1997 .

[23]  Andrew W. Moore,et al.  A Fast Multi-Resolution Method for Detection of Significant Spatial Disease Clusters , 2003, NIPS.

[24]  David L. Craft,et al.  Emergency response to an anthrax attack , 2003, Proceedings of the National Academy of Sciences of the United States of America.