A Unified Framework for Epidemic Prediction based on Poisson Regression

Epidemic prediction is an important problem in epidemic control. Poisson regression methods are often adopted in existing works, mostly with only the (intra-)regional environmental factors considered. As the diffusion of epidemics is affected by not only the intra-regional factors but also inter-regional and external ones, a unified framework based on Poisson regression with the three types of factors incorporated is proposed for the prediction. Specifically, we propose a Poisson-regression-based model first with the intra-regional and inter-regional factors included. The intra-regional factor in a particular time interval is represented by one feature vector with the regionally environmental and social factors considered. The inter-regional factor is modeled by a diffusion matrix which describes the possibilities that the epidemics can spread from one region to another, which in turn accounts for the propagating effects of the infected cases. To learn the structure of the diffusion matrix, we propose two approaches-utilizing some a priori knowledge (e.g., transportation network) and estimating it from scratch via a sparse structure assumption. The resulting optimization problem of the maximum a posterior solution is a convex one and can be efficiently solved by the alternating direction method of multipliers (ADMM). In addition, we incorporate also the external factor, i.e., the imported cases. With one fact that the distribution of the number of infected cases over a year is (approximately) unimodal for most epidemics and one assumption that the importing rate has a small variance over the year, we can approximate the effect of the external factor with a parametric function (e.g., a quadratic function) over time. The resulting optimization problem is still convex and can be also solved by the ADMM algorithm. Empirical evaluations are conducted based on a real data set which records the 16-days-reported cases in the Yunnan province of China for seven years, from 2005 to 2011. The experimental results demonstrate the effectiveness of our proposed models.

[1]  J. Pilz,et al.  Bayesian modelling of the effect of climate on malaria in Burundi , 2010, Malaria Journal.

[2]  P. Kaye Infectious diseases of humans: Dynamics and control , 1993 .

[3]  J. Achcar,et al.  Use of Poisson spatiotemporal regression models for the Brazilian Amazon Forest: malaria count data. , 2011, Revista da Sociedade Brasileira de Medicina Tropical.

[4]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[5]  D. Hunter,et al.  A Tutorial on MM Algorithms , 2004 .

[6]  J. Aponte,et al.  Spatio-seasonal modeling of the incidence rate of malaria in Mozambique , 2008, Malaria Journal.

[7]  Sanyi Tang,et al.  Predicting the HIV/AIDS epidemic and measuring the effect of mobility in mainland China. , 2013, Journal of theoretical biology.

[8]  D. Hunter,et al.  Optimization Transfer Using Surrogate Objective Functions , 2000 .

[9]  Joel Schwartz,et al.  Weather-based prediction of Plasmodium falciparum malaria in epidemic-prone regions of Ethiopia II. Weather-based prediction systems perform comparably to early detection systems in identifying times for interventions , 2004, Malaria Journal.

[10]  Xiaoming Yuan,et al.  On the O(1/t) Convergence Rate of Alternating Direction Method with Logarithmic-Quadratic Proximal Regularization , 2012, SIAM J. Optim..

[11]  Antoine Flahault,et al.  Predicting Pneumonia and Influenza Mortality from Morbidity Data , 2007, PloS one.

[12]  S. Sharmin,et al.  A Stochastic Model for Early Identification of Infectious Disease Epidemics With Application to Measles Cases in Bangladesh , 2015, Asia-Pacific journal of public health.

[13]  J. Hilbe Negative Binomial Regression: Index , 2011 .

[14]  K. Lange,et al.  The MM Alternative to EM , 2010, 1104.2203.

[15]  F. Brauer,et al.  Mathematical Models in Population Biology and Epidemiology , 2001 .

[16]  Hui Xiong,et al.  Towards controlling virus propagation in information systems with point-to-group information sharing , 2009, Decis. Support Syst..

[17]  J. T. Wulu,et al.  Regression analysis of count data , 2002 .

[18]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[19]  M. G. Garner,et al.  Predicting Ross River Virus Epidemics from Regional Weather Data , 2002, Epidemiology.

[20]  P. K. Srijith,et al.  Semi-supervised Gaussian Process Ordinal Regression , 2013, ECML/PKDD.

[21]  Pravin K. Trivedi,et al.  Regression Analysis of Count Data , 1998 .

[22]  J. Hilbe Negative Binomial Regression: Preface , 2007 .

[23]  S. Ellner,et al.  Human mobility patterns predict divergent epidemic dynamics among cities , 2013, Proceedings of the Royal Society B: Biological Sciences.

[24]  J. Robins,et al.  Transmission Dynamics and Control of Severe Acute Respiratory Syndrome , 2003, Science.

[25]  Chuan-Sheng Foo,et al.  A majorization-minimization algorithm for (multiple) hyperparameter learning , 2009, ICML '09.

[26]  Paul J. Birrell,et al.  Bayesian modeling to unmask and predict influenza A/H1N1pdm dynamics in London , 2011, Proceedings of the National Academy of Sciences.