Nowcasting by Bayesian Smoothing: A flexible, generalizable model for real-time epidemic tracking

Delays in case reporting are common to disease surveillance systems, making it difficult to track diseases in real-time. “Nowcast” approaches attempt to estimate the complete case counts for a given reporting date, using a time series of case reports that is known to be incomplete due to reporting delays. Modeling the reporting delay distribution is a common feature of nowcast approaches. However, many nowcast approaches ignore a crucial feature of infectious disease transmission—that future cases are intrinsically linked to past reported cases—and are optimized to a single application, which may limit generalizability. Here, we present a Bayesian approach, NobBS (Nowcasting by Bayesian Smoothing) capable of producing smooth and accurate nowcasts in multiple disease settings. We test NobBS on dengue in Puerto Rico and influenza-like illness (ILI) in the United States to examine performance and robustness across settings exhibiting a range of common reporting delay characteristics (from stable to time-varying), and compare this approach with a published nowcasting package. We show that introducing a temporal relationship between cases considerably improves performance when the reporting delay distribution is time-varying, and we identify trade-offs in the role of moving windows to accurately capture changes in the delay. We present software implementing this new approach (R package “NobBS”) for widespread application. Significance Achieving accurate, real-time estimates of disease activity is challenged by delays in case reporting. However, approaches that seek to estimate cases in spite of reporting delays often do not consider the temporal relationship between cases during an outbreak, nor do they identify characteristics of robust approaches that generalize to a wide range of surveillance contexts with very different reporting delays. Here, we present a smooth Bayesian nowcasting approach that produces accurate estimates that capture the time evolution of the epidemic curve and outperform a previous approach in the literature. We assess the performance for two diseases to identify important features of the reporting delay distribution that contribute to the model’s performance and robustness across surveillance settings.

[1]  J Kaldor,et al.  Changing pattern of delays in reporting AIDS diagnoses in Australia , 1998, Australian and New Zealand journal of public health.

[2]  John S. Brownstein,et al.  Evaluating the performance of infectious disease forecasts: A comparison of climate-driven and seasonal dengue forecasts for Mexico , 2016, Scientific Reports.

[3]  Michael Höhle,et al.  Bayesian outbreak detection in the presence of reporting delays , 2015, Biometrical journal. Biometrische Zeitschrift.

[4]  J. Wallinga,et al.  Different Epidemic Curves for Severe Acute Respiratory Syndrome Reveal Similar Impacts of Control Measures , 2004, American journal of epidemiology.

[5]  R. Rosenfeld,et al.  A collaborative multiyear, multimodel assessment of seasonal influenza forecasting in the United States , 2019, Proceedings of the National Academy of Sciences.

[6]  C. M. Comiskey,et al.  AIDS in Ireland: the reporting delay distribution and the implementation of integral equation models , 1992, Comput. Appl. Biosci..

[7]  Paul H. Garthwaite,et al.  Modelling reporting delays for outbreak detection in infectious disease data , 2015 .

[8]  S B Thacker,et al.  The Science of Public Health Surveillance , 1989, Journal of public health policy.

[9]  Mauricio Santillana,et al.  Accurate estimation of influenza epidemics using Google search data via ARGO , 2015, Proceedings of the National Academy of Sciences.

[10]  Jacco Wallinga,et al.  Nowcasting the Number of New Symptomatic Cases During Infectious Disease Outbreaks Using Constrained P-spline Smoothing , 2019, Epidemiology.

[11]  D. Panagiotakos,et al.  Prevalence, and associated risk factors, of self-reported diabetes mellitus in a sample of adult urban population in Greece: MEDICAL Exit Poll Research in Salamis (MEDICAL EXPRESS 2002) , 2004, BMC public health.

[12]  Kenneth S. Kaminsky,et al.  Prediction of IBNR claim counts by modelling the distribution of report lags , 1987 .

[13]  Marc Lipsitch,et al.  Improving the evidence base for decision making during a pandemic: the example of 2009 influenza A/H1N1. , 2011, Biosecurity and bioterrorism : biodefense strategy, practice, and science.

[14]  Statistical Methods for the Chain Ladder Technique , 1999 .

[15]  X M Tu,et al.  Regression analysis of censored and truncated data: estimating reporting-delay distributions and AIDS incidence from surveillance data. , 1994, Biometrics.

[16]  Naren Ramakrishnan,et al.  Collaborative efforts to forecast seasonal influenza in the United States, 2015–2016 , 2019, Scientific Reports.

[17]  J. F. Lawless,et al.  Adjustments for reporting delays and the prediction of occurred but not reported events , 1994 .

[18]  Erhard Kremer,et al.  IBNR-claims and the two-way model of ANOVA , 1982 .

[19]  R. Verrall A STATE SPACE REPRESENTATION OF THE CHAIN LADDER LINEAR MODEL , 1989 .

[20]  Michael Höhle,et al.  Bayesian nowcasting during the STEC O104:H4 outbreak in Germany, 2011 , 2014, Biometrics.