Rational evaluation of various epidemic models based on the COVID-19 data of China

During the study of epidemics, one of the most significant and also challenging problems is to forecast the future trends, on which all follow-up actions of individuals and governments heavily rely. However, to pick out a reliable predictable model/method is far from simple, a rational evaluation of various possible choices is eagerly needed, especially under the severe threat of COVID-19 epidemics which is spreading worldwide right now. In this paper, based on the public COVID-19 data of seven provinces/cities in China reported during the spring of 2020, we make a systematical investigation on the forecast ability of eight widely used empirical functions, four statistical inference methods and five dynamical models widely used in the literature. We highlight the significance of a well balance between model complexity and accuracy, over-fitting and under-fitting, as well as model robustness and sensitivity. We further introduce the Akaike information criterion, root mean square errors and robustness index to quantify these three golden means and to evaluate various epidemic models/methods. Through extensive simulations, we find that the inflection point plays a crucial role in the choice of the size of dataset in forecasting. Before the inflection point, no model considered here could make a reliable prediction. We further notice the Logistic function steadily underestimate the final epidemic size, while the Gomertz's function makes an overestimation in all cases. Since the methods of sequential Bayesian and time-dependent reproduction number take the non-constant nature of the effective reproduction number with the progression of epidemics into consideration, we suggest to employ them especially in the late stage of an epidemic. The transition-like behavior of exponential growth method from underestimation to overestimation with respect to the inflection point might be useful for constructing a more reliable forecast. Towards the dynamical models based on ODEs, it is observed that the SEIR-QD and SEIR-PO models generally show a better performance than SIR, SEIR and SEIR-AHQ models on the COVID-19 epidemics, whose success could be attributed to the inclusion of self-protection and quarantine, and a proper trade-off between model complexity and fitting accuracy.

[1]  Sharon K. Greene,et al.  Applying infectious disease forecasting to public health: a path forward using influenza forecasting examples , 2019, BMC Public Health.

[2]  N. Sugiura Further analysts of the data by akaike' s information criterion and the finite corrections , 1978 .

[3]  L Forsberg White,et al.  A likelihood‐based method for real‐time estimation of the serial interval and reproductive number of an epidemic , 2008, Statistics in medicine.

[4]  G. Kitagawa,et al.  Information Criteria and Statistical Modeling , 2007 .

[5]  Wu-Chun Cao,et al.  Mathematical modelling of SARS and other infectious diseases in China: a review , 2009, Tropical medicine & international health : TM & IH.

[6]  Anirudh V. Mutalik Models to predict H1N1 outbreaks: a literature review , 2017 .

[7]  J. Wallinga,et al.  Different Epidemic Curves for Severe Acute Respiratory Syndrome Reveal Similar Impacts of Control Measures , 2004, American journal of epidemiology.

[8]  Sebastian Funk,et al.  Assessing the performance of real-time epidemic forecasts: A case study of Ebola in the Western Area region of Sierra Leone, 2014-15 , 2017, bioRxiv.

[9]  Zhien Ma,et al.  Dynamical Modeling and Analysis of Epidemics , 2009 .

[10]  Daihai He,et al.  Simple framework for real-time forecast in a data-limited situation: the Zika virus (ZIKV) outbreaks in Brazil from 2015 to 2016 as an example , 2019, Parasites & Vectors.

[11]  L. Bettencourt,et al.  Real Time Bayesian Estimation of the Epidemic Potential of Emerging Infectious Diseases , 2008, PloS one.

[12]  Pejman Rohani,et al.  Avoidable errors in the modelling of outbreaks of emerging pathogens, with special reference to Ebola , 2014, Proceedings of the Royal Society B: Biological Sciences.

[13]  Gavin J. Gibson,et al.  Comparison and Assessment of Epidemic Models , 2018 .

[14]  Ying Wang,et al.  Estimation of the epidemic properties of the 2019 novel coronavirus: A mathematical modeling study , 2020, medRxiv.

[15]  Michael Y. Li,et al.  Why is it difficult to accurately predict the COVID-19 epidemic? , 2020, Infectious Disease Modelling.

[16]  Madhav V. Marathe,et al.  A framework for evaluating epidemic forecasts , 2017, BMC Infectious Diseases.

[17]  H. Akaike A new look at the statistical model identification , 1974 .

[18]  Gerardo Chowell,et al.  The RAPIDD ebola forecasting challenge: Synthesis and lessons learnt. , 2017, Epidemics.

[19]  Ian M. Hall,et al.  Modelling the global spread of diseases: A review of current practice and capability , 2018, Epidemics.

[20]  Liangrong Peng,et al.  Epidemic analysis of COVID-19 in China by dynamical modeling , 2020, medRxiv.

[21]  Ursula Klingmüller,et al.  Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood , 2009, Bioinform..

[22]  Jukka Corander,et al.  On the Identifiability of Transmission Dynamic Models for Infectious Diseases , 2015, Genetics.

[23]  Xinxin Zhang,et al.  Phase-adjusted estimation of the number of Coronavirus Disease 2019 cases in Wuhan, China , 2020, Cell Discovery.

[24]  Jiang Yu,et al.  Modeling and prediction for the trend of outbreak of NCP based on a time-delay dynamic system , 2020 .

[25]  Andreas Handel,et al.  A review of mathematical models of influenza A infections within a host or cell culture: lessons learned and challenges ahead , 2011, BMC public health.

[26]  Dao Nguyen,et al.  Statistical Inference for Partially Observed Markov Processes via the R Package pomp , 2015, 1509.00503.

[27]  Jianhong Wu,et al.  Estimation of the Transmission Risk of the 2019-nCoV and Its Implication for Public Health Interventions , 2020, Journal of clinical medicine.

[28]  M. Brisson,et al.  Mathematical Modeling of the Transmission Dynamics of Clostridium difficile Infection and Colonization in Healthcare Settings: A Systematic Review , 2016, PloS one.

[29]  Sanjay Basu,et al.  Complexity in Mathematical Models of Public Health Policies: A Guide for Consumers of Models , 2013, PLoS medicine.

[30]  Michael Höhle,et al.  Model selection and parameter estimation for dynamic epidemic models via iterated filtering: application to rotavirus in Germany , 2018, Biostatistics.

[31]  Michael Y. Li Important Concepts in Mathematical Modeling of Infectious Diseases , 2018 .

[32]  Pierre-Yves Boëlle,et al.  The R0 package: a toolbox to estimate reproduction numbers for epidemic outbreaks , 2012, BMC Medical Informatics and Decision Making.

[33]  A. McQuarrie,et al.  Regression and Time Series Model Selection , 1998 .

[34]  C. Viboud,et al.  Mathematical models to characterize early epidemic growth: A review. , 2016, Physics of life reviews.

[35]  Gerardo Chowell,et al.  Assessing parameter identifiability in compartmental dynamic models using a computational approach: application to infectious disease transmission models , 2019, Theoretical Biology and Medical Modelling.

[36]  W. Michael Conklin,et al.  Monte Carlo Methods in Bayesian Computation , 2001, Technometrics.