A comparative study of SIR Model, Linear Regression, Logistic Function and ARIMA Model for forecasting COVID-19 cases

At the start of February 2020, COVID-19 had been confirmed in 11,946 people worldwide, with a mortality rate of almost 2%. Many epidemic diseases, including those caused by human coronaviruses, display patterns. In this study, we use data analytics to develop regression models and a Susceptible-Infected-Recovered (SIR) model of the contagion, and we compare how well these models predict the number of cases. First, we build an understanding of the data through Exploratory Data Analysis (EDA). Next, we derive the model parameters from the available data for the top four regions, selected by history of infections and by number of infected people as of the end of August 2020. Finally, we compare the models and recommend directions for further research.
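
For readers unfamiliar with the SIR model named above, the following is a minimal sketch of the classic Kermack-McKendrick equations, dS/dt = -beta*S*I/N, dI/dt = beta*S*I/N - gamma*I, dR/dt = gamma*I, integrated with forward Euler steps. It is an illustration only and not the implementation used in the study: the function name `simulate_sir`, the step size, and the parameter values in the usage lines are assumptions chosen for the example.

```python
def simulate_sir(population, infected0, beta, gamma, days, steps_per_day=10):
    """Integrate the classic SIR equations with forward Euler steps.

    beta  : transmission rate per day (illustrative value, not fitted)
    gamma : recovery rate per day (illustrative value, not fitted)
    Returns one (S, I, R) tuple per day, starting with day 0.
    """
    s = float(population - infected0)
    i = float(infected0)
    r = 0.0
    dt = 1.0 / steps_per_day
    history = [(s, i, r)]
    for _ in range(days):
        for _ in range(steps_per_day):
            new_infections = beta * s * i / population * dt
            new_recoveries = gamma * i * dt
            s -= new_infections
            i += new_infections - new_recoveries
            r += new_recoveries
        history.append((s, i, r))
    return history


# Hypothetical scenario: 1,000,000 people, 10 initial infections,
# beta = 0.3 and gamma = 0.1 (basic reproduction number beta/gamma = 3).
history = simulate_sir(1_000_000, 10, beta=0.3, gamma=0.1, days=160)
peak_day = max(range(len(history)), key=lambda d: history[d][1])
print("peak day:", peak_day, "infected at peak:", round(history[peak_day][1]))
```

In a fitting workflow, beta and gamma would be estimated from observed case counts for each region rather than fixed by hand as they are here.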
