Machine Learning-Powered Mitigation Policy Optimization in Epidemiological Models

A crucial aspect of managing a public health crisis is to effectively balance prevention and mitigation strategies, while taking their socio-economic impact into account. In particular, determining the influence of different non-pharmaceutical interventions (NPIs) on the effective use of public resources is an important problem, given the uncertainties on when a vaccine will be made available. In this paper, we propose a new approach for obtaining optimal policy recommendations based on epidemiological models, which can characterize the disease progression under different interventions, and a look-ahead reward optimization strategy to choose the suitable NPI at different stages of an epidemic. Given the time delay inherent in any epidemiological model and the exponential nature especially of an unmanaged epidemic, we find that such a look-ahead strategy infers non-trivial policies that adhere well to the constraints specified. Using two different epidemiological models, namely SEIR and EpiCast, we evaluate the proposed algorithm to determine the optimal NPI policy, under a constraint on the number of daily new cases and the primary reward being the absence of restrictions.

[1]  Jacques Klein,et al.  Data-driven Simulation and Optimization for Covid-19 Exit Strategies , 2020, KDD.

[2]  Changliu Liu,et al.  A Microscopic Epidemic Model and Pandemic Prediction Using Multi-Agent Reinforcement Learning , 2020, ArXiv.

[3]  Fausto Gozzi,et al.  A Simple Planning Problem for COVID-19 Lockdown , 2020, SSRN Electronic Journal.

[4]  C. Macken,et al.  Mitigation strategies for pandemic influenza in the United States. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Michael Y. Li,et al.  Why is it difficult to accurately predict the COVID-19 epidemic? , 2020, Infectious Disease Modelling.

[6]  Harshad Khadilkar,et al.  Optimising Lockdown Policies for Epidemic Control using Reinforcement Learning , 2020, Transactions of the Indian National Academy of Engineering.

[7]  Reza Yaesoubi,et al.  Identifying cost‐effective dynamic policies to control epidemics , 2016, Statistics in medicine.

[8]  W. Liang,et al.  Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions , 2020, Journal of thoracic disease.

[9]  Guido Lorenzoni,et al.  Macroeconomic Implications of Covid-19: Can Negative Supply Shocks Cause Demand Shortages? , 2020, SSRN Electronic Journal.

[10]  J. Robins,et al.  Transmissibility of 1918 pandemic influenza , 2004, Nature.

[11]  Rushil Anirudh,et al.  Improved surrogates in inertial confinement fusion with manifold and cycle consistencies , 2019, Proceedings of the National Academy of Sciences.

[12]  Marcelo Menezes Morato,et al.  An optimal predictive control strategy for COVID-19 (SARS-CoV-2) social distancing policies in Brazil , 2020, Annual Reviews in Control.

[13]  Philippe Lemey,et al.  Deep reinforcement learning for large-scale epidemic control , 2020, ECML/PKDD.

[14]  N. Lurie,et al.  Developing Covid-19 Vaccines at Pandemic Speed. , 2020, The New England journal of medicine.

[15]  Nicholas Soures,et al.  SIRNet: Understanding Social Distancing Measures with Hybrid Neural Network Model for COVID-19 Infectious Spread , 2020, 2004.10376.

[16]  C. Macken,et al.  Modeling targeted layered containment of an influenza pandemic in the United States , 2008, Proceedings of the National Academy of Sciences.

[17]  C. Whittaker,et al.  Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand , 2020 .

[18]  Shaobo He,et al.  SEIR modeling of the COVID-19 and its dynamics , 2020, Nonlinear dynamics.

[19]  X. Rodó,et al.  A modified SEIR model to predict the COVID-19 outbreak in Spain and Italy: Simulating control scenarios and multi-scale epidemics , 2020, Results in Physics.

[20]  Andrew Plummer,et al.  School dismissal as a pandemic influenza response: When, where and for how long? , 2019, Epidemics.

[21]  Slawomir Koziel,et al.  Surrogate-Based Modeling and Optimization , 2013 .