论文信息 - Optimising Lockdown Policies for Epidemic Control using Reinforcement Learning

Optimising Lockdown Policies for Epidemic Control using Reinforcement Learning

There has been intense debate about lockdown policies in the context of Covid-19 for limiting damage both to health and to the economy. We present an AI-driven approach for generating optimal lockdown policies that control the spread of the disease while balancing both health and economic costs. Furthermore, the proposed reinforcement learning approach automatically learns those policies, as a function of disease and population parameters. The approach accounts for imperfect lockdowns, can be used to explore a range of policies using tunable parameters, and can be easily extended to fine-grained lockdown strictness. The control approach can be used with any compatible disease and network simulation models.

Harshad Khadilkar | Tanuja Ganu | Deva P Seetharam

[1] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[2] N. Linton,et al. Real-Time Estimation of the Risk of Death from Novel Coronavirus (COVID-19) Infection: Inference Using Exported Cases , 2020, Journal of clinical medicine.

[3] Richard S. Sutton,et al. Reinforcement Learning , 1992, Handbook of Machine Learning.

[4] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[5] C. Mooney,et al. Monte Carlo Simulation , 1997 .

[6] Liliana Perez,et al. An agent-based approach for modeling dynamics of contagious disease spread , 2009, International journal of health geographics.

[7] Cecilia Mascolo,et al. Evolution of a location-based online social network: analysis and models , 2012, IMC '12.