Estimating Spread of Contact-Based Contagions in a Population Through Sub-Sampling

Physical contacts result in the spread of various phenomena such as viruses, gossips, ideas, packages and marketing pamphlets across a population. The spread depends on how people move and co-locate with each other, or their mobility patterns. How far such phenomena spread has significance for both policy making and personal decision making, e.g., studying the spread of COVID-19 under different intervention strategies such as wearing a mask. In practice, mobility patterns of an entire population is never available, and we usually have access to location data of a subset of individuals. In this paper, we formalize and study the problem of estimating the spread of a phenomena in a population, given that we only have access to sub-samples of location visits of some individuals in the population. We show that simple solutions such as estimating the spread in the sub-sample and scaling it to the population, or more sophisticated solutions that rely on modeling location visits of individuals do not perform well in practice, the former because it ignores contacts between unobserved individuals and sampled ones and the latter because it yields inaccurate modeling of co-locations. Instead, we directly model the co-locations between the individuals. We introduce PollSpreader and PollSusceptible, two novel approaches that model the co-locations between individuals using a contact network, and infer the properties of the contact network using the subsample to estimate the spread of the phenomena in the entire population. We show that our estimates provide an upper bound and a lower bound on the spread of the disease in expectation. Finally, using a large high-resolution real-world mobility dataset, we experimentally show that our estimates are accurate, while other methods that do not correctly account for co-locations between individuals result in wrong observations (e.g, premature herd-immunity).

[1]  Yiu Chung Lau,et al.  Temporal dynamics in viral shedding and transmissibility of COVID-19 , 2020, Nature Medicine.

[2]  Mikhail Prokopenko,et al.  Modelling transmission and control of the COVID-19 pandemic in Australia , 2020, Nature communications.

[3]  David S. Rosenblum,et al.  A Non-Parametric Generative Model for Human Trajectories , 2018, IJCAI.

[4]  HighWire Press Proceedings of the Royal Society of London. Series A, Containing papers of a mathematical and physical character , 1934 .

[5]  Lantao Yu,et al.  SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient , 2016, AAAI.

[6]  C. Macken,et al.  Modeling targeted layered containment of an influenza pandemic in the United States , 2008, Proceedings of the National Academy of Sciences.

[7]  B. Cowling,et al.  Multi-route respiratory infection: When a transmission route may dominate , 2020, Science of The Total Environment.

[8]  Bhaskar Krishnamachari,et al.  COVID-19 Risk Estimation using a Time-varying SIR-model , 2020, COVID@SIGSPATIAL.

[9]  X. Liu,et al.  Hawkes process modeling of COVID-19 with mobility leading indicators and spatial covariates , 2020, International Journal of Forecasting.

[10]  K. Azuma,et al.  Assessing the risk of COVID-19 from multiple pathways of exposure to SARS-CoV-2: Modeling in health-care settings and effectiveness of nonpharmaceutical interventions , 2020, Environment International.

[11]  Sharon J Peacock,et al.  Pathophysiology, Transmission, Diagnosis, and Treatment of Coronavirus Disease 2019 (COVID-19): A Review. , 2020, JAMA.

[12]  Rosalind M Eggo,et al.  Projecting contact matrices in 177 geographical regions: An update and comparison with empirical data for the COVID-19 era , 2020, medRxiv.

[13]  C. Whittaker,et al.  Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand , 2020 .

[14]  R. Mikolajczyk,et al.  Social Contacts and Mixing Patterns Relevant to the Spread of Infectious Diseases , 2008, PLoS medicine.

[15]  E. Kostelich,et al.  To mask or not to mask: Modeling the potential for face mask use by the general public to curtail the COVID-19 pandemic , 2020, Infectious Disease Modelling.

[16]  Yan Liu,et al.  Toward Accurate Spatiotemporal COVID-19 Risk Scores Using High-Resolution Real-World Mobility Data , 2022, ACM Trans. Spatial Algorithms Syst..

[17]  Jing Zhao,et al.  Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus–Infected Pneumonia , 2020, The New England journal of medicine.

[18]  Mark Jit,et al.  Projecting social contact matrices in 152 countries using contact surveys and demographic data , 2017, PLoS Comput. Biol..

[19]  Francesco Castelli,et al.  Comparing SARS-CoV-2 with SARS-CoV and influenza pandemics , 2020, The Lancet Infectious Diseases.

[20]  Jean-François Paiement,et al.  A Generative Model of Urban Activities from Cellular Data , 2018, IEEE Transactions on Intelligent Transportation Systems.

[21]  L. Poon,et al.  Clinical progression and viral load in a community outbreak of coronavirus-associated SARS pneumonia : a prospective study , 2003 .

[22]  W. O. Kermack,et al.  A contribution to the mathematical theory of epidemics , 1927 .

[23]  W. Lim,et al.  Viral shedding patterns of coronavirus in patients with probable severe acute respiratory syndrome , 2004, The Lancet.

[24]  Jiawei Han,et al.  A Data-Driven Graph Generative Model for Temporal Interaction Networks , 2020, KDD.

[25]  Farnoush Banaei Kashani,et al.  Efficient Reachability Query Evaluation in Large Spatiotemporal Contact Datasets , 2012, Proc. VLDB Endow..

[26]  Siddharth Gupta,et al.  The TimeGeo modeling framework for urban mobility without travel surveys , 2016, Proceedings of the National Academy of Sciences.

[27]  N. Ferguson,et al.  Time lines of infection and disease in human influenza: a review of volunteer challenge studies. , 2008, American journal of epidemiology.

[28]  L. Fang,et al.  Duration of symptom onset to hospital admission and admission to discharge or death in SARS in mainland China: a descriptive study , 2009, Tropical medicine & international health : TM & IH.

[29]  Jie Feng,et al.  Learning to Simulate Human Mobility , 2020, KDD.

[30]  Stewart T. Chang,et al.  Covasim: An agent-based model of COVID-19 dynamics and interventions , 2020, medRxiv.

[31]  M. Cevik,et al.  SARS-CoV-2, SARS-CoV, and MERS-CoV viral load dynamics, duration of viral shedding, and infectiousness: a systematic review and meta-analysis , 2020, The Lancet Microbe.

[32]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[33]  Quentin J. Leclerc,et al.  Quantifying the impact of physical distance measures on the transmission of COVID-19 in the UK , 2020, BMC Medicine.