Challenges and Limitations of Geospatial Data and Analyses in the Context of COVID-19

The global COVID-19 pandemic has infected public consciousness with a new appreciation of geospatial analyses in public health. Not since John Snow’s 1854 cholera map of London has a public health crisis brought so much attention to geospatial data, and their importance to understanding, predicting, and preventing the spread of disease. Although high profile models and visualization dashboards of COVID-19 have increased awareness of geospatial data and analytics in public health, they have also amplified concerns regarding data issues, including questions of reproducibility and replicability (R&R), provenance, scale of analysis, fitness for use, trust, privacy, and uncertainty. The following chapter sections discuss these data issues and their implications for decision making during the COVID-19 pandemic. R&R and provenance—Many complex geospatial R&R issues can be simplified by a focus on who has (or should have) access to associated geospatial provenance information which is here defined. Human mobility data—Non-traditional data sources, such as social media, can augment official reporting sources and provide insight into mobility and social distancing behaviors. We discuss how to incorporate human mobility data selectively, while maintaining anonymity, with official data sources during emergency situations. Scales of analysis—Although several models and geo-visualization tools are providing information about disease spread based on mobility, network connectivity, and/or socio-economic conditions, these models and tools rarely shed light about the impact of the spatial scale at which data are available and models are implemented. The widely adopted practice in the U.S. of mapping data to counties may provide false impressions to policy makers and the public regarding the locations at greatest and least risk. Given the dynamic nature of the disease, temporal scale is also crucial to understanding the implications of models. Understanding uncertainty in projection models—A key issue in infectious disease epidemiological models is the characterization and quantification of uncertainty in predictions. Uncertainties come from assumptions on model parameters, model estimation, and both systematic and random uncertainties from input data, including temporal lags in reporting. Model choice may add to overall uncertainties in the predicted numbers. Since these forecasts have an immense impact on public health policy and individual risk-based decision making, it is paramount to understand and convey the sources and nature of uncertainties.

[1]  Huan Liu,et al.  Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose , 2013, ICWSM.

[2]  Robert J. Hanisch,et al.  Research Data Framework (RDaF): Motivation, Development, and a Preliminary Framework Core , 2021 .

[3]  B. Mandelbrot How Long Is the Coast of Britain? Statistical Self-Similarity and Fractional Dimension , 1967, Science.

[4]  Carlo Ratti,et al.  Geo-located Twitter as proxy for global mobility patterns , 2013, Cartography and geographic information science.

[5]  Erle C. Ellis,et al.  The spatial and temporal domains of modern ecology , 2018, Nature Ecology & Evolution.

[6]  M. Goodchild Citizens as sensors: the world of volunteered geography , 2007 .

[7]  X. Li,et al.  The characteristics of multi-source mobility datasets and how they reveal the luxury nature of social distancing in the U.S. during the COVID-19 pandemic , 2020, medRxiv.

[8]  Timur Friedman,et al.  Evaluating Mobility Pattern Space Routing for DTNs , 2005, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[9]  M. R. Ferrández,et al.  Mathematical modeling of the spread of the coronavirus disease 2019 (COVID-19) taking into account the undetected infections. The case of China , 2020, Communications in Nonlinear Science and Numerical Simulation.

[10]  Tim O'Reilly,et al.  What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software , 2007 .

[11]  Nuno R. Faria,et al.  The effect of human mobility and control measures on the COVID-19 epidemic in China , 2020, Science.

[12]  Elisa Franco,et al.  The challenges of modeling and forecasting the spread of COVID-19 , 2020, Proceedings of the National Academy of Sciences.

[13]  Luke S Sloan,et al.  Who Tweets with Their Location? Understanding the Relationship between Demographic Characteristics and the Use of Geoservices and Geotagging on Twitter , 2015, PloS one.

[14]  J. Horney,et al.  The Impact of Social Vulnerability on COVID-19 in the U.S.: An Analysis of Spatially Varying Relationships , 2020, American Journal of Preventive Medicine.

[15]  B. Kar Scaling modeled potential residential loss from a storm surge , 2008 .

[16]  Patricia Balvanera,et al.  Biodiversity and Ecosystem Services , 2013 .

[17]  Jimin Wang,et al.  NeuroTPR: A neuro‐net toponym recognition model for extracting locations from social media messages , 2020, Trans. GIS.

[18]  W. O. Kermack,et al.  A contribution to the mathematical theory of epidemics , 1927 .

[19]  D. Quattrochi,et al.  Scale, Multiscaling, Remote Sensing, and GIS , 1997 .

[20]  Zhicheng Wang,et al.  Combating COVID-19: health equity matters , 2020, Nature Medicine.

[21]  A. Wilder-Smith,et al.  The global community needs to swiftly ramp up the response to contain COVID-19 , 2020, The Lancet.

[22]  Anna María Nápoles,et al.  COVID-19 and Racial/Ethnic Disparities. , 2020, JAMA.

[23]  Xiao Huang,et al.  Twitter reveals human mobility dynamics during the COVID-19 pandemic , 2020, PloS one.

[24]  M. Hodgson,et al.  Observational Scale and Modeled Potential Residential Loss from a Storm Surge , 2012 .

[25]  O Diekmann,et al.  The construction of next-generation matrices for compartmental epidemic models , 2010, Journal of The Royal Society Interface.

[26]  J. A. Tullis,et al.  Where Is the Provenance? Ethical Replicability and Reproducibility in GIScience and Its Critical Applications , 2020 .

[27]  N. Boccara,et al.  Automata network SIR models for the spread of infectious diseases in populations of moving individuals , 1992 .

[28]  Dave Higdon,et al.  Forecasting seasonal influenza with a state-space SIR model. , 2017, The annals of applied statistics.

[29]  Nicholas P. Jewell,et al.  Caution Warranted: Using the Institute for Health Metrics and Evaluation Model for Predicting the Course of the COVID-19 Pandemic , 2020, Annals of Internal Medicine.

[30]  N. Lam,et al.  On the Issues of Scale, Resolution, and Fractal Analysis in the Mapping Sciences* , 1992 .

[31]  An Pan,et al.  Evolving Epidemiology and Impact of Non-pharmaceutical Interventions on the Outbreak of Coronavirus Disease 2019 in Wuhan, China , 2020, medRxiv.

[32]  Mark Dredze,et al.  The Twitter Social Mobility Index: Measuring Social Distancing Practices from Geolocated Tweets , 2020, ArXiv.

[33]  E. Gakidou,et al.  Predictive performance of international COVID-19 mortality forecasting models , 2020, medRxiv.

[34]  Jiajun Liu,et al.  Understanding Human Mobility from Twitter , 2014, PloS one.

[35]  Judith Gelernter,et al.  Geo‐parsing Messages from Microtext , 2011, Trans. GIS.

[36]  Susan L. Cutter,et al.  Bridging Twitter and Survey Data for Evacuation Assessment of Hurricane Matthew and Hurricane Irma , 2020 .

[37]  Reid Ewing,et al.  Does Density Aggravate the COVID-19 Pandemic? , 2020, Journal of the American Planning Association.

[38]  Huan Ning,et al.  Identifying disaster related social media for rapid response: a visual-textual fused CNN architecture , 2020, Int. J. Digit. Earth.

[39]  B. Finkenstädt,et al.  Statistical Inference in a Stochastic Epidemic SEIR Model with Control Intervention: Ebola as a Case Study , 2006, Biometrics.

[40]  B. Kar,et al.  Characterizing the spread of COVID-19 from human mobility patterns and SocioDemographic indicators , 2020, ARIC@SIGSPATIAL.

[41]  Diansheng Guo,et al.  A novel approach to leveraging social media for rapid flood mapping: a case study of the 2015 South Carolina floods , 2018 .

[42]  A. Stewart Fotheringham,et al.  Reproducibility and Replicability in Geographical Analysis , 2019, Geographical Analysis.

[43]  T. Kuniya Prediction of the Epidemic Peak of Coronavirus Disease in Japan, 2020 , 2020, Journal of clinical medicine.

[44]  P. Atkinson,et al.  Spatial Scale Problems and Geostatistical Solutions: A Review , 2000 .

[45]  Martin A. Tanner,et al.  Learning as We Go: An Examination of the Statistical Accuracy of COVID19 Daily Death Count Predictions , 2020, medRxiv.

[46]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[47]  M. Williams,et al.  Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users’ Views, Online Context and Algorithmic Estimation , 2017, Sociology.

[48]  Daniel Barker,et al.  Predictions, role of interventions and effects of a historic national lockdown in India's response to the COVID-19 pandemic: data science call to arms. , 2020, Harvard data science review.

[49]  S. Cutter,et al.  Leveraging Twitter to gauge evacuation compliance: Spatiotemporal analysis of Hurricane Matthew , 2017, PloS one.

[50]  Min Zhao,et al.  A nationwide survey of psychological distress among Chinese people in the COVID-19 epidemic: implications and policy recommendations , 2020, General Psychiatry.