gwpcorMapper: an interactive mapping tool for exploring geographically weighted correlation and partial correlation in high-dimensional geospatial datasets

Exploratory spatial data analysis (ESDA) plays a key role in research that includes geographic data. In ESDA, analysts often want to visualize observations and local relationships on a map. However, software dedicated to visualizing local spatial relationships between multiple variables in high-dimensional datasets remains underdeveloped. This paper introduces gwpcorMapper, a newly developed software application for mapping geographically weighted correlation and partial correlation in large multivariate datasets. gwpcorMapper facilitates ESDA by letting researchers interact with map components that describe local correlative relationships. We built gwpcorMapper using the R Shiny framework. The software inherits its core algorithm from GWpcor, an R library for calculating geographically weighted correlation and partial correlation statistics. We demonstrate gwpcorMapper by using it to explore census data and find meaningful relationships that describe the work-life environment in the 23 special wards of Tokyo, Japan. We show that gwpcorMapper is useful for both variable selection and parameter tuning for geographically weighted statistics. In this case study, gwpcorMapper reveals strong, statistically clear local variation in the relationship between the number of commuters and the total number of hours worked when the total population in each district is taken into account. Our application demonstrates that ESDA of high-dimensional geospatial data with gwpcorMapper is applicable across multiple fields.
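To make the two statistics named above concrete, the following is a minimal Python sketch of a geographically weighted correlation and a first-order partial correlation. It is not the GWpcor or gwpcorMapper implementation: the Gaussian kernel, the fixed bandwidth, the function names, and the toy data are all assumptions chosen for illustration.

```python
import math

def gaussian_weights(coords, focal, bandwidth):
    """Kernel weights that decay with distance from the focal location.
    The Gaussian kernel and fixed bandwidth are assumptions for this sketch."""
    fx, fy = focal
    return [math.exp(-0.5 * (math.hypot(x - fx, y - fy) / bandwidth) ** 2)
            for x, y in coords]

def weighted_corr(x, y, w):
    """Pearson correlation of x and y under observation weights w."""
    total = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / total
    my = sum(wi * yi for wi, yi in zip(w, y)) / total
    cov = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    vx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    vy = sum(wi * (yi - my) ** 2 for wi, yi in zip(w, y))
    return cov / math.sqrt(vx * vy)

def gw_correlation(coords, x, y, bandwidth):
    """One local correlation coefficient per observation location."""
    return [weighted_corr(x, y, gaussian_weights(coords, c, bandwidth))
            for c in coords]

def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y given one control z."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

def gw_partial_correlation(coords, x, y, z, bandwidth):
    """Local partial correlation of x and y given z at every location."""
    result = []
    for c in coords:
        w = gaussian_weights(coords, c, bandwidth)
        result.append(partial_corr(weighted_corr(x, y, w),
                                   weighted_corr(x, z, w),
                                   weighted_corr(y, z, w)))
    return result

# Toy data (hypothetical): x and y rise together in the west cluster
# and move in opposite directions in the east cluster.
coords = [(0, 0), (1, 0), (2, 0), (3, 0), (7, 0), (8, 0), (9, 0), (10, 0)]
x = [1, 2, 3, 4, 1, 2, 3, 4]
y = [1.1, 1.9, 3.2, 3.9, 4.1, 2.9, 2.2, 0.8]

global_r = weighted_corr(x, y, [1.0] * len(x))      # equal weights: ordinary Pearson r
local_r = gw_correlation(coords, x, y, bandwidth=1.5)
```

In this toy example the equal-weight coefficient hides the opposing relationships in the two clusters, while the local coefficients differ in sign between west and east. Mapping such local coefficients is the kind of view the abstract describes.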
