Hyper-local geographically weighted regression: extending GWR through local model selection and local bandwidth optimization

Geographically weighted regression (GWR) is an inherently exploratory technique for examining process non-stationarity in data relationships. This paper develops and applies a hyper-local GWR which extends such investigations further. The hyper-local GWR simultaneously optimizes both local model selection (which covariates to include in each local regression) and local kernel bandwidth specification (how much data should be included locally). These are evaluated using a measure of model fit. The hyper-local GWR approach evaluates different kernel bandwidths at each location and selects the most parsimonious local regression model. By allowing models and bandwidths to vary locally, this approach extends and refines the one-size-fits-all "whole map model" and "constant bandwidth calibration" under standard GWR. The results provide an alternative, complementary and more nuanced interpretation of localized regression. The method is illustrated using a case study modeling soil total nitrogen (STN) and soil total phosphorus (STP) from data collected at 689 locations in a watershed in Northern China. The analysis compares linear regression, standard GWR, and hyper-local GWR models of STN and STP and highlights the different locations at which covariates are identified as significant predictors of STN and STP by the different GWR approaches and the spatial variation in bandwidths. The hyper-local GWR results indicate that the STN processes are more non-stationary and localized than found via a standard application of GWR. By contrast, the results for STP are more confirmatory (i.e., similar) between the two GWR approaches providing extra assurance about the nature of the moderate non-stationary relationships observed. That is, a standard GWR may underestimate localized spatial heterogeneity where it is strongly present (as in the STN case study) and may overestimate it where spatial homogeneity is present (as in the STP case study). The overall benefits of hyper-local GWR are discussed, particularly in the context of the original investigative aims of GWR. A hyper-local approach provides a useful counter view of local regression modeling to that found with standard GWR. Where spatial non-stationarity exists, the hyper-local GWR provides a more spatially nuanced indication of the localization than a standard GWR analysis and can be used to suggest the direction of further analyses and investigations. Some areas of further work are suggested.

[1]  P. Longley,et al.  Spatial analysis: Modelling in a GIS environment , 1996 .

[2]  A. Stewart Fotheringham,et al.  Geographically Weighted Regression: A Method for Exploring Spatial Nonstationarity , 2010 .

[3]  S. Fotheringham,et al.  Geographically Weighted Regression , 1998 .

[4]  Yunqiang Wang,et al.  Spatial variability of soil total nitrogen and soil total phosphorus under different land uses in a small watershed on the Loess Plateau, China , 2009 .

[5]  David C. Wheeler,et al.  Simultaneous Coefficient Penalization and Model Selection in Geographically Weighted Regression: The Geographically Weighted Lasso , 2009 .

[6]  Martin Charlton,et al.  Geographically weighted regression with parameter-specific distance metrics , 2017, Int. J. Geogr. Inf. Sci..

[7]  Chang-Lin Mei,et al.  Spatially Varying Coefficient Models: Testing for Spatial Heteroscedasticity and Reweighting Estimation of the Coefficients , 2011 .

[8]  Fernando Tusell,et al.  Alleviating the effect of collinearity in geographically weighted regression , 2014, J. Geogr. Syst..

[9]  N. G. Best,et al.  The deviance information criterion: 12 years on , 2014 .

[10]  A. Fotheringham,et al.  The Multiple Testing Issue in Geographically Weighted Regression , 2016 .

[11]  Martin Charlton,et al.  Geographically weighted discriminant analysis , 2007 .

[12]  David R. Anderson,et al.  Multimodel Inference , 2004 .

[13]  Martin Charlton,et al.  Moving window kriging with geographically weighted variograms , 2010 .

[14]  Nina S. N. Lam,et al.  Geographically Weighted Elastic Net: A Variable-Selection and Modeling Method under the Spatially Nonstationary Condition , 2018 .

[15]  Stephanie T. Lanza,et al.  Sensitivity and Specificity of Information Criteria , 2018, bioRxiv.

[16]  A. Stewart Fotheringham,et al.  Multiscale Geographically Weighted Regression (MGWR) , 2017 .

[17]  M. Durbán,et al.  Modeling regional economic dynamics: Spatial dependence, spatial heterogeneity and nonlinearities , 2014 .

[18]  Shinji Nakaoka,et al.  New algorithm for constructing area-based index with geographical heterogeneities and variable selection: An application to gastric cancer screening , 2016, Scientific Reports.

[19]  Alexis J. Comber,et al.  Geographically weighted correspondence matrices for local error reporting and change analyses: mapping the spatial distribution of errors and change , 2017 .

[20]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[21]  David Wheeler,et al.  Multicollinearity and correlation among local regression coefficients in geographically weighted regression , 2005, J. Geogr. Syst..

[22]  Martin Charlton,et al.  GWmodel: An R Package for Exploring Spatial Heterogeneity Using Geographically Weighted Models , 2013, 1306.0413.

[23]  Kazuaki Miyamoto,et al.  A General Framework for Estimation and Inference of Geographically Weighted Regression Models: 2. Spatial Association and Model Specification Tests , 2002 .

[24]  S. Fotheringham,et al.  Geographically weighted summary statistics — aframework for localised exploratory data analysis , 2002 .

[25]  A. Stewart Fotheringham,et al.  Robust Geographically Weighted Regression: A Technique for Quantifying Spatial Relationships Between Freshwater Acidification Critical Loads and Catchment Attributes , 2010 .

[26]  Rich Harris,et al.  Using Contextualized Geographically Weighted Regression to Model the Spatial Heterogeneity of Land Prices in Beijing, China , 2013, Trans. GIS.

[27]  A. Páez,et al.  A General Framework for Estimation and Inference of Geographically Weighted Regression Models: 1. Location-Specific Kernel Bandwidths and a Test for Locational Heterogeneity , 2002 .

[28]  Tomoki Nakaya,et al.  Introducing bootstrap methods to investigate coefficient non-stationarity in spatial regression models , 2017 .

[29]  D. Wheeler Diagnostic Tools and a Remedial Method for Collinearity in Geographically Weighted Regression , 2007 .

[30]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[31]  Jack C. Yue,et al.  A modification to geographically weighted regression , 2017, International Journal of Health Geographics.

[32]  Martin Charlton,et al.  Living with Collinearity in Local Regression Models , 2012 .

[33]  Clifford M. Hurvich,et al.  Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion , 1998 .

[34]  Daniel A. Griffith,et al.  A Moran coefficient-based mixed effects approach to investigate spatially varying relationships , 2016 .

[35]  J. Chilès,et al.  Geostatistics: Modeling Spatial Uncertainty , 1999 .

[36]  Alexis Comber,et al.  Geographically Weighted Structural Equation Models : spatial variation in the drivers of environmental restoration effectiveness , 2022 .

[37]  Martin Charlton,et al.  The Use of Geographically Weighted Regression for Spatial Prediction: An Evaluation of Models Using Simulated Data Sets , 2010 .

[38]  Yee Leung,et al.  Statistical Tests for Spatial Nonstationarity Based on the Geographically Weighted Regression Model , 2000 .

[39]  Alexis J. Comber,et al.  Distance metric choice can both reduce and induce collinearity in geographically weighted regression , 2018, Environment and Planning B: Urban Analytics and City Science.

[40]  Chris Brunsdon,et al.  Geographically Weighted Regression: The Analysis of Spatially Varying Relationships , 2002 .

[41]  C. F. Sirmans,et al.  Spatial Modeling With Spatially Varying Coefficient Processes , 2003 .

[42]  J. Mcdonald,et al.  A nonparametric analysis of employment density in a polycentric city , 1997 .

[43]  Alexis J. Comber Geographically weighted methods for estimating local surfaces of overall, user and producer accuracies , 2013 .

[44]  Martin Charlton,et al.  Geographically weighted principal components analysis , 2011, Int. J. Geogr. Inf. Sci..

[45]  Giles M. Foody,et al.  Local characterization of thematic classification accuracy through spatially constrained confusion matrices , 2005 .

[46]  Concha Bielza,et al.  Lazy lasso for local regression , 2012, Comput. Stat..

[47]  Catherine Loader,et al.  Smoothing: Local Regression Techniques , 2012 .