Geographically weighted elastic net logistic regression

AbstractThis paper develops a localized approach to elastic net logistic regression, extending previous research describing a localized elastic net as an extension to a localized ridge regression or a localized lasso. All such models have the objective to capture data relationships that vary across space. Geographically weighted elastic net logistic regression is first evaluated through a simulation experiment and shown to provide a robust approach for local model selection and alleviating local collinearity, before application to two case studies: county-level voting patterns in the 2016 USA presidential election, examining the spatial structure of socio-economic factors associated with voting for Trump, and a species presence–absence data set linked to explanatory environmental and climatic factors at gridded locations covering mainland USA. The approach is compared with other logistic regressions. It improves prediction for the election case study only which exhibits much greater spatial heterogeneity in the binary response than the species case study. Model comparisons show that standard geographically weighted logistic regression over-estimated relationship non-stationarity because it fails to adequately deal with collinearity and model selection. Results are discussed in the context of predictor variable collinearity and selection and the heterogeneities that were observed. Ongoing work is investigating locally derived elastic net parameters.

[1]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[2]  David A. Belsley,et al.  Regression Analysis and its Application: A Data-Oriented Approach.@@@Applied Linear Regression.@@@Regression Diagnostics: Identifying Influential Data and Sources of Collinearity , 1981 .

[3]  Arthur E. Hoerl,et al.  Application of ridge analysis to regression problems , 1962 .

[4]  A. Stewart Fotheringham,et al.  Local Forms of Spatial Analysis , 2010 .

[5]  David C. Wheeler,et al.  Simultaneous Coefficient Penalization and Model Selection in Geographically Weighted Regression: The Geographically Weighted Lasso , 2009 .

[6]  Shinji Nakaoka,et al.  New algorithm for constructing area-based index with geographical heterogeneities and variable selection: An application to gastric cancer screening , 2016, Scientific Reports.

[7]  L. Anselin SPATIAL DEPENDENCE AND SPATIAL STRUCTURAL INSTABILITY IN APPLIED REGRESSION ANALYSIS , 1990 .

[8]  David Wheeler,et al.  Multicollinearity and correlation among local regression coefficients in geographically weighted regression , 2005, J. Geogr. Syst..

[9]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[10]  D. Wheeler Diagnostic Tools and a Remedial Method for Collinearity in Geographically Weighted Regression , 2007 .

[11]  Chris Brunsdon,et al.  Geographically Weighted Regression: The Analysis of Spatially Varying Relationships , 2002 .

[12]  Marie-Josée Fortin,et al.  Exploring spatial non-stationarity of fisheries survey data using geographically weighted regression (GWR): an example from the Northwest Atlantic , 2010 .

[13]  Martin Charlton,et al.  GWmodel: An R Package for Exploring Spatial Heterogeneity Using Geographically Weighted Models , 2013, 1306.0413.

[14]  Hans Wackernagel,et al.  Multivariate Geostatistics: An Introduction with Applications , 1996 .

[15]  Tomoki Nakaya,et al.  Introducing bootstrap methods to investigate coefficient non-stationarity in spatial regression models , 2017 .

[16]  A. Stewart Fotheringham,et al.  Geographically Weighted Regression: A Method for Exploring Spatial Nonstationarity , 2010 .

[17]  Martin Charlton,et al.  The Use of Geographically Weighted Regression for Spatial Prediction: An Evaluation of Models Using Simulated Data Sets , 2010 .

[18]  R. Brereton,et al.  Crucial problems in regression modelling and their solutions. , 2002, The Analyst.

[19]  Dengkui Li,et al.  Structure identification and variable selection in geographically weighted regression models , 2017 .

[20]  P. Atkinson,et al.  Exploring the Relations Between Riverbank Erosion and Geomorphological Controls Using Geographically Weighted Logistic Regression , 2002 .

[21]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[22]  S. Fotheringham,et al.  Modeling the spatial variation of the explanatory factors of human-caused wildfires in Spain using geographically weighted logistic regression , 2014 .

[23]  A. Stewart Fotheringham,et al.  Geographically weighted regression and multicollinearity: dispelling the myth , 2016, J. Geogr. Syst..

[24]  Alexis J. Comber,et al.  Geographically weighted correspondence matrices for local error reporting and change analyses: mapping the spatial distribution of errors and change , 2017 .

[25]  A. Páez,et al.  Geographically Weighted Regression , 2021, Handbook of Regional Science.

[26]  W. W. Muir,et al.  Regression Diagnostics: Identifying Influential Data and Sources of Collinearity , 1980 .

[27]  Martin Charlton,et al.  Spatial Nonstationarity and Autoregressive Models , 1998 .

[28]  J. Friedman,et al.  A Statistical View of Some Chemometrics Regression Tools , 1993 .

[29]  Nina S. N. Lam,et al.  Geographically Weighted Elastic Net: A Variable-Selection and Modeling Method under the Spatially Nonstationary Condition , 2018 .

[30]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[31]  Fernando Tusell,et al.  Alleviating the effect of collinearity in geographically weighted regression , 2014, J. Geogr. Syst..

[32]  Martin Charlton,et al.  Geographically weighted principal components analysis , 2011, Int. J. Geogr. Inf. Sci..

[33]  S. Fotheringham,et al.  Geographically weighted summary statistics — aframework for localised exploratory data analysis , 2002 .

[34]  Damaris Zurell,et al.  Collinearity: a review of methods to deal with it and a simulation study evaluating their performance , 2013 .

[35]  Martin Charlton,et al.  Living with Collinearity in Local Regression Models , 2012 .