Mapping soil organic carbon content by geographically weighted regression: A case study in the Heihe River Basin, China

Abstract In large heterogeneous areas the relationship between soil organic carbon (SOC) and environmental covariates may vary throughout the area, bringing about difficulty for accurate modeling of the regional SOC variation. The benefit of local, geographically weighted regression (GWR) coefficients was tested in a case study on soil organic carbon mapping across a 50,810 km 2 area in northwestern China. This area is composed of an alpine ecosystem in the upper reaches and oases in the middle reaches. The benefit was quantified by comparing the quality of the maps obtained by GWR and geographically weighted ridge regression (GWRR) on the one side and multiple linear regression (MLR) on the other side. In these methods spatial dependence of model residuals is ignored. The root mean squared error (RMSE) of predictions of natural log-transformed SOC obtained with GWR was smaller than with MLR: 0.565 versus 0.618 g/kg. The use of a local ridge parameter in GWRR did not lead to an increase in accuracy. Besides we compared the quality of maps obtained by geographically weighted regression followed by simple kriging of model residuals (GWRSK) and kriging with an external drift (KED) with global regression coefficients. In these methods the spatial dependence of model residuals is incorporated in the model. The RMSE with KED was smaller than with GWRSK: 0.515 versus 0.546 g/kg. We conclude that fitting regression coefficients locally as in GWR only paid when no spatial random effect was included in the model. When a spatial random effect was included, the flexibility of local, geographically weighted regression coefficients was not needed and even undesirable as it led to less accurate predictions than KED with global regression coefficients. In comparing the accuracy of prediction methods by leave-one-out cross-validation (LOOCV) of a non-probability sample it is important to account for possible autocorrelation of pairwise differences in the prediction errors. The effective sample sizes were substantially smaller than the total number of sampling points, so that most pairwise differences in MSE were not significant at a significance level of 10% in a two-sided paired t -test.

[1]  J. Tu Spatially varying relationships between land use and water quality across an urbanization gradient explored by geographically weighted regression. , 2011 .

[2]  A. Flores,et al.  A simplified approach for estimating soil carbon and nitrogen stocks in semi-arid complex terrain , 2011 .

[3]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[4]  Sabine Grunwald,et al.  Modeling of Soil Organic Carbon Fractions Using Visible–Near‐Infrared Spectroscopy , 2009 .

[5]  John F. O'Callaghan,et al.  The extraction of drainage networks from digital elevation data , 1984, Comput. Vis. Graph. Image Process..

[6]  Edzer J. Pebesma,et al.  Multivariable geostatistics in S: the gstat package , 2004, Comput. Geosci..

[7]  Keith McCloy,et al.  Predictive mapping of soil organic carbon in wet cultivated lands using classification-tree based models: the case study of Denmark. , 2010, Journal of environmental management.

[8]  Isobel J. Simpson,et al.  Source origins, modeled profiles, and apportionments of halogenated hydrocarbons in the greater Pearl River Delta region, southern China , 2009 .

[9]  Ling Lu,et al.  Estimation of evapotranspiration in an arid region by remote sensing - A case study in the middle reaches of the Heihe River Basin , 2012, Int. J. Appl. Earth Obs. Geoinformation.

[10]  Rattan Lal,et al.  Predicting the spatial variation of the soil organic carbon pool at a regional scale. , 2010 .

[11]  Z. Niu,et al.  Watershed Allied Telemetry Experimental Research , 2009 .

[12]  Sabine Grunwald,et al.  Multi-criteria characterization of recent digital soil mapping and modeling approaches , 2009 .

[13]  F. Veroustraete,et al.  Analysing the forcing mechanisms for net primary productivity changes in the Heihe River Basin, north‐west China , 2009 .

[14]  Marie-Josée Fortin,et al.  Spatial autocorrelation and statistical tests: Some solutions , 2009 .

[15]  J. A. Schell,et al.  Monitoring vegetation systems in the great plains with ERTS , 1973 .

[16]  C. Hurburgh,et al.  Near-Infrared Reflectance Spectroscopy–Principal Components Regression Analyses of Soil Properties , 2001 .

[17]  Rattan Lal,et al.  A geographically weighted regression kriging approach for mapping soil organic carbon stock , 2012 .

[18]  Rosa Francaviglia,et al.  Estimation of soil organic matter by geostatistical methods: Use of auxiliary information in agricultural and environmental assessment , 2014 .

[19]  David Ardia,et al.  Differential Evolution (DEoptim) for Non-Convex Portfolio Optimization , 2010 .

[20]  Frank Canters,et al.  A multiple regression approach to assess the spatial distribution of Soil Organic Carbon (SOC) at the regional scale (Flanders, Belgium) , 2008 .

[21]  Gerard B. M. Heuvelink,et al.  Sampling for validation of digital soil maps , 2011 .

[22]  Chuanrong Zhang,et al.  Predictive mapping of soil total nitrogen at a regional scale: A comparison between geographically weighted regression and cokriging , 2013 .

[23]  Martin Charlton,et al.  The Use of Geographically Weighted Regression for Spatial Prediction: An Evaluation of Models Using Simulated Data Sets , 2010 .

[24]  Peter Scull,et al.  A Top-Down Approach to the State Factor Paradigm for Use in Macroscale Soil Analysis , 2010 .

[25]  Daniel A. Griffith,et al.  Spatial-Filtering-Based Contributions to a Critique of Geographically Weighted Regression (GWR) , 2008 .

[26]  R. M. Lark,et al.  Geostatistical mapping of geomorphic variables in the presence of trend , 2006 .

[27]  P. Moran The Interpretation of Statistical Maps , 1948 .

[28]  David Wheeler,et al.  Multicollinearity and correlation among local regression coefficients in geographically weighted regression , 2005, J. Geogr. Syst..

[29]  S. Fotheringham,et al.  Geographically Weighted Regression , 1998 .

[30]  Sabine Grunwald,et al.  Comparison of multivariate methods for inferential modeling of soil carbon using visible/near-infrared spectra , 2008 .

[31]  Jones Robert,et al.  Soil Classification 2001 , 2002 .

[32]  David C. Wheeler,et al.  Diagnostic tools and remedial methods for collinearity in linear regression models with spatially varying coefficients , 2006 .

[33]  Christopher D. Lloyd,et al.  Nonstationary models for exploring and mapping monthly precipitation in the United Kingdom , 2009 .

[34]  Tianxiang Yue,et al.  Spatially distributed modeling of soil organic matter across China: An application of artificial neural network approach , 2013 .

[35]  D. B. Duncan MULTIPLE RANGE AND MULTIPLE F TESTS , 1955 .

[36]  H. Elsenbeer,et al.  Soil organic carbon concentrations and stocks on Barro Colorado Island — Digital soil mapping using Random Forests analysis , 2008 .

[37]  R. Reese Geostatistics for Environmental Scientists , 2001 .

[38]  Juan Wu The effect of ecological management in the upper reaches of Heihe River , 2011 .

[39]  Martin Charlton,et al.  Geographically weighted regression with a non-Euclidean distance metric: a case study using hedonic house price data , 2014, Int. J. Geogr. Inf. Sci..

[40]  G. Kiely,et al.  Towards spatial geochemical modelling: Use of geographically weighted regression for mapping soil organic carbon contents in Ireland , 2011 .

[41]  L. Zhang,et al.  Comparison of spatial and non-spatial logistic regression models for modeling the occurrence of cloud cover in north-eastern Puerto Rico , 2013 .

[42]  Paul Harris,et al.  Estimating Freshwater Acidification Critical Load Exceedance Data for Great Britain Using Space-Varying Relationship Models , 2011 .

[43]  D. Wheeler Diagnostic Tools and a Remedial Method for Collinearity in Geographically Weighted Regression , 2007 .

[44]  Chenghu Zhou,et al.  Purposive Sampling for Digital Soil Mapping for Areas with Limited Data , 2008 .

[45]  Martin Charlton,et al.  Living with Collinearity in Local Regression Models , 2012 .

[46]  Gary A. Peterson,et al.  Soil Attribute Prediction Using Terrain Analysis , 1993 .