Probability mapping of soil thickness by random survival forest at a national scale

Soil thickness (ST) is a crucial factor in earth surface modelling and soil storage capacity calculations (e.g., available water capacity and carbon stocks). However, the observed depths recorded in soil information systems for some profiles are often less than the actual ST (i.e., right censored data). The use of such data will negatively affect model and map accuracy, yet few studies have been done to resolve this issue or propose methods to correct for right censored data. Therefore, this work demonstrates how right censored data can be accounted for in the ST modelling of mainland France. We propose the use of Random Survival Forest (RSF) for ST probability mapping within a Digital Soil Mapping framework. Among 2109 sites of the French Soil Monitoring Network, 1089 observed STs were defined as being right censored. Using RSF, the probability of exceeding a given depth was modelled using freely available spatial data representing the main soil-forming factors. Subsequently, the models were extrapolated to the full spatial extent of mainland France. As examples, we produced maps showing the probability of exceeding the thickness of each GlobalSoilMap standard depth: 5, 15, 30, 60, 100, and 200 cm. In addition, a bootstrapping approach was used to assess the 90% confidence intervals. Our results showed that RSF was able to correct for right censored data entries occurring within a given dataset. RSF was more reliable for thin (0.3 m) and thick soils (1 to 2 m), as they performed better (overall accuracy from 0.793 to 0.989) than soils with a thickness between 0.3 and 1 m. This study provides a new approach for modelling right censored soil information. Moreover, RSF can produce probability maps at any depth less than the maximum depth of the calibration data, which is of great value for designing additional sampling campaigns and decision making in geotechnical engineering.

[1]  M. Deurer,et al.  Soil Ecosystem Services , 2011 .

[2]  B. Minasny,et al.  Kriging method evaluation for assessing the spatial distribution of urban soil lead contamination. , 2002, Journal of environmental quality.

[3]  H. Jenny Factors of Soil Formation: A System of Quantitative Pedology , 2011 .

[4]  Johan Bouma,et al.  The significance of soils and soil science towards realization of the United Nations sustainable development goals , 2016 .

[5]  Dominique Arrouays,et al.  GlobalSoilMap France: High-resolution spatial modelling the soils of France up to two meter depth. , 2016, The Science of the total environment.

[6]  Jean-Louis Roujean,et al.  ECOCLIMAP-II/Europe: a twofold database of ecosystems and surface parameters at 1 km resolution based on satellite information for use in land surface, meteorological and climate models , 2012 .

[7]  T. Hengl,et al.  Mapping the global depth to bedrock for land surface modeling , 2017 .

[8]  L. Boruvka Soil depth prediction supported by primary terrain attributes: a comparison of methods , 2018 .

[9]  B. Minasny,et al.  Digital Soil Map of the World , 2009, Science.

[10]  David R. Montgomery,et al.  A process-based model for colluvial soil depth and shallow landsliding using digital elevation data , 1995 .

[11]  E. Kaplan,et al.  Nonparametric Estimation from Incomplete Observations , 1958 .

[12]  Dominique Arrouays,et al.  National estimation of soil organic carbon storage potential for arable soils: A data-driven approach coupled with carbon-landscape zones. , 2019, The Science of the total environment.

[13]  W. Dietrich,et al.  The soil production function and landscape equilibrium , 1997, Nature.

[14]  Roger G. Barry,et al.  Spatial and temporal variability in active layer thickness over the Russian Arctic drainage basin , 2005 .

[15]  V. L. Mulder,et al.  Probability mapping of iron pan presence in sandy podzols in South-West France, using digital soil mapping , 2017 .

[16]  N. Batjes,et al.  Total carbon and nitrogen in the soils of the world , 1996 .

[17]  E. Barriuso,et al.  Which persistent organic pollutants can we map in soil using a large spacing systematic soil monitoring design? A case study in Northern France. , 2011, The Science of the total environment.

[18]  Dominique Arrouays,et al.  Uncertainty assessment of GlobalSoilMap soil available water capacity products: A French case study , 2019, Geoderma.

[19]  A. McBratney,et al.  Further results on prediction of soil properties from terrain attributes: heterotopic cokriging and regression-kriging , 1995 .

[20]  Andrea Vacca,et al.  Rates and spatial variations of soil erosion in Europe: A study based on erosion plot data , 2010 .

[21]  T. G. Orton,et al.  Using measurements close to a detection limit in a geostatistical case study to predict selenium concentration in topsoil , 2009 .

[22]  E. Barriuso,et al.  Analyzing the spatial distribution of PCB concentrations in soils using below-quantification limit data. , 2012, Journal of environmental quality.

[23]  William E. Dietrich,et al.  Cosmogenic nuclides, topography, and the spatial variation of soil depth , 1999 .

[24]  P. Dixon,et al.  Data augmentation for a Bayesian spatial model involving censored observations , 2007 .

[25]  Philippe Lagacherie,et al.  GlobalSoilMap: Toward a Fine-Resolution Global Grid of Soil Properties , 2014 .

[26]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[27]  A-Xing Zhu,et al.  Soil Mapping Using GIS, Expert Knowledge, and Fuzzy Logic , 2001 .

[28]  G. Heuvelink,et al.  A generic framework for spatial prediction of soil variables based on regression-kriging , 2004 .

[29]  Dominique Arrouays,et al.  GlobalSoilMap : Basis of the global spatial soil information system , 2014 .

[30]  C. Post,et al.  Accounting for soil inorganic carbon in the ecosystem services framework for United Nations sustainable development goals , 2018, Geoderma.

[31]  R. Gill,et al.  Cox's regression model for counting processes: a large sample study : (preprint) , 1982 .

[32]  Gary A. Peterson,et al.  Soil Attribute Prediction Using Terrain Analysis , 1993 .

[33]  Alex B. McBratney,et al.  Elucidation of soil-landform interrelationships by canonical ordination analysis , 1991 .

[34]  Charles E. Kellogg,et al.  Soil Survey Manual , 2017 .

[35]  Dominique Arrouays,et al.  Evaluating large-extent spatial modeling approaches: A case study for soil depth for France , 2016 .

[36]  Hemant Ishwaran,et al.  Evaluating Random Forests for Survival Analysis using Prediction Error Curves. , 2012, Journal of statistical software.

[37]  Budiman Minasny,et al.  A conditioned Latin hypercube method for sampling in the presence of ancillary information , 2006, Comput. Geosci..

[38]  D. Brus,et al.  Operationalizing digital soil mapping for nationwide updating of the 1:50,000 soil map of the Netherlands , 2015 .

[39]  J. Hassett,et al.  Power function decay of hydraulic conductivity for a TOPMODEL-based infiltration routine , 2006 .

[40]  Hans-Jörg Vogel,et al.  A systemic approach for modeling soil functions , 2018 .

[41]  Daniel Joly,et al.  Les types de climats en France, une construction spatiale , 2010 .

[42]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[43]  B. Minasny,et al.  A mechanistic model to predict soil thickness in a valley area of Rio Grande do Sul, Brazil , 2018 .

[44]  James C. Bell,et al.  Soil drainage class probability mapping using a soil-landscape model , 1994 .

[45]  J. Sterne,et al.  Development and validation of a prognostic model for survival time data: application to prognosis of HIV positive patients treated with antiretroviral therapy , 2004, Statistics in medicine.

[46]  David G. Chandler,et al.  Modeling soil depth from topographic and land cover attributes , 2009 .

[47]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[48]  Dominique Arrouays,et al.  Fine resolution map of top- and subsoil carbon sequestration potential in France. , 2018, The Science of the total environment.

[49]  Gerard B.M. Heuvelink,et al.  Mapping rootable depth and root zone plant-available water holding capacity of the soil of sub-Saharan Africa , 2018, Geoderma.

[50]  Rik Leemans,et al.  Ecosystems and human well-being : policy responses , 2005 .

[51]  Harold S. J. Zald,et al.  Influence of soil thickness on stand characteristics in a Sierra Nevada mixed-conifer forest , 2007, Plant and Soil.

[52]  Gerard Hazeu,et al.  Determining changes and flows in European landscapes 1990–2000 using CORINE land cover data , 2010 .

[53]  Budiman Minasny,et al.  A rudimentary mechanistic model for soil production and landscape development , 1999 .

[54]  Y. Cantón,et al.  Restoring soil functions by means of cyanobacteria inoculation: Importance of soil conditions and species selection , 2018, Land Degradation & Development.

[55]  M. Wiesmeier,et al.  Soil structure as an indicator of soil functions: A review , 2018 .

[56]  David G. Rossiter,et al.  Prediction of soil depth using environmental variables in an anthropogenic landscape, a case study in the Western Ghats of Kerala, India. , 2009 .

[57]  Bayesian Inference and Prediction of Gaussian Random Fields Based on Censored Data , 2005 .

[58]  William E. Dietrich,et al.  Stochastic processes of soil production and transport: erosion rates, topographic variation and cosmogenic nuclides in the Oregon Coast Range , 2001 .

[59]  Luis Samaniego,et al.  Climate Change as Driver for Ecosystem Services Risk and Opportunities , 2019, Atlas of Ecosystem Services.

[60]  Budiman Minasny,et al.  On digital soil mapping , 2003 .

[61]  Craig Rasmussen,et al.  Geomorphically based predictive mapping of soil thickness in upland watersheds , 2009 .

[62]  J. L. Parra,et al.  Very high resolution interpolated climate surfaces for global land areas , 2005 .

[63]  Dominique King,et al.  Improving the kriging of a soil variable using slope gradient as external drift , 1996 .

[64]  A. McBratney,et al.  Spatial variability of soil horizon depth in natural loess-derived soils , 2010 .

[65]  D. Brus,et al.  A comparison of kriging, co-kriging and kriging combined with regression for spatial interpolation of horizon depth with censored observations , 1995 .

[66]  David Morin,et al.  Operational High Resolution Land Cover Map Production at the Country Scale Using Satellite Image Time Series , 2017, Remote. Sens..

[67]  R. Webster,et al.  Mapping heavy metals in polluted soil by disjunctive kriging. , 1996, Environmental pollution.

[68]  F. Harrell,et al.  Evaluating the yield of medical tests. , 1982, JAMA.

[69]  Johan Bouma,et al.  The challenge of soil science meeting society's demands in a “post-truth”, “fact free” world , 2018 .

[70]  Philippe Lagacherie,et al.  Evaluating Digital Soil Mapping approaches for mapping GlobalSoilMap soil properties from legacy data in Languedoc-Roussillon (France) , 2015 .

[71]  Dominique Arrouays,et al.  National versus global modelling the 3D distribution of soil organic carbon in mainland France , 2016 .