Building a pedotransfer function for soil bulk density on regional dataset and testing its validity over a larger area

Abstract Though soil bulk density (BD) is important for crop growth and land management, information about BD is often missing in soil database due to the fact that the determination of BD is time consuming and labor intensive. In order to fill this gap, pedotransfer functions (PTFs) have been developed for predicting bulk density during last decades. To avoid non-valid extrapolation, the validity domain of PTFs should be investigated. In this study, a PTFs for bulk density was built using GBM model on data from Region Centre and we tested its validity on data from mainland France except Region Centre. Standardized Euclidean distance was applied to distinguish dissimilar soil samples from Region Centre data and other regions data. The established PTFs were not suitable for predicting these dissimilar samples. In order to make a balance between prediction accuracy and the number of samples can be predicted with our PTFs, different cutoff limits of Standardized Euclidean distance was compared. When optimized distance cutoff limit was at 97%, compared to result without excluding dissimilar samples, RMSPE value on test data decreased from 0.179 to 0.163 g cm − 3 . An additional sampling strategy based on these dissimilar samples can improve the predictive ability of PTFs. We also suggest that the purposive sampling strategy or the systematic and dense sampling strategy will do large contribution in robust modelling of PTFs.

[1]  M. Bernoux,et al.  Comparison between analytical methods for organic carbon and organic matter determination in sandy Spodosols of France , 1998 .

[2]  M. Meirvenne,et al.  Predictive Quality of Pedotransfer Functions for Estimating Bulk Density of Forest Soils , 2005 .

[3]  C. Federer Nitrogen Mineralization and Nitrification: Depth Variation in Four New England Forest Soils , 1983 .

[4]  Chandra A. Madramootoo,et al.  Soil bulk density and crop yield under eleven consecutive years of corn with different tillage and residue practices in a sandy loam soil in central Canada , 2005 .

[5]  D. Jeffrey A note on the use of ignition loss as a means for the approximate estimation of soil bulk density. , 1970 .

[6]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[7]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[8]  Budiman Minasny,et al.  Building and testing conceptual and empirical models for predicting soil bulk density , 2007 .

[9]  T. G. Orton,et al.  Evaluation of modelling approaches for predicting the spatial distribution of soil organic carbon stocks at the national scale , 2014, 1502.02513.

[10]  Philippe Lagacherie,et al.  GlobalSoilMap: Toward a Fine-Resolution Global Grid of Soil Properties , 2014 .

[11]  Jerome H Friedman,et al.  Multiple additive regression trees with application in epidemiology , 2003, Statistics in medicine.

[12]  Elaine Cristina Cardoso Fidalgo,et al.  PEDOTRANSFER FUNCTIONS FOR ESTIMATING SOIL BULK DENSITY FROM EXISTING SOIL SURVEY REPORTS IN BRAZIL , 2007 .

[13]  B. Minasny,et al.  Using distance metrics to determine the appropriate domain of pedotransfer function predictions. , 2009 .

[14]  Modelling soil particle-size distribution in the region 'Centre' (France) , 2014 .

[15]  Dominique Arrouays,et al.  Optimizing pedotransfer functions for estimating soil bulk density using boosted regression trees. , 2009 .

[16]  S. Kaneko,et al.  A pedotransfer function for estimating bulk density of forest soil in Japan affected by volcanic ash , 2014 .

[17]  P. Filzmoser,et al.  Outlier Detection for Compositional Data Using Robust Methods , 2008 .

[18]  Cathy A. Seybold,et al.  Predicting soil bulk density for incomplete databases , 2014 .

[19]  Jan De Pue,et al.  Hierarchical Pedotransfer Functions to Predict Bulk Density of Highly Weathered Soils in Central Africa , 2015 .

[20]  Axel Don,et al.  Soil organic carbon stocks are systematically overestimated by misuse of the parameters bulk density and rock fragment content , 2016 .

[21]  Dominique Arrouays,et al.  Mapping soil Pb stocks and availability in mainland France combining regression trees with robust geostatistics , 2012 .

[22]  D. Timlin,et al.  Ensemble Approach to Provide Uncertainty Estimates of Soil Bulk Density , 2010 .

[23]  Inakwu O. A. Odeh,et al.  Enhancing pedotransfer functions with environmental data for estimating bulk density and effective cation exchange capacity in a data‐sparse situation , 2016 .

[24]  N. Batjes,et al.  Pedotransfer functions to estimate bulk density from soil properties and environmental covariates: Rio Doce basin , 2016 .

[25]  T. Keller,et al.  Estimation of reference bulk density from soil particle size distribution and soil organic matter content , 2010 .

[26]  Martial Bernoux,et al.  Bulk Densities of Brazilian Amazon Soils Related to Other Soil Properties , 1998 .

[27]  Javier Tomasella,et al.  Estimating soil water retention characteristics from limited data in Brazilian Amazonia , 1998 .

[28]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[29]  Budiman Minasny,et al.  A conditioned Latin hypercube method for sampling in the presence of ancillary information , 2006, Comput. Geosci..

[30]  Dominique Arrouays,et al.  Spatial distribution of soil organic carbon stocks in France , 2010 .

[31]  P. Rousseeuw,et al.  Unmasking Multivariate Outliers and Leverage Points , 1990 .

[32]  L. Montanarella,et al.  Estimating forest soil bulk density using boosted regression modelling , 2010 .

[33]  P. Jardine,et al.  Using soil physical and chemical properties to estimate bulk density , 2005 .

[34]  W. A. Adams THE EFFECT OF ORGANIC MATTER ON THE BULK AND TRUE DENSITIES OF SOME UNCULTIVATED PODZOLIC SOILS , 1973 .

[35]  Sanjeev Kumar,et al.  A pedo-transfer function (PTF) for estimating soil bulk density from basic soil data and its comparison with existing PTFs , 2002 .

[36]  Budiman Minasny,et al.  Comparison of different approaches to the development of pedotransfer functions for water-retention curves , 1999 .

[37]  L. Manrique,et al.  BULK DENSITY OF SOILS IN RELATION TO SOIL PHYSICAL AND CHEMICAL PROPERTIES , 1991 .

[38]  Dominique Arrouays,et al.  Generic Issues on Broad-Scale Soil Monitoring Schemes: A Review , 2012 .

[39]  E. Alexander Bulk densities of California soils in relation to other soil properties. , 1980 .