Optimizing pedotransfer functions for estimating soil bulk density using boosted regression trees.

Pedotransfer functions (PTFs) are used to estimate certain soil properties that are difficult and costly to measure from others more easily available. Bulk density is one important soil property. Although not requiring complex analysis, its measurement remains time consuming and is lacking in many soil surveys. For several decades, PTFs have been developed for predicting soil bulk density. Most of these PTFs are suited only for specific agro-pedo-climatic conditions, however, and can be applied only within a limited geographic area. In this study, we derived and experimented with two new PTFs based on a multiple additive regression trees (MART) method, and assessed their performance compared with existing PTFs when applied to a country-level soil database, the Reseau de Mesures de la Qualite des Sols (RMQS) survey network. This database was designed to include the major soil types and land uses in France. The fi rst proposed PTF (Model m) involves only three predictors typically found in the existing PTFs for bulk density (C content and texture) and the second one (Model M) includes eight easily accessible quantitative and qualitative predictors (e.g., soil taxon). Both models signifi cantly outperformed existing PTFs. Without arbitrarily partitioning the data set before fi tting the model, the m and M MART models yielded R2 values of 0.83 and 0.94, respectively. The predictive quality on independent data, assessed using cross-validation, was also improved compared with published PTFs, with R2 reaching 0.62 and 0.66 and root mean square prediction errors of 0.123 and 0.117 Mg m-3 for the two MART models. (Resume d'auteur)

[1]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[2]  Walter J. Rawls,et al.  Probabilistic Approach to the Identification of Input Variables to Estimate Hydraulic Conductivity , 2008 .

[3]  Budiman Minasny,et al.  Building and testing conceptual and empirical models for predicting soil bulk density , 2007 .

[4]  Elaine Cristina Cardoso Fidalgo,et al.  PEDOTRANSFER FUNCTIONS FOR ESTIMATING SOIL BULK DENSITY FROM EXISTING SOIL SURVEY REPORTS IN BRAZIL , 2007 .

[5]  K. M. Nair,et al.  Organic carbon stock map for soils of southern India : a multifactorial approach , 2007 .

[6]  T. Hastie,et al.  Variation in demersal fish species richness in the oceans surrounding New Zealand: an analysis using boosted regression trees , 2006 .

[7]  D. Arrouays,et al.  Geostatistical assessment of Pb in soil around Paris, France. , 2006, The Science of the total environment.

[8]  T. Hancock,et al.  A performance comparison of modern statistical techniques for molecular descriptor selection and retention prediction in chromatographic QSRR studies , 2005 .

[9]  M. Meirvenne,et al.  Predictive Quality of Pedotransfer Functions for Estimating Bulk Density of Forest Soils , 2005 .

[10]  P. Jardine,et al.  Using soil physical and chemical properties to estimate bulk density , 2005 .

[11]  Éric Lepage,et al.  Measuring performance in health care: case-mix adjustment by boosted decision trees , 2004, Artif. Intell. Medicine.

[12]  Emmanuel Ledoux,et al.  The STICS model to predict nitrate leaching following agricultural practices , 2004 .

[13]  Rick L. Lawrence,et al.  Classification of remotely sensed imagery using stochastic gradient boosting as a refinement of classification tree analysis , 2004 .

[14]  J. Liebens,et al.  Influence of estimation procedure on soil organic carbon stock assessment in Flanders, Belgium , 2003 .

[15]  Y. Pachepsky,et al.  Effect of soil organic carbon on soil water retention , 2003 .

[16]  O. Duval,et al.  Use of class pedotransfer functions based on texture and bulk density of clods to generate water retention curves , 2003 .

[17]  Nidal Abu-Hamdeh,et al.  Compaction and subsoiling effects on corn growth and soil bulk density , 2003 .

[18]  Jerome H Friedman,et al.  Multiple additive regression trees with application in epidemiology , 2003, Statistics in medicine.

[19]  Budiman Minasny,et al.  From pedotransfer functions to soil inference systems , 2002 .

[20]  J. Friedman Stochastic gradient boosting , 2002 .

[21]  Walter J. Rawls,et al.  Pedotransfer functions: bridging the gap between available basic soil data and missing soil hydraulic characteristics , 2001 .

[22]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[23]  J Carpenter,et al.  Bootstrap confidence intervals: when, which, what? A practical guide for medical statisticians. , 2000, Statistics in medicine.

[24]  Martial Bernoux,et al.  Bulk Densities of Brazilian Amazon Soils Related to Other Soil Properties , 1998 .

[25]  P. Loveland,et al.  The carbon content of soil and its geographical distribution in Great Britain , 1995 .

[26]  Dominique Arrouays,et al.  MODELING CARBON STORAGE PROFILES IN TEMPERATE FOREST HUMIC LOAMY SOILS OF FRANCE , 1994 .

[27]  Kirk R. Vincent,et al.  Synthesizing bulk density for soils with abundant rock fragments Soil Science Society of America Journal, 58(2), 1994, pp 455–464 , 1994 .

[28]  L. Manrique,et al.  BULK DENSITY OF SOILS IN RELATION TO SOIL PHYSICAL AND CHEMICAL PROPERTIES , 1991 .

[29]  D. F. Grigal,et al.  BULK DENSITY OF SURFACE SOILS AND PEAT IN THE NORTH CENTRAL UNITED STATES , 1989 .

[30]  Arthur H. Johnson,et al.  CARBON, ORGANIC MATTER, AND BULK DENSITY RELATIONSHIPS IN A FORESTED SPODOSOL , 1989 .

[31]  D. Ratkowsky,et al.  The use of ignition loss to estimate bulk density of forest soils , 1989 .

[32]  W. Rawls Estimating soil bulk density from particle size analysis and organic matter content. , 1983 .

[33]  E. Alexander Bulk densities of California soils in relation to other soil properties. , 1980 .

[34]  W. A. Adams THE EFFECT OF ORGANIC MATTER ON THE BULK AND TRUE DENSITIES OF SOME UNCULTIVATED PODZOLIC SOILS , 1973 .

[35]  V. I. Stewart,et al.  QUANTITATIVE PEDOLOGICAL STUDIES ON SOILS DERIVED FROM SILURIAN MUDSTONES: II. THE RELATIONSHIP BETWEEN STONE CONTENT AND THE APPARENT DENSITY OF THE FINE EARTH , 1970 .