Predicting and mapping the soil available water capacity of Australian wheatbelt

Abstract Soil available water capacity (AWC) is the main source of water for vegetation and it is the potential amount of water available for atmospheric exchange. Studying its spatial distribution is crucial for agricultural planning and management and for use in biophysical modelling. The aim of this work is to obtain a continuous spatial prediction of AWC over Australia's wheatbelt (about 1.75 million km 2 ), using digital soil mapping techniques. We used a data set of 806 soil profiles which have field measurements of drainage upper limit (DUL) and crop lower limit (CLL). We mapped AWC at five depth intervals (0–5, 5–15, 15–30, 30–60, and 60–100 cm) with the help of different combinations of environmental information (topographic, climatic, soils, landsat imagery, gamma-ray spectrometry) as covariates. The modelling techniques used were symbolic regression (GP), Cubist, and support vector machines (SVM). We also tried two averaging methods to generate an ensemble model. We observed decreasing RMSE values with the addition of extra covariates and also an expected decreasing soil depth. In general, SVM produced the best accuracy. We were able to improve the predictions using one of the ensemble techniques, based on a weighted average of GP, Cubist and SVM model. The map generated with the optimal ensemble model was an unrealistic representation of AWC therefore we decided to present a sub-optimal model as the final map. We stress the need to not only focus on the numerical performance in order to obtain a flexible and stable model, but also a coherent visual representation without anomalies.

[1]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[2]  A. Johari,et al.  Prediction of Soil-Water Characteristic Curve Using Genetic Programming , 2006 .

[3]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[4]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[5]  J. A. Prescott,et al.  A CLIMATIC INDEX FOR THE LEACHING FACTOR IN SOIL FORMATION , 1950 .

[6]  Laura Poggio,et al.  Soil available water capacity interpolation and spatial uncertainty modelling at multiple geographical extents , 2010 .

[7]  Alex B. McBratney,et al.  Modelling soil attribute depth functions with equal-area quadratic smoothing splines , 1999 .

[8]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[9]  Budiman Minasny,et al.  Empirical estimates of uncertainty for mapping continuous depth functions of soil attributes , 2011 .

[10]  C. Granger,et al.  Improved methods of combining forecasts , 1984 .

[11]  Pierre Goovaerts,et al.  Geostatistical modelling of uncertainty in soil science , 2001 .

[12]  Jasper A. Vrugt,et al.  Comparison of point forecast accuracy of model averaging methods in hydrologic applications , 2010 .

[13]  Budiman Minasny,et al.  Prediction and digital mapping of soil carbon storage in the Lower Namoi Valley , 2006 .

[14]  Cort J. Willmott,et al.  GLOBAL DISTRIBUTION OF PLANT‐EXTRACTABLE WATER CAPACITY OF SOIL , 1996 .

[15]  David Haussler,et al.  Occam's Razor , 1987, Inf. Process. Lett..

[16]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[17]  Zvi Hochman,et al.  Contributions of soil and crop factors to plant available soil water capacity of annual crops on Black and Grey Vertosols , 2001 .

[18]  B. Minasny,et al.  Using soil knowledge for the evaluation of mid‐infrared diffuse reflectance spectroscopy for predicting soil physical and mechanical properties , 2008 .

[19]  J. Wilford A weathering intensity index for the Australian continent using airborne gamma-ray spectrometry and digital terrain analysis , 2012 .

[20]  R. B. Jackson,et al.  THE VERTICAL DISTRIBUTION OF SOIL ORGANIC CARBON AND ITS RELATION TO CLIMATE AND VEGETATION , 2000 .

[21]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[22]  Budiman Minasny,et al.  On digital soil mapping , 2003 .

[23]  Ni-Bin Chang,et al.  Soil moisture estimation in a semiarid watershed using RADARSAT‐1 satellite imagery and genetic programming , 2006 .

[24]  M. Carr Determining the Optimum Number of Predictors for a Linear Prediction Equation , 1988 .

[25]  Amin Elshorbagy,et al.  Modelling the dynamics of the evapotranspiration process using genetic programming , 2007 .

[26]  John R. Koza,et al.  Genetic Programming III: Darwinian Invention & Problem Solving , 1999 .

[27]  R. Clemen Combining forecasts: A review and annotated bibliography , 1989 .

[28]  H. Reuter,et al.  Functional Digital Soil Mapping for the Prediction of Available Water Capacity in Nigeria using Legacy Data , 2013 .

[29]  Budiman Minasny,et al.  Estimating Pedotransfer Function Prediction Limits Using Fuzzy k-Means with Extragrades , 2010 .

[30]  R. M. Lark,et al.  Using expert knowledge with control of false discovery rate to select regressors for prediction of soil properties , 2007 .

[31]  F. J. Veihmeyer,et al.  METHODS OF MEASURING FIELD CAPACITY AND PERMANENT WILTING PERCENTAGE OF SOILS , 1949 .

[32]  Alex B. McBratney,et al.  An overview of pedometric techniques for use in soil survey , 2000 .

[33]  M. A. Foale,et al.  Re-inventing model-based decision support with Australian dryland farmers. 2. Pragmatic provision of soil information for paddock-specific simulation and farmer decision making , 2009 .

[34]  D. K. Cassel,et al.  Field‐Measured Limits of Soil Water Availability as Related to Laboratory‐Measured Properties , 1983 .

[35]  Sherif Hashem,et al.  Optimal Linear Combinations of Neural Networks , 1997, Neural Networks.

[36]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[37]  B. Henderson,et al.  Australia-wide predictions of soil properties using decision trees , 2005 .

[38]  Budiman Minasny,et al.  Predicting and mapping soil available water capacity in Korea , 2013, PeerJ.