Use of the Nonparametric Nearest Neighbor Approach to Estimate Soil Hydraulic Properties

Nonparametric approaches are being used in various fields to address classification type problems, as well as to estimate continuous variables. One type of the nonparametric lazy learning algorithms, a k-nearest neighbor (k-NN) algorithm has been applied to estimate water retention at 233- and 21500-kPa matric potentials. Performance of the algorithm has subsequently been tested against estimations made by a neural network (NNet) model, developed using the same data and input soil attributes. We used a hierarchical set of inputs using soil texture, bulk density (Db), and organic matter (OM) content to avoid possible bias toward one set of inputs, and varied the size of the data set used to develop the NNet models and to run the k-NN estimation algorithms. Different ‘design-parameter’ settings, analogous to model parameters have been optimized. The kNN technique showed little sensitivity to potential suboptimal settings in terms of how many nearest soils were selected and how those were weighed while formulating the output of the algorithm, as long as extremes were avoided. The optimal settings were, however, dependent on the size of the development/reference data set. The nonparametric k-NN technique performed mostly equally well with the NNet models, in terms of root-mean-squared residuals (RMSRs) and mean residuals (MRs). Gradual reduction of the data set size from 1600 to 100 resulted in only a slight loss of accuracy for both the k-NN and NNet approaches. The k-NN technique is a competitive alternative to other techniques to develop pedotransfer functions (PTFs), especially since redevelopment of PTFs is not necessarily needed as new data become available.

[1]  Roger W. Johnson,et al.  An Introduction to the Bootstrap , 2001 .

[2]  James W. Jones,et al.  DYNAMIC NEAREST-NEIGHBOR METHOD FOR ESTIMATING SOIL WATER PARAMETERS , 2004 .

[3]  Upmanu Lall,et al.  A Nearest Neighbor Bootstrap For Resampling Hydrologic Time Series , 1996 .

[4]  David G. Tarboton,et al.  A Nonparametric Wet/Dry Spell Model for Resampling Daily Precipitation , 1996 .

[5]  Walter J. Rawls,et al.  Estimating Soil Water Retention from Soil Physical Properties and Characteristics , 1991 .

[6]  Ashish Sharma,et al.  A comparative study of Markov chain Monte Carlo methods for conceptual rainfall‐runoff modeling , 2004 .

[7]  M. Tapkenhinrichs,et al.  Evaluation of Pedo-Transfer Functions , 1993 .

[8]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[9]  Budiman Minasny,et al.  Comparison of different approaches to the development of pedotransfer functions for water-retention curves , 1999 .

[10]  Upmanu Lall,et al.  Streamflow simulation: A nonparametric approach , 1997 .

[11]  J.H.M. Wösten,et al.  Testing an Artificial Neural Network for Predicting Soil Hydraulic Conductivity , 1996 .

[12]  Budiman Minasny,et al.  The neuro-m method for fitting neural network parametric pedotransfer functions , 2002 .

[13]  Marc Van Meirvenne,et al.  Evaluation of Pedotransfer Functions for Predicting the Soil Moisture Retention Curve , 2001 .

[14]  Upmanu Lall,et al.  Seasonal to interannual ensemble streamflow forecasts for Ceara, Brazil: Applications of a multivariate, semiparametric algorithm , 2003 .

[15]  S. Yakowitz,et al.  Nearest Neighbor Methods for Time Series, with Application to Rainfall/Runoff Prediction , 1987 .

[16]  Marcel G. Schaap,et al.  Functional evaluation of pedotransfer functions derived from different scales of data collection , 2003 .

[17]  Van Genuchten,et al.  A closed-form equation for predicting the hydraulic conductivity of unsaturated soils , 1980 .

[18]  S. Yakowitz,et al.  Nearest‐neighbor methods for nonparametric rainfall‐runoff forecasting , 1987 .

[19]  Upmanu Lall,et al.  Multisite disaggregation of monthly to daily streamflow , 2000 .

[20]  Andrew W. Moore,et al.  Locally Weighted Learning , 1997, Artificial Intelligence Review.

[21]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[22]  H.W.G. Booltink,et al.  Neural network models to predict soil water retention , 1999 .

[23]  M. Schaap,et al.  ROSETTA: a computer program for estimating soil hydraulic parameters with hierarchical pedotransfer functions , 2001 .

[24]  S. Sorooshian,et al.  Comparison of pedotransfer functions to compute water holding capacity using the Van Genuchten model in inorganic soils , 1999 .

[25]  Marcel G. Schaap,et al.  Improved Prediction of Unsaturated Hydraulic Conductivity with the Mualem‐van Genuchten Model , 2000 .

[26]  David G. Tarboton,et al.  Nonhomogeneous Markov Model for Daily Precipitation , 1996 .

[27]  Susan A. Murphy,et al.  Monographs on statistics and applied probability , 1990 .

[28]  M. Schaap,et al.  Neural network analysis for hierarchical prediction of soil hydraulic properties , 1998 .

[29]  David G. Tarboton,et al.  Disaggregation procedures for stochastic hydrology based on nonparametric density estimation , 1998 .

[30]  Upmanu Lall,et al.  A nonparametric approach for daily rainfall simulation , 1999 .

[31]  Y. Pachepsky,et al.  Artificial Neural Networks to Estimate Soil Water Retention from Easily Measurable Data , 1996 .

[32]  Jeffrey S. Kern,et al.  Evaluation of Soil Water Retention Models Based on Basic Soil Physical Properties , 1995 .

[33]  Attila Nemes,et al.  Evaluation of different procedures to interpolate particle-size distributions to achieve compatibility within soil databases , 1999 .

[34]  S. Yakowitz Nearest neighbor regression estimation for null-recurrent Markov time series , 1993 .

[35]  Ashish Sharma,et al.  A nonparametric approach for representing interannual dependence in monthly streamflow sequences , 2002 .

[36]  V. Snyder Statistical Hydraulic Conductivity Models and Scaling of Capillary Phenomena in Porous Media , 1996 .

[37]  Ashish Sharma,et al.  A nonparametric model for stochastic generation of daily rainfall occurrence , 2003 .

[38]  Claire Cardie,et al.  Examining Locally Varying Weights for Nearest Neighbor Algorithms , 1997, ICCBR.

[39]  David W. Aha,et al.  A Review and Empirical Evaluation of Feature Weighting Methods for a Class of Lazy Learning Algorithms , 1997, Artificial Intelligence Review.

[40]  Kenneth Strzepek,et al.  A technique for generating regional climate scenarios using a nearest‐neighbor algorithm , 2003 .

[41]  Ashish Sharma,et al.  A nonparametric model for stochastic generation of daily rainfall amounts , 2003 .

[42]  Marcel G. Schaap,et al.  Database-related accuracy and uncertainty of pedotransfer functions , 1998 .

[43]  A. Sankarasubramanian,et al.  Flood quantiles in a changing climate: Seasonal forecasts and causal relations , 2003 .

[44]  Walter J. Rawls,et al.  Pedotransfer functions: bridging the gap between available basic soil data and missing soil hydraulic characteristics , 2001 .

[45]  James W. Jones,et al.  Wading through a swamp of complete confusion: how to choose a method for estimating soil water retention parameters for crop models , 2002 .

[46]  Balaji Rajagopalan,et al.  A resampling procedure for generating conditioned daily weather sequences , 2004 .

[47]  David G. Tarboton,et al.  Multivariate nonparametric resampling scheme for generation of daily weather variables , 1997 .

[48]  J.H.M. Wösten,et al.  Evaluation of pedotransfer functions , 2004 .