An heuristic uncertainty directed field sampling design for digital soil mapping

Abstract Legacy samples are a valuable data source for digital soil mapping. However, these sample sets are often small in size and ad hoc in spatial distribution. Constrained by the limited representativeness of such a sample set, the obtained soil maps are often incomplete in spatial coverage with “gaps” at the locations which cannot be well represented by these samples. The maps may also contain areas of high prediction uncertainty. In order to extend the predicted area and reduce prediction uncertainty, additional samples are needed. This paper presents a sampling design based on prediction uncertainty to select samples which will effectively complement the sparse and ad hoc samples, and maximize the spatial coverage of prediction and minimize prediction uncertainty. A case study in China shows that this sampling scheme was effective in achieving these goals. Compared with stratified random sampling scheme, when the number of additional samples is the same, the produced map using uncertainty directed samples has larger predicted area, and the accuracy of the produced map is higher than that of the maps using stratified random samples. The finding of this study suggests that prediction uncertainty is a useful indicator to aid field sample selection and to complement the legacy data. Furthermore, the mapping accuracy produced using this method can be quantitatively related to the number of additional samples needed which opens a new horizon for digital soil mapping.

[1]  Mykola Pechenizkiy,et al.  Feature Extraction for Classification in Knowledge Discovery Systems , 2003, KES.

[2]  Chenghu Zhou,et al.  Purposive Sampling for Digital Soil Mapping for Areas with Limited Data , 2008 .

[3]  P. Goovaerts Geostatistics in soil science: state-of-the-art and perspectives , 1999 .

[4]  J. Liu,et al.  Predictive soil mapping with limited sample data , 2015 .

[5]  J. W. van Groenigen,et al.  Chapter 14 Designing Spatial Coverage Samples Using the k-means Clustering Algorithm , 2006 .

[6]  Michael Edward Hohn,et al.  An Introduction to Applied Geostatistics: by Edward H. Isaaks and R. Mohan Srivastava, 1989, Oxford University Press, New York, 561 p., ISBN 0-19-505012-6, ISBN 0-19-505013-4 (paperback), $55.00 cloth, $35.00 paper (US) , 1991 .

[7]  Budiman Minasny,et al.  Digital mapping of soil salinity in Ardakan region, central Iran , 2014 .

[8]  H. Jenny Factors of Soil Formation: A System of Quantitative Pedology , 2011 .

[9]  R. Bilonick An Introduction to Applied Geostatistics , 1989 .

[10]  B. Hudson,et al.  The Soil Survey as Paradigm-based Science , 1992 .

[11]  J. W. Groenigen,et al.  Constrained optimisation of soil sampling for minimisation of the kriging variance , 1999 .

[12]  Chenghu Zhou,et al.  Differentiation of soil conditions over low relief areas using feedback dynamic patterns. , 2010 .

[13]  M. C. Jones,et al.  A reliable data-based bandwidth selection method for kernel density estimation , 1991 .

[14]  Tao Pei,et al.  Mapping soil organic matter in small low-relief catchments using fuzzy slope position information , 2012 .

[15]  A. Zhu,et al.  Updating Conventional Soil Maps through Digital Soil Mapping , 2011 .

[16]  A-Xing Zhu,et al.  Multi-scale digital terrain analysis and feature selection for digital soil mapping , 2010 .

[17]  Gong Zi-tong Soil Information System of China (SISChina) and Its Application , 2007 .

[18]  A-Xing Zhu,et al.  Fuzzy soil mapping based on prototype category theory , 2006 .

[19]  Chenghu Zhou,et al.  An adaptive approach to selecting a flow‐partition exponent for a multiple‐flow‐direction algorithm , 2007, Int. J. Geogr. Inf. Sci..

[20]  bak gwansu,et al.  An Adaptive Approach to , 2006 .

[21]  Dongsheng Yu,et al.  Regional patterns of soil organic carbon stocks in China. , 2007, Journal of environmental management.

[22]  Philip K. Hopke,et al.  Variable selection in classification of environmental soil samples for partial least square and neural network models , 2001 .

[23]  André Beaudoin,et al.  Digital mapping of soil properties in Canadian managed forests at 250m of resolution using the k-nearest neighbor method , 2014 .

[24]  Budiman Minasny,et al.  Operational sampling challenges to digital soil mapping in Tasmania, Australia , 2015 .

[25]  A-Xing Zhu,et al.  Prediction of soil properties using fuzzy membership values , 2010 .

[26]  Daniel Markewitz,et al.  A Comparison of Three Field Sampling Methods to Estimate Soil Carbon Content , 2012 .

[27]  Gerard B. M. Heuvelink,et al.  Do more detailed environmental covariates deliver more accurate soil maps , 2015 .

[28]  J. J. de Gruijter,et al.  An R package for spatial coverage sampling and random sampling from compact geographical strata by k-means , 2010, Comput. Geosci..

[29]  W. Ju,et al.  Net primary productivity of China's terrestrial ecosystems from a process model driven by remote sensing. , 2007, Journal of environmental management.

[30]  A-Xing Zhu,et al.  Soil texture mapping over low relief areas using land surface feedback dynamic patterns extracted from MODIS , 2012 .

[31]  A-Xing Zhu,et al.  Measuring Uncertainty in Class Assignment for Natural Resource Maps under Fuzzy Logic , 1997 .

[32]  A-Xing Zhu,et al.  Regional Soil Mapping Using Multi-Grade Representative Sampling and a Fuzzy Membership-Based Mapping Approach , 2017 .

[33]  R. D. Ramsey,et al.  Digitally Mapping Gypsic and Natric Soil Areas Using Landsat ETM Data , 2007 .

[34]  StinsonG.,et al.  Mapping attributes of Canada’s forests at moderate resolution through kNN and MODIS imagery , 2014 .

[35]  Lin Yang,et al.  An integrative hierarchical stepwise sampling strategy for spatial sampling and its application in digital soil mapping , 2011, Int. J. Geogr. Inf. Sci..

[36]  Patrick Bogaert,et al.  Updating soil survey maps using random forest and conditioned Latin hypercube sampling in the loess derived soils of northern Iran , 2014 .

[37]  J. W. van Groenigen,et al.  The influence of variogram parameters on optimal sampling schemes for mapping by kriging , 2000 .

[38]  G. Heuvelink,et al.  Optimization of sample patterns for universal kriging of environmental variables , 2007 .

[39]  Budiman Minasny,et al.  A conditioned Latin hypercube method for sampling in the presence of ancillary information , 2006, Comput. Geosci..

[40]  A-Xing Zhu,et al.  Automated soil inference under fuzzy logic , 1996 .

[41]  A-Xing Zhu,et al.  Fuzzy Representation of Special Terrain Features Using a Similarity-based Approach , 2005 .

[42]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[43]  D. J. Brus,et al.  Random sampling or geostatistical modelling? Choosing between design-based and model-based sampling strategies for soil (with discussion) , 1997 .

[44]  Budiman Minasny,et al.  On digital soil mapping , 2003 .

[45]  A-Xing Zhu,et al.  A Knowledge-Based Approach to Data Integration for Soil Mapping , 1994 .