Optimal sample selection for measurement of soil organic carbon using on-line vis-NIR spectroscopy

Abstract The selection of samples for modelling of visible and near infrared (vis-NIR) spectra for prediction of soil organic carbon (OC) is a crucial step for improving model prediction performance. This paper aims at comparing three soil sample selection methods coupled with spiking technique for improving on-line prediction performance of OC. Sample selection methods included random selection (RS), Kennard-Stone (KS) algorithm and similarity analysis (SA). Soil vis-NIR spectra was measured with an on-line fibre-type vis-NIR spectrophotometer (tec5 Technology for Spectroscopy, Germany), with a spectral range of 305–2200 nm. A multiple field sample set (268 samples) was merged with samples (148 samples) collected from one target field, and the resulted sample set was subjected to the three sample selection methods. After dividing spectra into calibration and prediction sets, partial least squares regression (PLSR) was run on the calibration set to develop calibration models for OC, and resulted models were validated using samples of the prediction set. Results show that SA performed generally better than its competitors, especially when there were 58 spiked samples used in the calibration set (54% of total spiked samples of 106), with the best residual prediction deviation (RPD) and root mean squares error of prediction (RMSEP) of 2.14–2.54 and 0.16–0.15% for laboratory and on-line prediction. KS and RS performed similarly, but depending on the size of the calibration set, KS produced slightly better models. This indicates that the proposed SA coupled with spiking holds great potential in the optimization of a calibration set size and may serve as a novel and efficient tool for balancing the cost and quality of vis–NIR calibrations for estimating OC.

[1]  Jing Wang,et al.  Mapping soil organic matter based on land degradation spectral response units using Hyperion images , 2010, Int. J. Appl. Earth Obs. Geoinformation.

[2]  R. V. Rossel,et al.  Visible and near infrared spectroscopy in soil science , 2010 .

[3]  J. De Baerdemaeker,et al.  Potential of visible and near-infrared spectroscopy to derive colour groups utilising the Munsell soil colour charts , 2007 .

[4]  Peter Dietrich,et al.  A comparison of calibration sampling schemes at the field scale , 2014 .

[5]  Abdul Mounem Mouazen,et al.  Influence of the number of samples on prediction error of visible and near infrared spectroscopy of selected soil properties at the farm scale , 2011 .

[6]  J. De Baerdemaeker,et al.  On-the-go variable-rate phosphorus fertilisation based on a visible and near-infrared soil sensor , 2008 .

[7]  Dandan Wang,et al.  Synthesized use of VisNIR DRS and PXRF for soil characterization: Total carbon and total nitrogen☆ , 2015 .

[8]  R. V. Rossel,et al.  Visible, near infrared, mid infrared or combined diffuse reflectance spectroscopy for simultaneous assessment of various soil properties , 2006 .

[9]  Sabine Grunwald,et al.  Soil total carbon analysis in Hawaiian soils with visible, near-infrared and mid-infrared diffuse reflectance spectroscopy , 2012 .

[10]  Desire L. Massart,et al.  Representative subset selection , 2002 .

[11]  Neil McKenzie,et al.  Proximal Soil Sensing: An Effective Approach for Soil Measurements in Space and Time , 2011 .

[12]  Bram Govaerts,et al.  NIR Spectroscopy: An Alternative for Soil Analysis , 2012 .

[13]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[14]  Rebecca L. Whetton,et al.  Machine learning based prediction of soil total nitrogen, organic carbon and moisture content by using VIS-NIR spectroscopy , 2016 .

[15]  Charlie Chen,et al.  Digitally mapping the information content of visible–near infrared spectra of surficial Australian soils , 2011 .

[16]  G. McCarty,et al.  Mid-Infrared and Near-Infrared Diffuse Reflectance Spectroscopy for Soil Carbon Measurement , 2002 .

[17]  Ronald D. Snee,et al.  Validation of Regression Models: Methods and Examples , 1977 .

[18]  B. Stenberg,et al.  Near‐infrared spectroscopy for within‐field soil characterization: small local calibrations compared with national libraries spiked with local samples , 2010 .

[19]  Abdul Mounem Mouazen,et al.  Comparison between artificial neural network and partial least squares for on-line visible and near infrared spectroscopy measurement of soil organic carbon, pH and clay content , 2015 .

[20]  Thorsten Behrens,et al.  Sampling optimal calibration sets in soil infrared spectroscopy , 2014 .

[21]  Panos Panagos,et al.  Prediction of soil organic carbon content by diffuse reflectance spectroscopy using a local partial least square regression approach , 2014 .

[22]  Tereza Zádorová,et al.  Uncertainty propagation in VNIR reflectance spectroscopy soil organic carbon mapping , 2013 .

[23]  Claudy Jolivet,et al.  Optimization criteria in sample selection step of local regression for quantitative analysis of large soil NIRS database , 2012 .

[24]  Thorsten Behrens,et al.  Distance and similarity-search metrics for use with soil vis-NIR spectra , 2013 .

[25]  H. S. Mahmood,et al.  Sensing soil properties in the laboratory, in situ, and on-Line: A review , 2012 .

[26]  A. Mouazen,et al.  Effect of moisture content on prediction of organic carbon and pH using visible and near infrared spectroscopy. , 2010 .

[27]  Viacheslav I. Adamchuk,et al.  A global spectral library to characterize the world’s soil , 2016 .

[28]  Claudy Jolivet,et al.  Which strategy is best to predict soil properties of a local site from a national Vis–NIR database? , 2014 .

[29]  Abdul Mounem Mouazen,et al.  Predictive performance of mobile vis-near infrared spectroscopy for key soil properties at different geographical scales by using spiking and data mining techniques , 2017 .

[30]  Abdul Mounem Mouazen,et al.  Assessment of soil organic carbon at local scale with spiked NIR calibrations: effects of selection and extra-weighting on the spiking subset , 2013 .

[31]  M. Schaepman,et al.  Spectral reflectance based indices for soil organic carbon quantification , 2008 .

[32]  Rick L. Lawrence,et al.  Comparing local vs. global visible and near-infrared (VisNIR) diffuse reflectance spectroscopy (DRS) calibrations for the prediction of soil clay, organic C and inorganic C , 2008 .

[33]  Abdul Mounem Mouazen,et al.  Do we really need large spectral libraries for local scale SOC assessment with NIR spectroscopy , 2016 .

[34]  Robert D. Clark,et al.  OptiSim: An Extended Dissimilarity Selection Method for Finding Diverse Representative Subsets , 1997, J. Chem. Inf. Comput. Sci..

[35]  Keith D. Shepherd,et al.  Soil Spectroscopy: An Alternative to Wet Chemistry for Soil Monitoring , 2015 .

[36]  Giorgio Matteucci,et al.  Effect of calibration set size on prediction at local scale of soil carbon by Vis-NIR spectroscopy , 2017 .

[37]  César Guerrero,et al.  Spiking of NIR regional models using samples from target sites: effect of model size on prediction accuracy. , 2010 .

[38]  L. A. Stone,et al.  Computer Aided Design of Experiments , 1969 .

[39]  R. V. Rossel,et al.  Using data mining to model and interpret soil diffuse reflectance spectra. , 2010 .

[40]  Yongjie Li,et al.  Prediction of soil organic matter content in a litchi orchard of South China using spectral indices , 2012 .

[41]  H. Ramon,et al.  Towards development of on-line soil moisture content sensor using a fibre-type NIR spectrophotometer , 2005 .

[42]  Abdul Mounem Mouazen,et al.  Effect of spiking strategy and ratio on calibration of on-line visible and near infrared soil sensor for measurement in European farms , 2013 .

[43]  Karl H. Norris,et al.  Understanding and Correcting the Factors Which Affect Diffuse Transmittance Spectra , 2001 .

[44]  Christian Walter,et al.  Regional predictions of soil organic carbon content from spectral reflectance measurements , 2009 .

[45]  Henning Buddenbaum,et al.  Estimating the soil clay content and organic matter by means of different calibration methods of vis-NIR diffuse reflectance spectroscopy , 2016 .