A remote sensing adapted approach for soil organic carbon prediction based on the spectrally clustered LUCAS soil database

Abstract The estimation of the soil organic carbon (SOC) content plays an important role for carbon sequestration in the context of climate change, food security and soil degradation. Reflectance spectroscopy has proven to be a promising technique for SOC quantification in the laboratory and increasingly from air- and spaceborne platforms, where hyperspectral imagery provides great potential for mapping SOC on larger scales with regular updates. When applied on larger scales, soil prediction accuracy decreases due to the inhomogeneity of samples. In this paper, we examined if spectral clustering of the LUCAS EU-wide topsoil database is successful without using other covariates than the spectral database and can improve SOC model performance compared to a reference model that was calibrated on the whole database without clustering. Different clustering methodologies were tested, including a k-means clustering based on principal component analysis or based on spectral feature variables, combined with partial least squares regression (PLSR) models, and a clustering based on a local PLSR approach which builds a different multivariate model for each sample to be predicted. Furthermore, in order to allow for subsequent application to hyperspectral remote sensing data, atmospheric water wavelengths were removed from the analyses. The local PLSR approach achieved best results and was additionally applied to LUCAS spectra resampled to the upcoming hyperspectral EnMAP sensor which led to good results: R 2  = 0.66, RMSEP = 5.78 g kg − 1 and RPIQ = 1.93. The k-means clustering approach showed slightly better results than the reference model. Overall, our results showed similar performances for SOC prediction models compared to other approaches using PLSR with a larger spectral range and other soil parameters as covariates. This study shows that (i) it is possible to transfer the local PLSR approach onto a wavelengths reduced spectral library and to predict estimations of SOC at low-cost with reasonable accuracy based on large scale soil databases; and (ii) that the local regression approach is a valuable tool for SOC prediction models based solely on spectral data without the use of other soil covariates.

[1]  J. Hill,et al.  Using Imaging Spectroscopy to study soil properties , 2009 .

[2]  Luis Guanter,et al.  Assessment of the 1.75 μm absorption feature for gypsum estimation using laboratory, air- and spaceborne hyperspectral sensors , 2019, Int. J. Appl. Earth Obs. Geoinformation.

[3]  Roger,et al.  Spectroscopy of Rocks and Minerals , and Principles of Spectroscopy , 2002 .

[4]  R. V. Rossel,et al.  Visible and near infrared spectroscopy in soil science , 2010 .

[5]  Panos Panagos,et al.  Prediction of soil organic carbon content by diffuse reflectance spectroscopy using a local partial least square regression approach , 2014 .

[6]  B. Minasny,et al.  Digital Soil Map of the World , 2009, Science.

[7]  K. Shepherd,et al.  Global soil characterization with VNIR diffuse reflectance spectroscopy , 2006 .

[8]  Rocchina Guarini,et al.  Prisma: The Italian Hyperspectral Mission , 2018, IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium.

[9]  A. Davies An Introduction to near Infrared Spectroscopy , 2005 .

[10]  Bo Stenberg,et al.  Improving the prediction performance of a large tropical vis‐NIR spectroscopic soil library from Brazil by clustering into smaller subsets or use of data mining calibration techniques , 2014 .

[11]  Eyal Ben Dor,et al.  SHALOM – A Commercial Hyperspectral Space Mission , 2015 .

[12]  E. R. Stoner,et al.  REFLECTANCE PROPERTIES OF SOILS , 1986 .

[13]  N. Holden,et al.  Optical sensing and chemometric analysis of soil organic carbon – a cost effective alternative to conventional laboratory methods? , 2011 .

[14]  Alex B. McBratney,et al.  Simultaneous estimation of several soil properties by ultra-violet, visible, and near-infrared reflectance spectroscopy , 2003 .

[15]  L. A. Stone,et al.  Computer Aided Design of Experiments , 1969 .

[16]  Stefan Kaiser,et al.  Simulation of Spatial Sensor Characteristics in the Context of the EnMAP Hyperspectral Mission , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[17]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[18]  S. Hook,et al.  Laboratory Reflectance Spectra of 160 Minerals, 0.4 to 2.5 Micrometers , 1992 .

[19]  M. Schaepman,et al.  Spectral reflectance based indices for soil organic carbon quantification , 2008 .

[20]  Sabine Chabrillat,et al.  Use of hyperspectral images in the identification and mapping of expansive clay soils and the role of spatial resolution , 2002 .

[21]  Keith Paustian,et al.  Measuring and monitoring soil organic carbon stocks in agricultural lands for climate mitigation , 2011 .

[22]  J. Hill Mapping Complex Patterns of Erosion and Stability in Dry Mediterranean Ecosystems , 2000 .

[23]  Luca Montanarella,et al.  Prediction of Soil Organic Carbon at the European Scale by Visible and Near InfraRed Reflectance Spectroscopy , 2013, PloS one.

[24]  G. Hunt Visible and near-infrared spectra of minerals and rocks : I silicate minerals , 1970 .

[25]  Sabine Chabrillat,et al.  Prediction of Common Surface Soil Properties Based on Vis-NIR Airborne and Simulated EnMAP Imaging Spectroscopy Data: Prediction Accuracy and Influence of Spatial Resolution , 2016, Remote. Sens..

[26]  E. Ben-Dor,et al.  Visible and near-infrared (0.4–1.1 μm) analysis of arid and semiarid soils , 1994 .

[27]  Viacheslav I. Adamchuk,et al.  A global spectral library to characterize the world’s soil , 2016 .

[28]  Elaine B. Martin,et al.  Model selection for partial least squares regression , 2002 .

[29]  R. V. Rossel,et al.  Using data mining to model and interpret soil diffuse reflectance spectra. , 2010 .

[30]  Patrick Hostert,et al.  The EnMAP Spaceborne Imaging Spectroscopy Mission for Earth Observation , 2015, Remote. Sens..

[31]  A. McBratney,et al.  Near-infrared (NIR) and mid-infrared (MIR) spectroscopic techniques for assessing the amount of carbon stock in soils – Critical review and research perspectives , 2011 .

[32]  R. Lal,et al.  Soil Carbon Sequestration Impacts on Global Climate Change and Food Security , 2004, Science.

[33]  E. Ben-Dor The reflectance spectra of organic matter in the visible near-infrared and short wave infrared region (400-2500 nm) during a controlled decomposition process , 1997 .

[34]  Thomas Scholten,et al.  The spectrum-based learner: A new local approach for modeling soil vis–NIR spectra of complex datasets , 2013 .

[35]  S. Gaffey,et al.  Spectral reflectance of carbonate minerals in the visible and near infrared (0.35–2.55 um): Anhydrous carbonate minerals , 1987 .

[36]  C. Ballabio,et al.  LUCAS Soil, the largest expandable soil dataset for Europe: a review , 2018 .

[37]  P. Miller,et al.  Validation requirements for diffuse reflectance soil characterization models with a case study of VNIR soil C prediction in Montana , 2005 .

[38]  Hermann Kaufmann,et al.  A Comparison of Feature-Based MLR and PLS Regression Techniques for the Prediction of Three Soil Constituents in a Degraded South African Ecosystem , 2012 .

[39]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[40]  L. Montanarella,et al.  Legal frameworks for soil protection: current development and technical information requirements , 2012 .

[41]  S. Wold,et al.  PLS-regression: a basic tool of chemometrics , 2001 .