Convolutional neural network for simultaneous prediction of several soil properties using visible/near-infrared, mid-infrared, and their combined spectra

Abstract No single instrument can characterize all soil properties because soil is a complex material. With the advancement of technology, laboratories have become equipped with various spectrometers. By fusing output from different spectrometers, better prediction outcomes are expected than using any single spectrometer alone. In this study, model performance from a single spectrometer (visible-near-infrared spectroscopy, vis-NIR or mid-infrared spectroscopy, MIR) was compared to the combined spectrometers (vis-NIR and MIR). We selected a total of 14,594 samples from the Kellogg Soil Survey Laboratory (KSSL) database that had both vis-NIR and MIR spectra along with measurements of sand, clay, total C (TC) content, organic C (OC) content, cation exchange capacity (CEC), and pH. The dataset was randomly split into 75% training (n = 10,946) and the remaining (n = 3,648) as a test set. Prediction models were constructed with partial least squares regression (PLSR) and Cubist tree model. Additionally, we explored the use of a deep learning model, the convolutional neural network (CNN). We investigated various ways to feed spectral data to the CNN, either as one-dimensional (1D) data (as a spectrum) or as two-dimensional (2D) data (as a spectrogram). Compared to the PLSR model, we found that the CNN model provides an average improvement prediction of 33–42% using vis-NIR and 30–43% using MIR spectral data input. The relative accuracy improvement of CNN, when compared to the Cubist regression tree model, ranged between 22 and 36% with vis-NIR and 16–27% with MIR spectral data input. Various methods to fuse the vis-NIR and MIR spectral data were explored. We compared the performance of spectral concatenation (for PLSR and Cubist model), two-channel input method, and outer product analysis (OPA) method (for CNN model). We found that the performance of two-channel 1D CNN model was the best (R2 = 0.95–0.98) followed closely by the OPA with CNN (R2 = 0.93–0.98), Cubist model with spectral concatenation (R2= 0.91–0.97), two-channel 2D CNN model (R2 = 0.90–0.95) and PLSR with spectral concatenation (R2 = 0.87–0.95). Chemometric analysis of spectroscopy data relied on spectral pre-processing methods: such as spectral trimming, baseline correction, smoothing, and normalization before being fed into the model. CNN achieved higher performance than the PLSR and Cubist model without utilizing the pre-processed spectral data. We also found that the predictions using the CNN model retained similar correlations to the actual values in comparison to other models. By doing sensitivity analysis, we identified the important spectral wavelengths variables used by the CNN model to predict various soil properties. CNN is an effective model for modelling soil properties from a large spectral library.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  L. Janik,et al.  Can mid infrared diffuse reflectance analysis replace soil extractions , 1998 .

[3]  Zou Xiaobo,et al.  Variables selection methods in near-infrared spectroscopy. , 2010, Analytica chimica acta.

[4]  K. Shepherd,et al.  Evaluating the utility of mid-infrared spectral subspaces for predicting soil properties , 2016, Chemometrics and intelligent laboratory systems : an international journal sponsored by the Chemometrics Society.

[5]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[6]  António S. Barros,et al.  Infrared spectroscopy and outer product analysis for quantification of fat, nitrogen, and moisture of cocoa powder. , 2007, Analytica chimica acta.

[7]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[8]  Jae Lim,et al.  Signal estimation from modified short-time Fourier transform , 1984 .

[9]  Sebastian Ruder,et al.  An Overview of Multi-Task Learning in Deep Neural Networks , 2017, ArXiv.

[10]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[11]  C. Hurburgh,et al.  Near-Infrared Reflectance Spectroscopy–Principal Components Regression Analyses of Soil Properties , 2001 .

[12]  Frans van den Berg,et al.  Review of the most common pre-processing techniques for near-infrared spectra , 2009 .

[13]  Dandan Wang,et al.  Synthesized use of VisNIR DRS and PXRF for soil characterization: Total carbon and total nitrogen☆ , 2015 .

[14]  A. McBratney,et al.  Critical review of chemometric indicators commonly used for assessing the quality of the prediction of soil attributes by NIR spectroscopy , 2010 .

[15]  Keith D. Shepherd,et al.  Prediction of Soil Fertility Properties from a Globally Distributed Soil Mid-Infrared Spectral Library , 2010 .

[16]  J. M. Soriano-Disla,et al.  The Performance of Visible, Near-, and Mid-Infrared Reflectance Spectroscopy for Prediction of Soil Physical, Chemical, and Biological Properties , 2014 .

[17]  R. V. Rossel,et al.  Visible and near infrared spectroscopy in soil science , 2010 .

[18]  Vijay S. Pande,et al.  Massively Multitask Networks for Drug Discovery , 2015, ArXiv.

[19]  Gustavo Carneiro,et al.  Multi-channel Convolutional Neural Network Ensemble for Pedestrian Detection , 2017, IbPRIA.

[20]  K. Shepherd,et al.  Development of Reflectance Spectral Libraries for Characterization of Soil Properties , 2002 .

[21]  Viacheslav I. Adamchuk,et al.  A global spectral library to characterize the world’s soil , 2016 .

[22]  Masakazu Matsugu,et al.  Subject independent facial expression recognition with robust face detection using a convolutional neural network , 2003, Neural Networks.

[23]  Raphael A. Viscarra Rossel,et al.  Spectral libraries for quantitative analyses of tropical Brazilian soils: Comparing vis–NIR and mid-IR reflectance data , 2015 .

[24]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[25]  António S. Barros,et al.  Outer-product analysis (OPA) using PCA to study the influence of temperature on NIR spectra of water ☆ , 2005 .

[26]  Alex B. McBratney,et al.  Simultaneous estimation of several soil properties by ultra-violet, visible, and near-infrared reflectance spectroscopy , 2003 .

[27]  R. V. Rossel,et al.  Spectral fusion by Outer Product Analysis (OPA) to improve predictions of soil organic C , 2019, Geoderma.

[28]  Keith D. Shepherd,et al.  Soil Spectroscopy: An Alternative to Wet Chemistry for Soil Monitoring , 2015 .

[29]  Hoeil Chung,et al.  Random forest as a potential multivariate method for near-infrared (NIR) spectroscopic analysis of complex mixture samples: Gasoline and naphtha , 2013 .

[30]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[31]  C. Johnston Infrared Studies of Clay Mineral-Water Interactions , 2017 .

[32]  G. Willgoose,et al.  Vertical distribution of charcoal in a sandy soil: evidence from DRIFT spectra and field emission scanning electron microscopy , 2014 .

[33]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[34]  Andy Liaw,et al.  Demystifying Multitask Deep Neural Networks for Quantitative Structure-Activity Relationships , 2017, J. Chem. Inf. Model..

[35]  Ricard Boqué,et al.  Data fusion methodologies for food and beverage authentication and quality assessment - a review. , 2015, Analytica chimica acta.

[36]  Budiman Minasny,et al.  Using deep learning to predict soil properties from regional spectral data , 2019, Geoderma Regional.

[37]  Suhas P. Wani,et al.  Variable indicators for optimum wavelength selection in diffuse reflectance spectroscopy of soils , 2016 .

[38]  L. Duponchel,et al.  Support vector machines (SVM) in near infrared (NIR) spectroscopy: Focus on parameters optimization and model interpretation , 2009 .

[39]  Xudong Sun,et al.  NIR sensitive wavelength selection based on different methods , 2010, 2010 International Conference on Mechanic Automation and Control Engineering.

[40]  Martial Bernoux,et al.  National calibration of soil organic carbon concentration using diffuse infrared reflectance spectroscopy , 2016 .

[41]  B. Minasny,et al.  Regression rules as a tool for predicting soil properties from infrared reflectance spectroscopy , 2008 .

[42]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[43]  Gang Hua,et al.  Ordinal Regression with Multiple Output CNN for Age Estimation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Nitin K. Tripathi,et al.  Artificial neural network analysis of laboratory and in situ spectra for the estimation of macronutrients in soils of Lop Buri (Thailand) , 2003 .

[45]  Budiman Minasny,et al.  Synergistic Use of Vis-NIR, MIR, and XRF Spectroscopy for the Determination of Soil Geochemistry , 2016 .

[46]  G. McCarty,et al.  Mid-Infrared and Near-Infrared Diffuse Reflectance Spectroscopy for Soil Carbon Measurement , 2002 .

[47]  J. Sanderman,et al.  Accurate and Precise Prediction of Soil Properties from a Large Mid-Infrared Spectral Library , 2019, Soil Systems.

[48]  O. Francioso,et al.  Recent Applications Of Vibrational Mid-Infrared (Ir) Spectroscopy For Studying Soil Components: A Review , 2015 .

[49]  J. M. Soriano-Disla,et al.  Total Petroleum Hydrocarbon Concentration Prediction in Soils Using Diffuse Reflectance Infrared Spectroscopy , 2013 .

[50]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.