A newly developed method to extract the optimal hyperspectral feature for monitoring leaf biomass in wheat

Abstract Hyperspectral image provides a plethora of information, some of which may be redundant. Therefore, reducing information’s dimensionality using feature selection methods becomes essential. Here, we proposed a new technique, named synergy interval partial least squares (SIPLS) with successive projections algorithm (SPA) (SIPLS-SPA), combining SIPLS and SPA, to efficiently extract optimal spectral features of wheat biomass from hyperspectral image data. In this study, hyperspectral images and leaf biomass were acquired from two-year wheat field experiments with varied nitrogen rates, planting densities, and cultivars. The results showed that eight wavelengths (706, 724, 734, 806, 808, 810, 812, and 816 nm) were selected as the sensitive input variables to establish partial least squares regression (PLSR) model for wheat leaf biomass. The SIPLS-SPA biomass model performed better with higher Rc2 (0.79) in calibration, lower RMSEv (0.059 kg/m2) and RRMSEv (38.55%) in validation. Compared with other state-of-the-art feature selection techniques, the SIPLS-SPA method provided significantly fewer unrelated, collinear spectral variables, and showed a promising application in terms of lower overall complexity, reduced computational complexity, and shorter running time. Furthermore, this work demonstrates the potential of SIPLS-SPA method for extracting other plant traits from hyperspectral data in future agricultural applications.

[1]  M. C. U. Araújo,et al.  The successive projections algorithm for variable selection in spectroscopic multicomponent analysis , 2001 .

[2]  J. Clevers The use of imaging spectrometry for agricultural applications , 1999 .

[3]  J Kolar,et al.  Use of genetic algorithms with multivariate regression for determination of gelatine in historic papers based on FT-IR and NIR spectral data. , 2010, Talanta.

[4]  José Crossa,et al.  Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods. , 2010, Genetics research.

[5]  Jiewen Zhao,et al.  Measurement of non-sugar solids content in Chinese rice wine using near infrared spectroscopy combined with an efficient characteristic variables selection algorithm. , 2015, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[6]  Limin Yang,et al.  Development of a global land cover characteristics database and IGBP DISCover from 1 km AVHRR data , 2000 .

[7]  Vincent Leemans,et al.  Selection of the most efficient wavelength bands for ‘Jonagold’ apple sorting , 2003 .

[8]  Anil K. Jain,et al.  Artificial neural networks for feature extraction and multivariate data projection , 1995, IEEE Trans. Neural Networks.

[9]  On-line monitoring the extract process of Fu-fang Shuanghua oral solution using near infrared spectroscopy and different PLS algorithms. , 2016, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[10]  Chunjiang Zhao,et al.  Optimization of Informative Spectral Variables for the Quantification of EGCG in Green Tea Using Fourier Transform Near-Infrared (FT-NIR) Spectroscopy and Multivariate Calibration , 2011, Applied spectroscopy.

[11]  Osval A. Montesinos-López,et al.  Bayesian functional regression as an alternative statistical analysis of high-throughput phenotyping data of modern agriculture , 2018, Plant Methods.

[12]  G. de los Campos,et al.  Predicting grain yield using canopy hyperspectral reflectance in wheat breeding data , 2017, Plant Methods.

[13]  José Crossa,et al.  Genomic Bayesian functional regression models with interactions for predicting wheat grain yield using hyper-spectral image data , 2017, Plant Methods.

[14]  R. M. Hoffer,et al.  Biomass estimation on grazed and ungrazed rangelands using spectral indices , 1998 .

[15]  R. Waring,et al.  The normalized difference vegetation index of small Douglas-fir canopies with varying chlorophyll concentrations , 1994 .

[16]  Riccardo Leardi,et al.  Genetic Algorithms as a Tool for Wavelength Selection in Multivariate Calibration , 1995 .

[17]  Yu Huang,et al.  Evaluation of Six Algorithms to Monitor Wheat Leaf Nitrogen Concentration , 2015, Remote. Sens..

[18]  Priyakant Sinha,et al.  Review of the use of remote sensing for biomass estimation to support renewable energy generation , 2015 .

[19]  Benoit Rivard,et al.  The Successive Projection Algorithm (SPA), an Algorithm with a Spatial Constraint for the Automatic Search of Endmembers in Hyperspectral Data , 2008, Sensors.

[20]  A. Qin,et al.  Development of a critical nitrogen dilution curve based on leaf dry matter for summer maize , 2017 .

[21]  J. Schjoerring,et al.  Reflectance measurement of canopy biomass and nitrogen status in wheat crops using normalized difference vegetation indices and partial least squares regression , 2003 .

[22]  Zou Xiaobo,et al.  Variables selection methods in near-infrared spectroscopy. , 2010, Analytica chimica acta.

[23]  Lorenzo Bruzzone,et al.  Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[24]  W. Verhoef,et al.  PROSPECT+SAIL models: A review of use for vegetation characterization , 2009 .