Comparison of locally weighted PLS strategies for regression and discrimination on agronomic NIR data

In multivariate calibration, locally weighted partial least squares regression (LWPLSR) is an efficient prediction method when heterogeneity in the data generates nonlinear relations (curvatures and clustering) between the response and the explanatory variables. This is frequent in agronomic data sets that gather materials of different natures or origins. LWPLSR is a particular case of weighted PLSR (WPLSR; i.e., a statistical weight different from the standard 1/n is given to each of the n calibration observations when calculating the PLS scores/loadings and the predictions). In LWPLSR, the weights depend on the dissimilarity (which has to be defined and calculated) between each calibration observation and the new observation to predict. This article compares two LWPLSR strategies: (a) “LW”, the usual strategy where, for each new observation to predict, a WPLSR is applied to the n calibration observations (i.e., the entire calibration set), and (b) “KNN‐LW”, where the k nearest neighbors of the observation to predict are first selected in the training set and WPLSR is applied only to this selected KNN set. On three illustrative agronomic data sets (quantitative and discrimination predictions), both strategies outperformed standard PLSR. LW and KNN‐LW had close prediction performances, but KNN‐LW was much faster in computation time. The KNN‐LW strategy is therefore recommended for large data sets. The article also presents a new algorithm for WPLSR, based on the “improved kernel #1” algorithm, which competes with, and is in general faster than, the already published weighted PLS algorithm based on nonlinear iterative partial least squares (NIPALS).
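The two strategies described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the article's implementation: it uses a simple NIPALS-style weighted PLS1 rather than the weighted "improved kernel #1" algorithm, and the Euclidean dissimilarity, the exponential weight function, the bandwidth parameter `h`, and the function names `wpls1_fit` and `lwplsr_predict` are all hypothetical choices made for the example.

```python
import numpy as np


def wpls1_fit(X, y, weights, ncomp):
    """Weighted PLS1 (NIPALS-style sketch) for a single response y.

    Returns the weighted means of X and y and the regression vector b.
    """
    d = weights / weights.sum()          # statistical weights summing to 1
    xmean = d @ X                        # weighted column means
    ymean = d @ y                        # weighted response mean
    Xc = X - xmean
    yc = y - ymean
    p = X.shape[1]
    W = np.zeros((p, ncomp))
    P = np.zeros((p, ncomp))
    c = np.zeros(ncomp)
    used = 0
    for a in range(ncomp):
        w = Xc.T @ (d * yc)              # weighted covariance with y
        norm = np.linalg.norm(w)
        if norm < 1e-12:                 # nothing left to explain
            break
        w = w / norm
        t = Xc @ w                       # scores
        tt = t @ (d * t)                 # weighted squared norm of scores
        p_a = Xc.T @ (d * t) / tt        # X loadings
        c_a = yc @ (d * t) / tt          # y loading
        Xc = Xc - np.outer(t, p_a)       # deflate X
        yc = yc - c_a * t                # deflate y
        W[:, a], P[:, a], c[a] = w, p_a, c_a
        used += 1
    if used == 0:
        return xmean, ymean, np.zeros(p)
    W, P, c = W[:, :used], P[:, :used], c[:used]
    b = W @ np.linalg.solve(P.T @ W, c)  # regression coefficients
    return xmean, ymean, b


def lwplsr_predict(Xtrain, ytrain, xnew, ncomp, k=None, h=2.0):
    """Predict one new observation.

    k=None -> "LW": WPLSR on the entire calibration set.
    k=int  -> "KNN-LW": WPLSR on the k nearest neighbors only.
    """
    # Assumption: Euclidean distance as the dissimilarity.
    dist = np.linalg.norm(Xtrain - xnew, axis=1)
    if k is not None:
        idx = np.argsort(dist)[:k]       # preliminary neighbor selection
    else:
        idx = np.arange(len(dist))
    dsel = dist[idx]
    scale = dsel.mean()
    # Assumption: exponentially decreasing weights with bandwidth h.
    if scale > 0:
        weights = np.exp(-dsel / (h * scale))
    else:
        weights = np.ones_like(dsel)
    xmean, ymean, b = wpls1_fit(Xtrain[idx], ytrain[idx], weights, ncomp)
    return float(ymean + (xnew - xmean) @ b)
```

Calling `lwplsr_predict(X, y, xnew, ncomp=10)` corresponds to the LW strategy, while `lwplsr_predict(X, y, xnew, ncomp=10, k=100)` corresponds to KNN‐LW. In this sketch the speed difference comes from fitting each local model on k rows instead of n.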
