Prediction of Brix Values of Intact Peaches with Least Squares-Support Vector Machine Regression Models

Second derivative of interactance spectra (731–926 nm) of intact peaches and Brix values of extracted juice were used to develop a least squares support vector machine (LS-SVM) regression (based on an RBF kernel) and a PLS regression model. An iterative approach was taken with the LS-SVM regression, involving a grid search with application of a gradient-based optimisation method using a validation set for the tuning of hyperparameters, followed by pruning of the LS-SVM model with the optimised hyperparameters. The grid search approach led to five-fold faster and better determination of hyperparameters. Less than 45% of the initial 1430 calibration samples were kept in the models. In prediction of an independent test set with 120 samples, the pruned LS-SVM models performed better than the PLS model (RMSEP decreased by 9% to 14%).

[1]  B. Walczak,et al.  About kernel latent variable approaches and SVM , 2005 .

[2]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[3]  J. Roger,et al.  Application of LS-SVM to non-linear phenomena in NIR spectroscopy: development of a robust and portable sensor for acidity prediction in grapes , 2004 .

[4]  V. Vapnik The Support Vector Method of Function Estimation , 1998 .

[5]  Di Wu,et al.  Study on infrared spectroscopy technique for fast measurement of protein content in milk powder based on LS-SVM , 2008 .

[6]  Jerry Workman Creating Two- and Three-Dimensional Images Using Spectroscopic Data and MATLAB® Software , 1998 .

[7]  L. Buydens,et al.  Comparing support vector machines to PLS for spectral regression applications , 2004 .

[8]  Tom Fearn,et al.  Comparing Standard Deviations , 1996 .

[9]  R. Shah,et al.  Least Squares Support Vector Machines , 2022 .

[10]  C. Sitthi-amorn,et al.  Bias , 1993, The Lancet.

[11]  M. Forina,et al.  Multivariate calibration. , 2007, Journal of chromatography. A.

[12]  Richard F. Daniels,et al.  Estimation of the Physical Wood Properties of Pinus Taeda L. Radial Strips Using Least Squares Support Vector Machines , 2004 .

[13]  Robert P. Cogdill,et al.  Least-Squares Support Vector Machines for Chemometrics: An Introduction and Evaluation , 2004 .

[14]  L. Buydens,et al.  Determination of optimal support vector regression parameters by genetic algorithms and simplex optimization , 2005 .

[15]  C. Greensill,et al.  Sorting of Fruit Using near Infrared Spectroscopy: Application to a Range of Fruit and Vegetables for Soluble Solids and Dry Matter Content , 2004 .

[16]  William J. Welch,et al.  Computer-aided design of experiments , 1981 .

[17]  Huirong Xu,et al.  Effect of fruit moving speed on predicting soluble solids content of ‘Cuiguan’ pears (Pomaceae pyrifolia Nakai cv. Cuiguan) using PLS and LS-SVM regression , 2009 .