Multivariate Calibration with Support Vector Regression Based on Random Projection

A multivariate calibration method with support vector regression based on random projection is proposed. The proposed algorithm reduces the dimensionality of high-dimensional spectral data by random projection, and then performs quantitative calibration using support vector regression. Support vector regression method can handle nonlinear regression which is usually encountered in spectral analysis. Moreover, model calculation and optimization on the projected data are relatively fast. A comparative study of the proposed method and partial least squares on four spectral datasets is presented. The proposed method allows drastic reduction in data size and computing time, while preserving the predict performance.

[1]  Dimitris Achlioptas,et al.  Database-friendly random projections: Johnson-Lindenstrauss with binary coins , 2003, J. Comput. Syst. Sci..

[2]  L. Buydens,et al.  Comparing support vector machines to PLS for spectral regression applications , 2004 .

[3]  L. A. Stone,et al.  Computer Aided Design of Experiments , 1969 .

[4]  Han Li,et al.  On the performance of regularized regression learning in Hilbert space , 2012, Neurocomputing.

[5]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[6]  Juan José Rodríguez Diez,et al.  Random projections for linear SVM ensembles , 2011, Applied Intelligence.

[7]  Heikki Mannila,et al.  Random projection in dimensionality reduction: applications to image and text data , 2001, KDD '01.

[8]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[9]  Rémi Munos,et al.  Compressed Least-Squares Regression , 2009, NIPS.

[10]  Peter D. Wentzell,et al.  Comparison of principal components regression and partial least squares regression through generic simulations of complex mixtures , 2003 .

[11]  L. Buydens,et al.  Multivariate calibration with least-squares support vector machines. , 2004, Analytical chemistry.

[12]  S. Wold,et al.  Some recent developments in PLS modeling , 2001 .

[13]  Søren Balling Engelsen,et al.  Rapid Spectroscopic Analysis of Marzipan—Comparative Instrumentation , 2004 .

[14]  Han Li,et al.  The coefficient Regularized Regression with Random Projection , 2012, Int. J. Wavelets Multiresolution Inf. Process..

[15]  A. Belousov,et al.  Applicational aspects of support vector machines , 2002 .

[16]  Felipe Cucker,et al.  On the mathematical foundations of learning , 2001 .

[17]  M. Dyrby,et al.  Chemometric Quantitation of the Active Substance (Containing C≡N) in a Pharmaceutical Tablet Using Near-Infrared (NIR) Transmittance and NIR FT-Raman Spectra , 2002 .

[18]  A K Smilde,et al.  Influence of temperature on vibrational spectra and consequences for the predictive ability of multivariate models. , 1998, Analytical chemistry.

[19]  S. Peng,et al.  Partial least squares and random sample consensus in outlier detection. , 2012, Analytica chimica acta.