Partial least squares with outlier detection in spectral analysis: A tool to predict gasoline properties

The aim of this study is to propose a novel partial least squares with outlier detection (PLS_OD) calibration method and show its usefulness in calibration successfully with data containing outlying objects. We apply this method in gasoline spectral analysis to predict gasoline properties. In particular, a comparative study of PLS_OD and other five methods is presented. The performances of the proposed method are illustrated on spectral data set with and without outliers. The obtained results suggest that the proposed method can be used for constructing satisfactory gasoline prediction model whether there are some outliers or not.

[1]  Mia Hubert,et al.  Robustness properties of a robust partial least squares regression method , 2004 .

[2]  T. Fearn,et al.  Near infrared spectroscopy in food analysis , 1986 .

[3]  Erik Johansson,et al.  Time-resolved QSAR: an approach to PLS modelling of three-way biological data , 2004 .

[4]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[5]  Roman M. Balabin,et al.  Comparison of linear and nonlinear calibration models based on near infrared (NIR) spectroscopy data for gasoline properties prediction , 2007 .

[6]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[7]  Werner A. Stahel,et al.  Robust Statistics: The Approach Based on Influence Functions , 1987 .

[8]  R. Barnes,et al.  Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra , 1989 .

[9]  R. E. Aries,et al.  Determination of gas oil cetane number and cetane index using near-infrared Fourier-transform Raman spectroscopy , 1990 .

[10]  Roman M. Balabin,et al.  Gasoline classification by source and type based on near infrared (NIR) spectroscopy data , 2008 .

[11]  Emilio Marengo,et al.  Modeling of the polluting emissions from a cement production plant by partial least-squares, principal component regression, and artificial neural networks. , 2006, Environmental science & technology.

[12]  Hoeil Chung,et al.  Compositional Analysis of Naphtha by FT-Raman Spectroscopy , 1999 .

[13]  Peter Filzmoser,et al.  Partial robust M-regression , 2005 .

[14]  J. Callis,et al.  Prediction of gasoline octane numbers from near-infrared spectral features in the range 660-1215 nm , 1989 .

[15]  Roman M. Balabin,et al.  Wavelet neural network (WNN) approach for calibration model building based on gasoline near infrared (NIR) spectra , 2008 .

[16]  James Cheney,et al.  The chemometric analysis of point and dynamic data in pharmaceutical and biotech production (PAT) — some objectives and approaches , 2006 .