Automated wavelength selection for spectroscopic fuel models by symmetrically contracting repeated unmoving window partial least squares

The need for automated quality surveillance of liquid hydrocarbon fuels has driven the development of rapid fuel property modeling from spectroscopic sensor data. The correlation of near-infrared (NIR) and Raman spectroscopic data with jet and diesel fuel properties can be improved by the deliberate selection of continuous wavelength sub-ranges. An automatic wavelength selection strategy would allow for the unsupervised construction of partial least squares (PLS) regression models of increased predictive utility when supervised model construction and maintenance is not feasible. Changeable size moving window partial least squares (CSMWPLS) is one of the most thorough operations suited for this task. Unfortunately, the necessarily large number of PLS model constructions required by an automated version of this procedure limits the evaluation of the predictive ability of the resulting models through full cross-validation results. Presented here is a novel restricted version of the CSMWPLS algorithm in which the initial spectral range selection is accomplished through multiple interval PLS (iPLS) analyses, where analysis windows for the refinement step no longer move, and size changes are limited to a series of symmetric attenuations. It is shown that the proposed algorithm can provide significant PLS model improvements during the course of a fully automated analysis of jet and diesel fuel spectra in less time than an automated CSMWPLS algorithm.

[1]  S. Wold,et al.  Wavelength interval selection in multicomponent spectral analysis by moving window partial least-squares regression with applications to mid-infrared and near-infrared spectroscopic data. , 2002, Analytical chemistry.

[2]  Roberto Todeschini,et al.  Kohonen artificial neural networks as a tool for wavelength selection in multicomponent spectrofluorimetric PLS modelling: application to phenol, o-cresol, m-cresol and p-cresol mixtures , 1999 .

[3]  Bahram Hemmateenejad,et al.  Ant colony optimisation: a powerful tool for wavelength selection , 2006 .

[4]  Ron Wehrens,et al.  Wavelength selection with Tabu Search , 2003 .

[5]  Riccardo Leardi,et al.  Genetic-algorithm-based wavelength selection in multicomponent spectrophotometric determination by PLS: application on copper and zinc mixture. , 2003, Talanta.

[6]  R Todeschini,et al.  Resolution of mixtures of three nonsteroidal anti-inflammatory drugs by fluorescence using partial least squares multivariate calibration with previous wavelength selection by Kohonen artificial neural networks. , 2000, Talanta.

[7]  Jian-Hui Jiang,et al.  Optimized Block-wise Variable Combination by Particle Swarm Optimization for Partial Least Squares Modeling in Quantitative Structure-Activity Relationship Studies , 2005, J. Chem. Inf. Model..

[8]  Kevin J. Johnson,et al.  Monitoring Diesel Fuel Degradation by Gas Chromatography−Mass Spectroscopy and Chemometric Analysis , 2004 .

[9]  Luis A. Sarabia,et al.  Genetic-algorithm-based wavelength selection in multicomponent spectrometric determinations by PLS: application on indomethacin and acemethacin mixture , 1997 .

[10]  John H. Kalivas,et al.  Further investigation on a comparative study of simulated annealing and genetic algorithm for wavelength selection , 1995 .

[11]  Richard G. Brereton,et al.  Chemometrics: Data Analysis for the Laboratory and Chemical Plant , 2003 .

[12]  Susan L. Rose-Pehrsson,et al.  Evaluating the Predictive Powers of Spectroscopy and Chromatography for Fuel Quality Assessment , 2006 .

[13]  Randall D. Tobias,et al.  Chemometrics: A Practical Guide , 1998, Technometrics.

[14]  C. B. Lucasius,et al.  Genetic algorithms in wavelength selection: a comparative study , 1994 .

[15]  Paul J. Gemperline,et al.  Wavelength selection and optimization of pattern recognition methods using the genetic algorithm , 2000 .

[16]  S. Engelsen,et al.  Interval Partial Least-Squares Regression (iPLS): A Comparative Chemometric Study with an Example from Near-Infrared Spectroscopy , 2000 .

[17]  Hamid Abdollahi,et al.  Simultaneous spectrophotometric determination of Vitamin K3 and 1,4-naphthoquinone after cloud point extraction by using genetic algorithm based wavelength selection-partial least squares regression , 2004 .

[18]  M Bernabei,et al.  Determination of total and polycyclic aromatic hydrocarbons in aviation jet fuel. , 2003, Journal of chromatography. A.

[19]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[20]  M. P. Gómez-Carracedo,et al.  Multivariate prediction of eight kerosene properties employing vapour-phase mid-infrared spectrometry , 2003 .

[21]  T. Fearn,et al.  Bayesian wavelength selection in multicomponent analysis , 1998 .

[22]  Marc Dejardin,et al.  Simulated annealing and neural networks as alternative methods for nonlinear constrained optimization , 1996 .

[23]  The Application of Chemometric Methods to Correlate Fuel Performance with Composition from Gas Chromatography , 2004 .

[24]  Desire L. Massart,et al.  Comparison of multivariate methods based on latent vectors and methods based on wavelength selection for the analysis of near-infrared spectroscopic data , 1995 .

[25]  Jian-hui Jiang,et al.  Spectral regions selection to improve prediction ability of PLS models by changeable size moving window partial least squares and searching combination moving window partial least squares , 2004 .

[26]  Pierre Dardenne,et al.  Validation and verification of regression in small data sets , 1998 .

[27]  M P Gómez-Carracedo,et al.  Generation and mid-IR measurement of a gas-phase to predict security parameters of aviation jet fuel. , 2003, Talanta.

[28]  Kevin J. Johnson,et al.  Pattern recognition of jet fuels: comprehensive GC×GC with ANOVA-based feature selection and principal component analysis , 2002 .

[29]  Mark R Smith,et al.  Optimisation of partial least squares regression calibration models in near-infrared spectroscopy: a novel algorithm for wavelength selection. , 2003, The Analyst.

[30]  John H. Kalivas,et al.  Cyclic subspace regression with analysis of wavelength-selection criteria , 1999 .

[31]  Celio Pasquini,et al.  Determination of total sulfur in diesel fuel employing NIR spectroscopy and multivariate calibration. , 2003, The Analyst.

[32]  D. Massart,et al.  Elimination of uninformative variables for multivariate calibration. , 1996, Analytical chemistry.

[33]  Ehl Emile Aarts,et al.  Simulated annealing and Boltzmann machines , 2003 .

[34]  Nancy Forbes,et al.  Imitation of Life: How Biology Is Inspiring Computing , 2004 .

[35]  E. V. Thomas,et al.  Partial least-squares methods for spectral analyses. 1. Relation to other quantitative calibration methods and the extraction of qualitative information , 1988 .

[36]  Lutgarde M. C. Buydens,et al.  Improvement of PLS model transferability by robust wavelength selection , 1998 .

[37]  E. V. Thomas,et al.  Non‐parametric statistical methods for multivariate calibration model selection and comparison , 2003 .

[38]  C. Spiegelman,et al.  Theoretical Justification of Wavelength Selection in PLS Calibration:  Development of a New Algorithm. , 1998, Analytical Chemistry.