Optimal wavelet filter construction using X and Y data

Abstract It has been recently shown that the predictive ability of wavelet models in multivariate calibration problems can be improved by optimizing the filters employed in the Discrete Wavelet Transform (DWT) with respect to the statistics of the matrix of instrumental responses. However, no attempt has been made at exploiting the statistics of the matrix of predicted parameters in the optimization process. This work addresses this issue and proposes a novel strategy for wavelet filter optimization that aims at directly minimizing the prediction error of a wavelet regression model with respect to a given validation set. Moreover, some theoretical and algorithmic aspects of the angular parameterization of wavelet filters needed for the optimization procedure are clarified. The parameterization is explained in a simple graphical manner. A requirement on the sum of the angular parameters and its implications in the optimization strategy are discussed. Finally, a procedure for obtaining the angular parameters associated to any traditional mother wavelet is provided, both as an algorithm and as a Matlab code. The proposed strategy is illustrated in a simulated multivariate calibration example involving two analytes and also in a problem of total sulphur determination in diesel samples by near-infrared (NIR) absorption spectrometry. Both the simulated and the real examples show that the proposed filter optimization procedure improves the prediction ability of wavelet regression models. Moreover, in the sulphur determination problem, the wavelet models result in a smaller prediction error than a traditional Partial Least Squares Regression (PLS) model.

[1]  Maria Fernanda Pimentel,et al.  A solution to the wavelet transform optimization problem in multicomponent analysis , 2003 .

[2]  Jeffrey J. Kelly,et al.  Nondestructive analytical procedure for simultaneous estimation of the major classes of hydrocarbon constituents of finished gasolines , 1990 .

[3]  R. Jones Determination of Sulfur in Gasoline by X-Ray Emission Spectrography , 1961 .

[4]  K. Jetter,et al.  Principles and applications of wavelet transformation to chemometrics , 2000 .

[5]  Israel Schechter,et al.  On-line remote prediction of gasoline properties by combined optical methods , 1997 .

[6]  L. Bokobza Near Infrared Spectroscopy , 1998 .

[7]  T. Fearn,et al.  Bayesian Wavelet Regression on Curves With Application to a Spectroscopic Calibration Problem , 2001 .

[8]  Maria Fernanda Pimentel,et al.  Aspects of the successive projections algorithm for variable selection in multivariate calibration applied to plasma emission spectrometry , 2001 .

[9]  M. C. U. Araújo,et al.  The successive projections algorithm for variable selection in spectroscopic multicomponent analysis , 2001 .

[10]  R. Wormsbecher,et al.  Quantitative determination of sulfur compounds in FCC gasolines by AED. A study of the effect of catalyst type and catalytic conditions on sulfur distribution , 1993 .

[11]  K. Jetter,et al.  Quantitative analysis of near infrared spectra by wavelet coefficient regression using a genetic algorithm , 1999 .

[12]  Maria Fernanda Pimentel,et al.  A Linear Semi-infinite Programming Strategy for Constructing Optimal Wavelet Transforms in Multivariate Calibration Problems , 2003, J. Chem. Inf. Comput. Sci..

[13]  Truong Q. Nguyen,et al.  Wavelets and filter banks , 1996 .

[14]  M. J. Navas,et al.  Chemiluminescent Methods in Petroleum Products Analysis , 2000 .

[15]  Barry G. Sherlock,et al.  On the space of orthonormal wavelets , 1998, IEEE Trans. Signal Process..

[16]  D. Kell,et al.  An introduction to wavelet transforms for chemometricians: A time-frequency approach , 1997 .

[17]  O. Rioul,et al.  Wavelets and signal processing , 1991, IEEE Signal Processing Magazine.

[18]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[19]  P. Vaidyanathan Multirate Systems And Filter Banks , 1992 .

[20]  Rasmus Bro,et al.  Orthogonal signal correction, wavelet analysis, and multivariate calibration of complicated process fluorescence data , 2000 .

[21]  Kus Hidajat,et al.  Quality Characterisation of Crude Oils by Partial Least Square Calibration of NIR Spectral Profiles , 2000 .

[22]  Mihai Anitescu,et al.  The role of linear semi-infinite programming in signal-adapted QMF bank design , 1997, IEEE Trans. Signal Process..

[23]  John B. Cooper,et al.  Comparison of Near-IR, Raman, and Mid-IR Spectroscopies for the Determination of BTEX in Petroleum Fuels , 1997 .

[24]  Frank Vogt,et al.  Fast principal component analysis of large data sets , 2001 .

[25]  A. Salgó,et al.  Prediction of Gasoline Properties with near Infrared Spectroscopy , 1998 .

[26]  S. Wold,et al.  PLS regression on wavelet compressed NIR spectra , 1998 .

[27]  Beata Walczak,et al.  Wavelets in Chemistry , 2001 .

[28]  D. Kell,et al.  Variable selection in wavelet regression models , 1998 .

[29]  R. Jungers,et al.  Determination of total sulfur in gasoline by gas chromatography with a flame photometric detector , 1977 .