Detection of discoloration in diesel fuel based on gas chromatographic fingerprints

In the countries of the European Community, diesel fuel samples are spiked with Solvent Yellow 124 and either Solvent Red 19 or Solvent Red 164. Their presence at a given concentration indicates the specific tax rate and determines the usage of fuel. The removal of these so-called excise duty components, which is known as fuel “laundering”, is an illegal action that causes a substantial loss in a government’s budget. The aim of our study was to prove that genuine diesel fuel samples and their counterfeit variants (obtained from a simulated sorption process) can be differentiated by using their gas chromatographic fingerprints that are registered with a flame ionization detector. To achieve this aim, a discriminant partial least squares analysis, PLS-DA, for the genuine and counterfeit oil fingerprints after a baseline correction and the alignment of peaks was constructed and validated. Uninformative variables elimination (UVE), variable importance in projection (VIP), and selectivity ratio (SR), which were coupled with a bootstrap procedure, were adapted in PLS-DA in order to limit the possibility of model overfitting. Several major chemical components within the regions that are relevant to the discriminant problem were suggested as being the most influential. We also found that the bootstrap variants of UVE-PLS-DA and SR-PLS-DA have excellent predictive abilities for a limited number of gas chromatographic features, 14 and 16, respectively. This conclusion was also supported by the unitary values that were obtained for the area under the receiver operating curve (AUC) independently for the model and test sets.

[1]  P. Eilers A perfect smoother. , 2003, Analytical chemistry.

[2]  D. Massart,et al.  Elimination of uninformative variables for multivariate calibration. , 1996, Analytical chemistry.

[3]  Stefania Favilla,et al.  Assessing feature relevance in NPLS models by VIP , 2013 .

[4]  Y. Heyden,et al.  Potential antioxidant compounds in Mallotus species fingerprints. Part II: fingerprint alignment, data analysis and peak identification. , 2012, Analytica chimica acta.

[5]  M. Jalali-Heravi,et al.  Chromatographic fingerprint analysis of secondary metabolites in citrus fruits peels using gas chromatography-mass spectrometry combined with advanced chemometric methods. , 2012, Journal of chromatography. A.

[6]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[7]  M Daszykowski,et al.  Simultaneous determination of Solvent Yellow 124 and Solvent Red 19 in diesel oil using fluorescence spectroscopy and chemometrics. , 2012, Talanta.

[8]  D. B. Hibbert Multivariate calibration and classification - T. Naes, T. Isaksson, T. Fearn and T. Davis, NIR Publications, Chichester, 2002, ISBN 0 9528666 2 5, UK @$45.00, US$75.00 , 2004 .

[9]  L. Eriksson Multi- and megavariate data analysis , 2006 .

[10]  R. Todeschini,et al.  Chemometric analysis of gas chromatography with flame ionisation detection chromatograms: a novel method for classification of petroleum products. , 2012, Journal of chromatography. A.

[11]  H. Nielsen,et al.  Data fusion in metabolomic cancer diagnostics , 2012, Metabolomics.

[12]  Tahir Mehmood,et al.  A review of variable selection methods in Partial Least Squares Regression , 2012 .

[13]  M Daszykowski,et al.  Target selection for alignment of chromatographic signals obtained using monochannel detectors. , 2007, Journal of chromatography. A.

[14]  Rasmus Bro,et al.  Variable selection in regression—a tutorial , 2010 .

[15]  P. Erhardt,et al.  Reproducibility of the high-performance liquid chromatographic fingerprints obtained from two soybean cultivars and a selected progeny. , 2001, Journal of chromatography. A.

[16]  Frederick E. Petry,et al.  Principles and Applications , 1997 .

[17]  J. Carstensen,et al.  Aligning of single and multiple wavelength chromatographic profiles for chemometric data analysis using correlation optimised warping , 1998 .

[18]  Olav M. Kvalheim,et al.  Interpretation of latent-variable regression models , 1989 .

[19]  Yi-Zeng Liang,et al.  Monte Carlo cross validation , 2001 .

[20]  Beata Walczak,et al.  USE AND ABUSE OF CHEMOMETRICS IN CHROMATOGRAPHY , 2006 .

[21]  Margaret A. Nemeth,et al.  Multi- and Megavariate Data Analysis , 2003, Technometrics.

[22]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[23]  Hein Putter,et al.  The bootstrap: a tutorial , 2000 .

[24]  Desire L. Massart,et al.  Projection methods in chemistry , 2003 .

[25]  M. Daszykowski,et al.  Identifying the illegal removal from diesel oil of certain chemical markers that designate excise duty , 2014 .

[26]  Y. Heyden,et al.  Prediction of total green tea antioxidant capacity from chromatograms by multivariate modeling. , 2005, Journal of chromatography. A.

[27]  Age K. Smilde,et al.  UvA-DARE ( Digital Academic Repository ) Assessment of PLSDA cross validation , 2008 .

[28]  Masoud Nikravesh,et al.  Feature Extraction - Foundations and Applications , 2006, Feature Extraction.

[29]  O. Kvalheim,et al.  Biomarker discovery in mass spectral profiles by means of selectivity ratio plot , 2009 .

[30]  R. Brereton,et al.  Partial least squares discriminant analysis: taking the magic away , 2014 .

[31]  Desire L. Massart,et al.  Representative subset selection , 2002 .