Chemical fingerprinting of petroleum biomarkers using time warping and PCA.

A new method for chemical fingerprinting of petroleum biomakers is described. The method consists of GC-MS analysis, preprocessing of GC-MS chromatograms, and principal component analysis (PCA) of selected regions. The preprocessing consists of baseline removal by derivatization, normalization, and alignment using correlation optimized warping. The method was applied to chromatograms of m/z 217 (tricyclic and tetracyclic steranes) of oil spill samples and source oils. Oil spill samples collected from the coastal environment in the weeks after the Baltic Carrier oil spill were clustered in principal components 1 to 4 with oil samples from the tank of the Baltic Carrier (source oil). The discriminative power of PCA was enhanced by deselecting the most uncertain variables or scaling them according to their uncertainty, using a weighted least squares criterion. The four principal components were interpreted as follows: boiling point range (PC1), clay content (PC2), carbon number distribution of sterols in the source rock (PC3), and thermal maturity of the oil (PC4). In summary, the method allows for analyses of chromatograms using a fast and objective procedure and with more comprehensive data usage compared to other fingerprinting methods.