Biodiesel classification by base stock type (vegetable oil) using near infrared spectroscopy data.

The use of biofuels, such as bioethanol or biodiesel, has rapidly increased in the last few years. Near infrared (near-IR, NIR, or NIRS) spectroscopy (>4000cm(-1)) has previously been reported as a cheap and fast alternative for biodiesel quality control when compared with infrared, Raman, or nuclear magnetic resonance (NMR) methods; in addition, NIR can easily be done in real time (on-line). In this proof-of-principle paper, we attempt to find a correlation between the near infrared spectrum of a biodiesel sample and its base stock. This correlation is used to classify fuel samples into 10 groups according to their origin (vegetable oil): sunflower, coconut, palm, soy/soya, cottonseed, castor, Jatropha, etc. Principal component analysis (PCA) is used for outlier detection and dimensionality reduction of the NIR spectral data. Four different multivariate data analysis techniques are used to solve the classification problem, including regularized discriminant analysis (RDA), partial least squares method/projection on latent structures (PLS-DA), K-nearest neighbors (KNN) technique, and support vector machines (SVMs). Classifying biodiesel by feedstock (base stock) type can be successfully solved with modern machine learning techniques and NIR spectroscopy data. KNN and SVM methods were found to be highly effective for biodiesel classification by feedstock oil type. A classification error (E) of less than 5% can be reached using an SVM-based approach. If computational time is an important consideration, the KNN technique (E=6.2%) can be recommended for practical (industrial) implementation. Comparison with gasoline and motor oil data shows the relative simplicity of this methodology for biodiesel classification.

[1]  R. Stafford,et al.  Nanoshell-mediated near-infrared thermal therapy of tumors under magnetic resonance guidance , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Daniel E. Resasco,et al.  Solid Nanoparticles that Catalyze Biofuel Upgrade Reactions at the Water/Oil Interface , 2010, Science.

[3]  Roman M. Balabin,et al.  Quantitative Measurement of Ethanol Distribution over Fractions of Ethanol−Gasoline Fuel , 2007 .

[4]  Roman M. Balabin,et al.  Adsorption of Petroleum Asphaltenes onto Reservoir Rock Sands Studied by Near-Infrared (NIR) Spectroscopy , 2009 .

[5]  Roman M. Balabin,et al.  Molar enthalpy of vaporization of ethanol–gasoline mixtures and their colloid state , 2007 .

[6]  Oliver C. Mullins,et al.  Asphaltenes in Crude Oil: Absorbers and/or Scatterers in the Near- Infrared Region? , 1990 .

[7]  Roman M. Balabin,et al.  Wavelet neural network (WNN) approach for calibration model building based on gasoline near infrared (NIR) spectra , 2008 .

[8]  Richard F. Daniels,et al.  Nondestructive estimation of wood chemical composition of sections of radial wood strips by diffuse reflectance near infrared spectroscopy , 2006, Wood Science and Technology.

[9]  Roman M. Balabin,et al.  Gasoline classification by source and type based on near infrared (NIR) spectroscopy data , 2008 .

[10]  T. Fearn,et al.  Near infrared spectroscopy in food analysis , 1986 .

[11]  Frank J. Duarte Tunable lasers handbook , 1995 .

[12]  Bonnie F. Sloane,et al.  Chiral porphyrazine near-IR optical imaging agent exhibiting preferential tumor accumulation , 2009, Proceedings of the National Academy of Sciences.

[13]  D C Malins,et al.  Infrared spectral models demonstrate that exposure to environmental chemicals leads to new forms of DNA. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[14]  J. Lakowicz Principles of fluorescence spectroscopy , 1983 .

[15]  J. Menezes,et al.  Multivariate near infrared spectroscopy models for predicting the iodine value, CFPP, kinematic viscosity at 40 degrees C and density at 15 degrees C of biodiesel. , 2008, Talanta.

[16]  Roman M. Balabin,et al.  Gasoline classification using near infrared (NIR) spectroscopy data: comparison of multivariate techniques. , 2010, Analytica chimica acta.

[17]  J. Keeler Understanding NMR Spectroscopy , 2005 .

[18]  Giovanni Luca Christian Masala,et al.  A comparative study of K-Nearest Neighbour, Support Vector Machine and Multi-Layer Perceptron for Thalassemia screening , 2003 .

[19]  Roman M. Balabin,et al.  Comparison of linear and nonlinear calibration models based on near infrared (NIR) spectroscopy data for gasoline properties prediction , 2007 .

[20]  Joseph R. Lakowicz,et al.  Introduction to Fluorescence , 1983 .

[21]  Seoung Bum Kim,et al.  An effective classification procedure for diagnosis of prostate cancer in near infrared spectra , 2010, Expert Syst. Appl..

[22]  J. Logan,et al.  Real-time PCR : current technology and applications , 2009 .

[23]  Roman M. Balabin,et al.  Petroleum resins adsorption onto quartz sand: near infrared (NIR) spectroscopy study. , 2008, Journal of colloid and interface science.

[24]  J. Keasling,et al.  Microbial production of fatty-acid-derived fuels and chemicals from plant biomass , 2010, Nature.

[25]  Roman M. Balabin Enthalpy difference between conformations of normal alkanes: Intramolecular basis set superposition error (BSSE) in the case of n-butane and n-hexane. , 2008, The Journal of chemical physics.

[26]  Roman M. Balabin,et al.  Neural network approach to quantum-chemistry data: accurate prediction of density functional theory energies. , 2009, The Journal of chemical physics.

[27]  Michael J. Therien,et al.  Physical chemistry: How to improve your image , 2009, Nature.

[28]  S. Gambhir,et al.  Quantum Dots for Live Cells, in Vivo Imaging, and Diagnostics , 2005, Science.

[29]  Andrew D. Jones,et al.  Supporting Online Material for: Ethanol Can Contribute To Energy and Environmental Goals , 2006 .

[30]  Kazumasa Sakurai,et al.  Principal component analysis of the pH-dependent conformational transitions of bovine β-lactoglobulin monitored by heteronuclear NMR , 2007, Proceedings of the National Academy of Sciences.

[31]  G. Foca,et al.  Amperometric sensors based on poly(3,4-ethylenedioxythiophene)-modified electrodes: discrimination of white wines. , 2008, Analytica chimica acta.

[32]  David L. Kaplan,et al.  Ultra-sensitive vibrational spectroscopy of protein monolayers with plasmonic nanoantenna arrays , 2009, Proceedings of the National Academy of Sciences.

[33]  Jerry Workman,et al.  Applied Spectroscopy: A Compact Reference for Practitioners , 1998 .

[34]  W E Moerner,et al.  Principal-components analysis of shape fluctuations of single DNA molecules , 2007, Proceedings of the National Academy of Sciences.

[35]  G. Foca,et al.  Classification of red wines by chemometric analysis of voltammetric signals from PEDOT-modified electrodes. , 2009, Analytica chimica acta.

[36]  Tormod Næs,et al.  A user-friendly guide to multivariate calibration and classification , 2002 .

[37]  J. L. Hodges,et al.  Discriminatory Analysis - Nonparametric Discrimination: Consistency Properties , 1989 .

[38]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[39]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[40]  Thomas F. Krauss,et al.  Two-dimensional photonic-bandgap structures operating at near-infrared wavelengths , 1996, Nature.

[41]  J. Kister,et al.  Geographic origins and compositions of virgin olive oils determinated by chemometric analysis of NIR spectra. , 2007, Analytica chimica acta.

[42]  Pedro Felizardo,et al.  Multivariate near infrared spectroscopy models for predicting methanol and water content in biodiesel. , 2007, Analytica chimica acta.

[43]  J. Eisinger,et al.  Front-face fluorometry of liquid samples. , 1979, Analytical biochemistry.

[44]  J. Friedman Regularized Discriminant Analysis , 1989 .

[45]  Paul Geladi,et al.  Replicate analysis and outlier detection in multivariate NIR calibration, illustrated with biofuel analysis , 2005 .

[46]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[47]  Roman M. Balabin,et al.  Capabilities of near Infrared Spectroscopy for the Determination of Petroleum Macromolecule Content in Aromatic Solutions , 2007 .

[48]  L. Harwood,et al.  Experimental Organic Chemistry: Principles and Practice , 1990 .

[49]  Charlotte K. Williams,et al.  The Path Forward for Biofuels and Biomaterials , 2006, Science.

[50]  Nicolas Thomas,et al.  Distribution of Mid-Latitude Ground Ice on Mars from New Impact Craters , 2009, Science.

[51]  Roman M. Balabin,et al.  Motor oil classification by base stock and viscosity based on near infrared (NIR) spectroscopy data , 2008 .

[52]  M. Monteiro,et al.  Evaluation of biodiesel-diesel blends quality using 1H NMR and chemometrics. , 2009, Talanta.

[53]  H. Mao,et al.  Rotational dynamics of confined C60 from near-infrared Raman studies under high pressure , 2009, Proceedings of the National Academy of Sciences.

[54]  S. Schulman,et al.  Introduction to fluorescence spectroscopy , 1999 .

[55]  F. Duarte Tunable Laser Applications, Second Edition , 2008 .

[56]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[57]  Joel C Rubim,et al.  Adulteration of diesel/biodiesel blends by vegetable oil as determined by Fourier transform (FT) near infrared spectrometry and FT-Raman spectroscopy. , 2007, Analytica chimica acta.

[58]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[59]  L. Dubrovinsky,et al.  Optical Absorption and Radiative Thermal Conductivity of Silicate Perovskite to 125 Gigapascals , 2008, Science.

[60]  P. Ajayan,et al.  Long-term survival following a single treatment of kidney tumors with multiwalled carbon nanotubes and near-infrared radiation , 2009, Proceedings of the National Academy of Sciences.

[61]  S. Affatato,et al.  Vibrational spectroscopy of ultra-high molecular weight polyethylene hip prostheses: influence of the sterilisation method on crystallinity and surface oxidation , 2002 .

[62]  Gerhard Knothe,et al.  Determining the blend level of mixtures of biodiesel with conventional diesel fuel by fiber-optic near-infrared spectroscopy and 1H nuclear magnetic resonance spectroscopy , 2001 .

[63]  Joseph Irudayaraj,et al.  Discriminant analysis of edible oils and fats by FTIR, FT-NIR and FT-Raman spectroscopy , 2005 .

[64]  Gautam Vasisht,et al.  The presence of methane in the atmosphere of an extrasolar planet , 2008, Nature.

[65]  B. Manly Multivariate Statistical Methods : A Primer , 1986 .

[66]  L. Schimleck,et al.  Estimation ofPinus radiata D. Don clear wood properties by near-infrared spectroscopy , 2002, Journal of Wood Science.

[67]  Miguel de la Guardia,et al.  Nondestructive direct determination of heroin in seized illicit street drugs by diffuse reflectance near-infrared spectroscopy. , 2008, Analytical chemistry.