High and low frequency unfolded partial least squares regression based on empirical mode decomposition for quantitative analysis of fuel oil samples.

Accurate prediction of the model is fundamental to the successful analysis of complex samples. To utilize abundant information embedded over frequency and time domains, a novel regression model is presented for quantitative analysis of hydrocarbon contents in the fuel oil samples. The proposed method named as high and low frequency unfolded PLSR (HLUPLSR), which integrates empirical mode decomposition (EMD) and unfolded strategy with partial least squares regression (PLSR). In the proposed method, the original signals are firstly decomposed into a finite number of intrinsic mode functions (IMFs) and a residue by EMD. Secondly, the former high frequency IMFs are summed as a high frequency matrix and the latter IMFs and residue are summed as a low frequency matrix. Finally, the two matrices are unfolded to an extended matrix in variable dimension, and then the PLSR model is built between the extended matrix and the target values. Coupled with Ultraviolet (UV) spectroscopy, HLUPLSR has been applied to determine hydrocarbon contents of light gas oil and diesel fuels samples. Comparing with single PLSR and other signal processing techniques, the proposed method shows superiority in prediction ability and better model interpretation. Therefore, HLUPLSR method provides a promising tool for quantitative analysis of complex samples.

[1]  Xueguang Shao,et al.  Multivariate calibration methods in near infrared spectroscopic analysis , 2010 .

[2]  Li Yan-kun,et al.  Determination of diesel cetane number by consensus modeling based on uninformative variable elimination , 2012 .

[3]  G. Wachtmeister,et al.  Diesel/biofuel exhaust particles from modern internal combustion engines: Microstructure, composition, and hygroscopicity , 2015 .

[4]  Steven D. Brown,et al.  Stacked partial least squares regression analysis for spectral calibration and prediction , 2009 .

[5]  W. Cai,et al.  An improved boosting partial least squares method for near-infrared spectroscopic quantitative analysis. , 2010, Analytica chimica acta.

[6]  R. Poppi,et al.  Simultaneous determination of hydrocarbon renewable diesel, biodiesel and petroleum diesel contents in diesel fuel blends using near infrared (NIR) spectroscopy and chemometrics. , 2013, The Analyst.

[7]  Xueguang Shao,et al.  A consensus least squares support vector regression (LS-SVR) for analysis of near-infrared spectra of plant samples. , 2007, Talanta.

[8]  D. Jones,et al.  Elemental and spectroscopic characterization of fractions of an acidic extract of oil sands process water. , 2013, Chemosphere.

[9]  Alexander Kai-man Leung,et al.  Wavelet: a new trend in chemistry. , 2003, Accounts of chemical research.

[10]  M. C. U. Araújo,et al.  Unfolded partial least squares/residual bilinearization combined with the Successive Projections Algorithm for interval selection: enhanced excitation-emission fluorescence data modeling in the presence of the inner filter effect , 2015, Analytical and Bioanalytical Chemistry.

[11]  Hadi Parastar,et al.  Chemometrics-assisted effect-directed analysis of crude and refined oil using comprehensive two-dimensional gas chromatography-time-of-flight mass spectrometry. , 2014, Environmental science & technology.

[12]  G. M. Escandar,et al.  Second-order advantage achieved by unfolded-partial least-squares/residual bilinearization modeling of excitation-emission fluorescence data presenting inner filter effects. , 2006, Analytical chemistry.

[13]  Xiaofeng Liu,et al.  Bearing faults diagnostics based on hybrid LS-SVM and EMD method , 2015 .

[14]  Shi-Miao Tan,et al.  Quantitative analysis of tea using ytterbium‐based internal standard near‐infrared spectroscopy coupled with boosting least‐squares support vector regression , 2013 .

[15]  S. D. Jong SIMPLS: an alternative approach to partial least squares regression , 1993 .

[16]  M. Farajzadeh,et al.  Gas chromatographic determination of some phenolic compounds in fuels and engine oil after simultaneous derivatization and microextraction. , 2014, Journal of separation science.

[17]  L. Dai,et al.  A hard modeling approach to determine methanol concentration in methanol gasoline by Raman spectroscopy , 2012 .

[18]  Dong-Sheng Cao,et al.  The boosting: A new idea of building models , 2010 .

[19]  Yvan Vander Heyden,et al.  Predictive-property-ranked variable reduction with final complexity adapted models in partial least squares modeling for multiple responses. , 2013, Analytical chemistry.

[20]  Israel Schechter,et al.  On-line remote prediction of gasoline properties by combined optical methods , 1997 .

[21]  Xueguang Shao,et al.  A weighted multiscale regression for multivariate calibration of near infrared spectra. , 2009, The Analyst.

[22]  Ching-Chiang Yeh,et al.  A Hybrid Model by Empirical Mode Decomposition and Support Vector Regression for Tourist Arrivals Forecasting , 2013 .

[23]  Romà Tauler,et al.  Chemometric strategy for untargeted lipidomics: biomarker detection and identification in stressed human placental cells. , 2015, Analytica chimica acta.

[24]  R. Poppi,et al.  Prediction of the distillation temperatures of crude oils using ¹H NMR and support vector regression with estimated confidence intervals. , 2015, Talanta.

[25]  Roberto Kawakami Harrop Galvão,et al.  Ensemble wavelet modelling for determination of wheat and gasoline properties by near and middle infrared spectroscopy. , 2010, Analytica chimica acta.

[26]  Yi-Zeng Liang,et al.  Monte Carlo cross validation , 2001 .

[27]  N. Filho,et al.  A new spectrophotometric method for determination of biodiesel content in biodiesel/diesel blends , 2015 .

[28]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[29]  Dong-Sheng Cao,et al.  A new strategy to prevent over-fitting in partial least squares models based on model population analysis. , 2015, Analytica chimica acta.

[30]  Jian-hui Jiang,et al.  MCCV stacked regression for model combination and fast spectral interval selection in multivariate calibration , 2007 .

[31]  M. Mørup,et al.  Non-linear calibration models for near infrared spectroscopy. , 2014, Analytica chimica acta.

[32]  Steven D. Brown,et al.  Dual‐domain regression analysis for spectral calibration models , 2003 .

[33]  Romà Tauler,et al.  Application of correlation constrained multivariate curve resolution alternating least-squares methods for determination of compounds of interest in biodiesel blends using NIR and UV-visible spectroscopic data. , 2014, Talanta.

[34]  Zhongyi Hu,et al.  Interval Forecasting of Electricity Demand: A Novel Bivariate EMD-based Support Vector Regression Modeling Framework , 2014, ArXiv.

[35]  Ke Wang,et al.  Bagging for robust non-linear multivariate calibration of spectroscopy , 2011 .

[36]  Xueguang Shao,et al.  Wavelet unfolded partial least squares for near-infrared spectral quantitative analysis of blood and tobacco powder samples. , 2011, The Analyst.

[37]  S. Peng,et al.  Dual stacked partial least squares for analysis of near-infrared spectra. , 2013, Analytica chimica acta.

[38]  D L Massart,et al.  Boosting partial least squares. , 2005, Analytical chemistry.

[39]  Peter D. Wentzell,et al.  Estimation of hydrocarbon types in light gas oils and diesel fuels by ultraviolet absorption spectroscopy and multivariate calibration , 1999 .

[40]  Lutgarde M. C. Buydens,et al.  Breaking with trends in pre-processing? , 2013 .

[41]  Menglong Li,et al.  Determination of nicotine in tobacco samples by near-infrared spectroscopy and boosting partial least squares , 2010 .

[42]  L. Buydens,et al.  Opening the kernel of kernel partial least squares and support vector machines. , 2011, Analytica chimica acta.