Improvement of NIR model by fractional order Savitzky–Golay derivation (FOSGD) coupled with wavelength selection

Abstract Spectral pretreatment is of great importance in near infrared (NIR) spectral analysis since NIR spectra of samples almost always contain overlapped bands due to different chemical compositions of the samples, which may strongly affect the performance of the analysis system. Derivation is a good and commonly used spectral pretreatment method, which may enhance spectral resolution with increase of derivative order, but reduce strength of the spectral signals meanwhile. In this study, the derivative method of fractional order Savitzky–Golay derivation (FOSGD) and the wavelength selection method of stability competitive adaptive reweighted sampling (SCARS) were coupled to optimize the NIR spectral model. FOSGD could use a decimal number between two adjacent integral numbers as the derivative order to supply a better chance to balance the contradiction of resolution and signal strength than integer order Savitzky–Golay derivation (IOSGD). And wavelength selection could efficiently extract the informative variables with improved resolution and eliminate the influence of the uninformative variables. Three kinds of NIR datasets including simulated datasets, diesel dataset and tobacco dataset were utilized to assess this method. The results showed that FOSGD–SCARS had better performance on optimizing PLS models with smaller RMSECV and RMSEP values than FOSGD or SCARS. Comparing with IOSGD, FOSGD often shows greater advantages, especially for the interest components with narrower bandwidth. This method is convenient and has strong application potential in spectral analysis.

[1]  J. Schmitt Fractional Derivative Analysis of Diffuse Reflectance Spectra , 1998 .

[2]  Xueguang Shao,et al.  Rapid and nondestructive analysis of pharmaceutical products using near-infrared diffuse reflectance spectroscopy. , 2012, Journal of pharmaceutical and biomedical analysis.

[3]  Ainara López,et al.  A review of the application of near-infrared spectroscopy for the analysis of potatoes. , 2013, Journal of agricultural and food chemistry.

[4]  D. K. Buslov Modification of Derivatives for Resolution Enhancement of Bands in Overlapped Spectra , 2004, Applied spectroscopy.

[5]  Jian-hui Jiang,et al.  Spectral regions selection to improve prediction ability of PLS models by changeable size moving window partial least squares and searching combination moving window partial least squares , 2004 .

[6]  H. H. Madden Comments on the Savitzky-Golay convolution method for least-squares-fit smoothing and differentiation of digital data , 1976 .

[7]  R. Teófilo,et al.  Sorting variables by using informative vectors as a strategy for feature selection in multivariate regression , 2009 .

[8]  Di Wu,et al.  Determination of alpha-linolenic acid and linoleic acid in edible oils using near-infrared spectroscopy improved by wavelet transform and uninformative variable elimination. , 2009, Analytica chimica acta.

[9]  D. Massart,et al.  Elimination of uninformative variables for multivariate calibration. , 1996, Analytical chemistry.

[10]  Gabriele Reich,et al.  Optimization of near-infrared spectroscopic process monitoring at low signal-to-noise ratio. , 2011, Analytical chemistry.

[11]  K. Peiponen,et al.  Phase retrieval of reflectance for nanoparticle optical identification. , 2012, Optics letters.

[12]  Kaiyi Zheng,et al.  Stability competitive adaptive reweighted sampling (SCARS) and its applications to multivariate calibration of NIR spectra , 2012 .

[13]  Dong-Sheng Cao,et al.  A strategy that iteratively retains informative variables for selecting optimal variable subset in multivariate calibration. , 2014, Analytica chimica acta.

[14]  Philippe Lagacherie,et al.  Continuum removal versus PLSR method for clay and calcium carbonate content estimation from laboratory and airborne hyperspectral measurements , 2008 .

[15]  R. Hilfer Applications Of Fractional Calculus In Physics , 2000 .

[16]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[17]  Yiping Du,et al.  Multivariate calibration of on-line enrichment near-infrared (NIR) spectra and determination of trace lead in water , 2009 .

[18]  Hamid A. Jalab,et al.  Texture Enhancement Based on the Savitzky-Golay Fractional Differential Operator , 2013 .

[19]  Romà Tauler,et al.  Application of the local regression method interval partial least-squares to the elucidation of protein secondary structure. , 2005, Analytical biochemistry.

[20]  Y. Aizu,et al.  Digital holographic fractal speckle , 2013 .

[21]  Yizeng Liang,et al.  A perspective demonstration on the importance of variable selection in inverse calibration for complex analytical systems. , 2013, The Analyst.

[22]  Kaiyi Zheng,et al.  Pretreating near infrared spectra with fractional order Savitzky–Golay differentiation (FOSGD) , 2015 .

[23]  V. P. Petrov,et al.  Deconvolution versus Derivative Spectroscopy , 1989 .

[24]  Shuangyan Huan,et al.  Preliminary study on the application of near infrared spectroscopy and pattern recognition methods to classify different types of apple samples. , 2011, Food chemistry.

[25]  Riccardo Leardi,et al.  Genetic Algorithms as a Tool for Wavelength Selection in Multivariate Calibration , 1995 .

[26]  Zhenqi Shi,et al.  Scattering orthogonalization of near-infrared spectra for analysis of pharmaceutical tablets. , 2009, Analytical chemistry.

[27]  Jun Uozumi,et al.  Computer-generated holograms for producing fractal speckles , 2010 .

[28]  Biao Huang,et al.  Recursive Wavelength-Selection Strategy to Update Near-Infrared Spectroscopy Model with an Industrial Application , 2013 .

[29]  T. Næs,et al.  The Effect of Multiplicative Scatter Correction (MSC) and Linearity Improvement in NIR Spectroscopy , 1988 .

[30]  J. Gottfries,et al.  Quantitative analysis of film coating in a fluidized bed process by in-line NIR spectrometry and multivariate batch calibration. , 2000, Analytical chemistry.

[31]  R. Barnes,et al.  Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra , 1989 .

[32]  Qing-Song Xu,et al.  Random frog: an efficient reversible jump Markov Chain Monte Carlo-like approach for variable selection with applications to gene selection and disease classification. , 2012, Analytica chimica acta.

[33]  Jiewen Zhao,et al.  Measurement of total flavone content in snow lotus (Saussurea involucrate) using near infrared spectroscopy combined with interval PLS and genetic algorithm. , 2010, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[34]  Israel Schechter,et al.  Wavelength Selection for Simultaneous Spectroscopic Analysis. Experimental and Theoretical Study , 1996 .

[35]  Rafael Font,et al.  Visible and near-infrared spectroscopy as a technique for screening the inorganic arsenic content in the red crayfish (procambarus clarkii Girard). , 2004, Analytical chemistry.

[36]  Ning Wang,et al.  Early detection of apple bruises on different background colors using hyperspectral imaging , 2008 .

[37]  Error Reduction in Spectrum Estimation by Means of Concentration-Spectrum Correlation , 1990 .

[38]  Zhi-Zhong Sun,et al.  A new fractional numerical differentiation formula to approximate the Caputo fractional derivative and its applications , 2014, J. Comput. Phys..

[39]  Susan L. Rose-Pehrsson,et al.  Rapid Fuel Quality Surveillance Through Chemometric Modeling of Near-Infrared Spectra , 2009 .

[40]  Hongdong Li,et al.  Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. , 2009, Analytica chimica acta.

[41]  Ronald R. Coifman,et al.  The prediction error in CLS and PLS: the importance of feature selection prior to multivariate calibration , 2005 .

[42]  S. Wold,et al.  Wavelength interval selection in multicomponent spectral analysis by moving window partial least-squares regression with applications to mid-infrared and near-infrared spectroscopic data. , 2002, Analytical chemistry.

[43]  W. Cai,et al.  A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra , 2008 .

[44]  Nathalie Dupuy,et al.  Automated principal component-based orthogonal signal correction applied to fused near infrared-mid-infrared spectra of French olive oils. , 2009, Analytical chemistry.

[45]  Gerard Downey,et al.  Feasibility study on the use of visible-near-infrared spectroscopy for the screening of individual and total glucosinolate contents in broccoli. , 2012, Journal of agricultural and food chemistry.

[46]  James W. McNicol,et al.  A designed experiment for the examination of techniques used in the analysis of near infrared spectra. Part 1. Analysis of spectral structure , 1985 .

[47]  C. Spiegelman,et al.  Theoretical Justification of Wavelength Selection in PLS Calibration:  Development of a New Algorithm. , 1998, Analytical Chemistry.

[48]  Yangquan Chen,et al.  Digital Fractional Order Savitzky-Golay Differentiator , 2011, IEEE Transactions on Circuits and Systems II: Express Briefs.

[49]  M. Räsänen,et al.  Development and validation of a near-infrared method for the quantitation of caffeine in intact single tablets. , 2003, Analytical chemistry.

[50]  Dong-Sheng Cao,et al.  An efficient method of wavelength interval selection based on random frog for multivariate spectral calibration. , 2013, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[51]  J. Staggs Savitzky-Golay smoothing and numerical differentiation of cone calorimeter mass data , 2005 .

[52]  R. Yu,et al.  An ensemble of Monte Carlo uninformative variable elimination for wavelength selection. , 2008, Analytica chimica acta.

[53]  D. I. Kamalova,et al.  Resolution Enhancement of Composite Spectra with Fractal Noise in Derivative Spectrometry , 2000 .