Baseline and interferent correction by the Tikhonov regularization framework for linear least squares modeling

Spectroscopic data are usually perturbed by noise from various sources that should be removed prior to model calibration. After conducting a preprocessing step to eliminate unwanted multiplicative effects (effects that scale the pure signal in a multiplicative manner), we discuss how to correct a model for unwanted additive effects in the spectra. Our approach is described within the Tikhonov regularization (TR) framework for linear regression model building, and our focus is on ignoring the influence of noninformative polynomial trends. This is obtained by including an additional criterion in the TR problem penalizing the resulting regression coefficients away from a selected set of possibly disturbing directions in the sample space. The presented method builds on the extended multiplicative signal correction, and we compare the two approaches on several real data sets showing that the suggested TR‐based method may improve the predictive power of the resulting model. We discuss the possibilities of imposing smoothness in the calculation of regression coefficients as well as imposing selection of wavelength regions within the TR framework. To implement TR efficiently in the model building, we use an algorithm that is heavily based on the singular value decomposition. Because of some favorable properties of the singular value decomposition, it is possible to explore the models (including their generalized cross‐validation error estimates) associated with a large number of regularization parameter values at low computational cost.

[1]  H. Martens,et al.  Extended multiplicative signal correction and spectral interference subtraction: new preprocessing methods for near infrared spectroscopy. , 1991, Journal of pharmaceutical and biomedical analysis.

[2]  J. Kalivas,et al.  Interrelationships between generalized Tikhonov regularization, generalized net analyte signal, and generalized least squares for desensitizing a multivariate calibration to interferences , 2013 .

[3]  Achim Kohler,et al.  Extended multiplicative signal correction in vibrational spectroscopy, a tutorial , 2012 .

[4]  John H. Kalivas,et al.  Tikhonov regularization in standardized and general form for multivariate calibration with application towards removing unwanted spectral artifacts , 2006 .

[5]  Christopher D. Brown,et al.  Critical factors limiting the interpretation of regression vectors in multivariate calibration , 2009 .

[6]  Frank Vogt,et al.  Chemometric Correction of Drift Effects in Optical Spectra , 2004, Applied spectroscopy.

[7]  R. Barnes,et al.  Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra , 1989 .

[8]  Elling-Olav Rukke,et al.  Quantitative determination of saturated-, monounsaturated- and polyunsaturated fatty acids in pork adipose tissue with non-destructive Raman spectroscopy. , 2007, Meat science.

[9]  Frans van den Berg,et al.  Review of the most common pre-processing techniques for near-infrared spectra , 2009 .

[10]  T. Fearn,et al.  On the geometry of SNV and MSC , 2009 .

[11]  John H. Kalivas,et al.  Ridge regression optimization using a harmonious approach , 2004 .

[12]  Rekha Gautam,et al.  Review of multidimensional data processing approaches for Raman and infrared spectroscopy , 2015, EPJ Techniques and Instrumentation.

[13]  Kristian Hovde Liland,et al.  Customized baseline correction , 2011 .

[14]  Jens Petter Wold,et al.  Raman Spectra of Biological Samples: A Study of Preprocessing Methods , 2006, Applied spectroscopy.

[15]  Åsmund Rinnan,et al.  Pre-processing in vibrational spectroscopy – when, why and how , 2014 .

[16]  Lutgarde M. C. Buydens,et al.  Breaking with trends in pre-processing? , 2013 .

[17]  E. Martin,et al.  Extracting chemical information from spectral data with multiplicative light scattering effects by optical path-length estimation and correction. , 2006, Analytical chemistry.

[18]  John H. Kalivas,et al.  Overview of two‐norm (L2) and one‐norm (L1) Tikhonov regularization variants for full wavelength or sparse spectral multivariate calibration models or maintenance , 2012 .

[19]  Gene H. Golub,et al.  Generalized cross-validation as a method for choosing a good ridge parameter , 1979, Milestones in Matrix Computation.

[20]  K. Liland,et al.  Model-based pre-processing in Raman spectroscopy of biological samples , 2016 .

[21]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[22]  V. Segtnan,et al.  The potential of Raman spectroscopy for characterisation of the fatty acid unsaturation of salmon. , 2006, Analytica chimica acta.

[23]  P. Geladi,et al.  Linearization and Scatter-Correction for Near-Infrared Reflectance Spectra of Meat , 1985 .

[24]  Kristian Hovde Liland,et al.  Optimal Choice of Baseline Correction for Multivariate Calibration of Spectra , 2010, Applied spectroscopy.