Optimal Choice of Baseline Correction for Multivariate Calibration of Spectra

Baselines are often chosen by visual inspection of their effect on selected spectra. A more objective procedure for choosing baseline correction algorithms and their parameter values for use in statistical analysis is presented. When the goal of the baseline correction is spectra with a pleasing appearance, visual inspection can be a satisfactory approach. If the spectra are to be used in a statistical analysis, objectivity and reproducibility are essential for good prediction. Variations in baselines from dataset to dataset means we have no guarantee that the best-performing algorithm from one analysis will be the best when applied to a new dataset. This paper focuses on choosing baseline correction algorithms and optimizing their parameter values based on the performance of the quality measure from the given analysis. Results presented in this paper illustrate the potential benefits of the optimization and points out some of the possible pitfalls of baseline correction.

[1]  Kristian Hovde Liland,et al.  Quantitative whole spectrum analysis with MALDI-TOF MS, Part II: Determining the concentration of milk in mixtures , 2009 .

[2]  Rasmus Bro,et al.  Exploring complex interactions in designed data using GEMANOVA. Color changes in fresh beef during storage , 2002 .

[3]  T. Næs,et al.  Ensemble methods and partial least squares regression , 2004 .

[4]  Kristian Hovde Liland,et al.  Testing effects of experimental design factors using multi-way analysis , 2009 .

[5]  P. Eilers Parametric time warping. , 2004, Analytical chemistry.

[6]  A. Mahadevan-Jansen,et al.  Automated Method for Subtraction of Fluorescence from Biological Raman Spectra , 2003, Applied spectroscopy.

[7]  M. Friedrichs A model-free algorithm for the removal of baseline artifacts , 1995, Journal of biomolecular NMR.

[8]  Jens Petter Wold,et al.  Raman Spectra of Biological Samples: A Study of Preprocessing Methods , 2006, Applied spectroscopy.

[9]  Robert W. Field,et al.  Baseline subtraction using robust local regression estimation , 2001 .

[10]  Jeffrey S. Morris,et al.  Improved peak detection and quantification of mass spectrometry data acquired from surface‐enhanced laser desorption and ionization by denoising spectra with the undecimated discrete wavelet transform , 2005, Proteomics.

[11]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[12]  Jeffrey S. Morris,et al.  Quality control and peak finding for proteomics data collected from nipple aspirate fluid by surface-enhanced laser desorption and ionization. , 2003, Clinical chemistry.

[13]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[14]  J. Mo,et al.  Baseline correction by improved iterative polynomial fitting with automatic threshold , 2006 .