Goldindec: A Novel Algorithm for Raman Spectrum Baseline Correction

Raman spectra have been widely used in biology, physics, and chemistry and have become an essential tool for the studies of macromolecules. Nevertheless, the raw Raman signal is often obscured by a broad background curve (or baseline) due to the intrinsic fluorescence of the organic molecules, which leads to unpredictable negative effects in quantitative analysis of Raman spectra. Therefore, it is essential to correct this baseline before analyzing raw Raman spectra. Polynomial fitting has proven to be the most convenient and simplest method and has high accuracy. In polynomial fitting, the cost function used and its parameters are crucial. This article proposes a novel iterative algorithm named Goldindec, freely available for noncommercial use as noted in text, with a new cost function that not only conquers the influence of great peaks but also solves the problem of low correction accuracy when there is a high peak number. Goldindec automatically generates parameters from the raw data rather than by empirical choice, as in previous methods. Comparisons with other algorithms on the benchmark data show that Goldindec has a higher accuracy and computational efficiency, and is hardly affected by great peaks, peak number, and wavenumber.

[1]  A. Mahadevan-Jansen,et al.  Automated Method for Subtraction of Fluorescence from Biological Raman Spectra , 2003, Applied spectroscopy.

[2]  Jérôme Idier,et al.  Convex half-quadratic criteria and interacting auxiliary variables for image restoration , 2001, IEEE Trans. Image Process..

[3]  J. Duguid,et al.  Raman spectroscopy of DNA-metal complexes. I. Interactions and conformational effects of the divalent cations: Mg, Ca, Sr, Ba, Mn, Co, Ni, Cu, Pd, and Cd. , 1993, Biophysical journal.

[4]  R. Richards-Kortum,et al.  Raman spectroscopy for the detection of cancers and precancers. , 1996, Journal of biomedical optics.

[5]  Vladimir Shusterman,et al.  Enhancing the Precision of ECG Baseline Correction: Selective Filtering and Removal of Residual Error , 2000, Comput. Biomed. Res..

[6]  Yi-Zeng Liang,et al.  Peak alignment using wavelet pattern matching and differential evolution. , 2011, Talanta.

[7]  H. Akaike A new look at the statistical model identification , 1974 .

[8]  Steven D. Brown,et al.  Robust Calibration with Respect to Background Variation , 2001 .

[9]  P. Gemperline,et al.  Multivariate background correction for hyphenated chromatography detectors , 1999 .

[10]  Habtom W. Ressom,et al.  Peak selection from MALDI-TOF mass spectra using ant colony optimization , 2007, Bioinform..

[11]  G. Hortin,et al.  Data and text mining Peak selection from MALDI-TOF mass spectra using ant colony optimization , 2007 .

[12]  Benli Yu,et al.  A Review of Signal Enhancement and Noise Reduction Techniques for Tunable Diode Laser Absorption Spectroscopy , 2014 .

[13]  T. Vickers,et al.  Curve Fitting and Linearity: Data Processing in Raman Spectroscopy , 2001 .

[14]  S. Lieberman,et al.  Fluorescence Rejection in Raman Spectroscopy by Shifted-Spectra, Edge Detection, and FFT Filtering Techniques , 1995 .

[15]  Da Chen,et al.  Adaptive wavelet transform suppresses background and noise for quantitative analysis by Raman spectrometry , 2011, Analytical and bioanalytical chemistry.

[16]  R. P. Goehner,et al.  Background subtract subroutine for spectral data , 1978 .

[17]  Heng Huang,et al.  Mass spectrometry data processing using zero-crossing lines in multi-scale of Gaussian derivative wavelet , 2010, Bioinform..

[18]  D. McLean,et al.  Automated Autofluorescence Background Subtraction Algorithm for Biomedical Raman Spectroscopy , 2007, Applied Spectroscopy.

[19]  Aiguo Shen,et al.  A background elimination method based on linear programming for Raman spectra , 2011 .

[20]  J. Greve,et al.  Raman microspectroscopic study of low-pH-induced changes in DNA structure of polytene chromosomes. , 1994, Biochemistry.

[21]  R. Tuma Raman spectroscopy of proteins: from peptides to large assemblies , 2005 .

[22]  Milos Doroslovacki,et al.  Signal Decomposition of Transmembrane Voltage-Sensitive Dye Fluorescence Using a Multiresolution Wavelet Analysis , 2011, IEEE Transactions on Biomedical Engineering.

[23]  Julian Moger,et al.  A Bayesian Whittaker–Henderson smoother for general-purpose and sample-based spectral baseline estimation and peak extraction , 2012 .

[24]  Benli Yu,et al.  Wavelet Transform Based on the Optimal Wavelet Pairs for Tunable Diode Laser Absorption Spectroscopy Signal Processing , 2015, Applied spectroscopy.

[25]  D. Geman,et al.  Nonlinear Image Recovery with , 1995 .

[26]  Lu Wang,et al.  The prediction of nitrogen concentration in soil by VNIR reflectance spectrum , 2005, Proceedings. 2005 IEEE International Geoscience and Remote Sensing Symposium, 2005. IGARSS '05..

[27]  Steven D. Brown,et al.  Wavelet analysis applied to removing non‐constant, varying spectroscopic background in multivariate calibration , 2002 .

[28]  Mass spectrometry-based proteomic analysis of Kashin-Beck disease. , 2010, Molecular medicine reports.

[29]  Andrej Likar,et al.  A peak-search method based on spectrum convolution , 2003 .

[30]  Vincent Mazet,et al.  Background removal from spectra by designing and minimising a non-quadratic cost function , 2005 .

[31]  Yi-Zeng Liang,et al.  Baseline correction using adaptive iteratively reweighted penalized least squares. , 2010, The Analyst.

[32]  Premananda Indic,et al.  Wavelet based algorithm for the estimation of frequency flow from electroencephalogram data during epileptic seizure , 2011, Clinical Neurophysiology.

[33]  Andrew Jirasek,et al.  Investigation of Selected Baseline Removal Techniques as Candidates for Automated Implementation , 2005, Applied spectroscopy.

[34]  Pan Du,et al.  Bioinformatics Original Paper Improved Peak Detection in Mass Spectrum by Incorporating Continuous Wavelet Transform-based Pattern Matching , 2022 .