A Fully Customized Baseline Removal Framework for Spectroscopic Applications

The task of proper baseline or continuum removal is common to nearly all types of spectroscopy. Its goal is to remove any portion of a signal that is irrelevant to features of interest while preserving any predictive information. Despite the importance of baseline removal, median or guessed default parameters are commonly employed, often using commercially available software supplied with instruments. Several published baseline removal algorithms have been shown to be useful for particular spectroscopic applications but their generalizability is ambiguous. The new Custom Baseline Removal (Custom BLR) method presented here generalizes the problem of baseline removal by combining operations from previously proposed methods to synthesize new correction algorithms. It creates novel methods for each technique, application, and training set, discovering new algorithms that maximize the predictive accuracy of the resulting spectroscopic models. In most cases, these learned methods either match or improve on the performance of the best alternative. Examples of these advantages are shown for three different scenarios: quantification of components in near-infrared spectra of corn and laser-induced breakdown spectroscopy data of rocks, and classification/matching of minerals using Raman spectroscopy. Software to implement this optimization is available from the authors. By removing subjectivity from this commonly encountered task, Custom BLR is a significant step toward completely automatic and general baseline removal in spectroscopic and other applications.

[1]  D. E. Aston,et al.  Automatic Baseline Subtraction of Vibrational Spectra Using Minima Identification and Discrimination via Adaptive, Least-Squares Thresholding , 2012, Applied spectroscopy.

[2]  Amos J Storkey,et al.  Machine Learning and Pattern Recognition : Preliminaries Course , 2009 .

[3]  W. Dietrich,et al.  Fast and precise automatic baseline correction of one- and two-dimensional nmr spectra , 1991 .

[4]  Golotvin,et al.  Improved baseline recognition and modeling of FT NMR spectra , 2000, Journal of magnetic resonance.

[5]  M. Friedrichs A model-free algorithm for the removal of baseline artifacts , 1995, Journal of biomolecular NMR.

[6]  M. D. Dyar,et al.  Interlaboratory and Cross-Instrument Comparison of Raman Spectra of 96 Minerals , 2016 .

[7]  Benjamin C. Pierce,et al.  Types and programming languages: the next generation , 2003, 18th Annual IEEE Symposium of Logic in Computer Science, 2003. Proceedings..

[8]  Manuel Martín-Pastor,et al.  A new general-purpose fully automatic baseline-correction procedure for 1D and 2D NMR data. , 2006, Journal of magnetic resonance.

[9]  Robert T. Downs,et al.  The power of databases: The RRUFF project , 2016 .

[10]  Sridhar Mahadevan,et al.  Automatic Whole-Spectrum Matching , 2015 .

[11]  Chen Chen,et al.  Selective iteratively reweighted quantile regression for baseline correction , 2014, Analytical and Bioanalytical Chemistry.

[12]  Sridhar Mahadevan,et al.  Comparison of univariate and multivariate models for prediction of major and minor elements from laser-induced breakdown spectra with and without masking , 2016 .

[13]  Wei Zhang,et al.  Baseline correction for Raman spectra using an improved asymmetric least squares method , 2014 .

[14]  Michael J. C. Gordon,et al.  Programming language theory and its implementation , 1988 .

[15]  Stewart Clegg,et al.  Optimization of laser-induced breakdown spectroscopy for rapid geochemical analysis , 2010 .

[16]  Fang Chen,et al.  A new automatic baseline correction method based on iterative method. , 2012, Journal of magnetic resonance.

[17]  Stewart Clegg,et al.  Recalibration of the Mars Science Laboratory ChemCam instrument with an expanded geochemical database , 2017 .

[18]  Kristian Hovde Liland,et al.  Optimal Choice of Baseline Correction for Multivariate Calibration of Spectra , 2010, Applied spectroscopy.

[19]  Gerald J. Sussman,et al.  Scheme: A Interpreter for Extended Lambda Calculus , 1998, High. Order Symb. Comput..

[20]  J. Mo,et al.  Baseline correction by improved iterative polynomial fitting with automatic threshold , 2006 .

[21]  Wojciech M. Kwiatek,et al.  Nonpolynomial approximation of background in X-ray spectra , 1987 .

[22]  Stefan Reimann,et al.  Robust extraction of baseline signal of atmospheric trace species using local regression , 2010 .

[23]  Christopher J. Rowlands,et al.  Automated algorithm for baseline subtraction in spectra , 2011 .

[24]  Yi-Zeng Liang,et al.  Baseline correction using adaptive iteratively reweighted penalized least squares. , 2010, The Analyst.

[25]  M. D. Dyar,et al.  Quantifying Mineral Abundances in Mixtures Using Raman Spectroscopy: Toward a Method for Spectral Unmixing , 2016 .

[26]  Limin Shao,et al.  Automatic baseline correction by wavelet transform for quantitative open-path Fourier transform infrared spectroscopy. , 2007, Environmental science & technology.

[27]  Qing-Song Xu,et al.  Morphological weighted penalized least squares for background correction. , 2013, The Analyst.

[28]  Andrew Jirasek,et al.  Investigation of Selected Baseline Removal Techniques as Candidates for Automated Implementation , 2005, Applied spectroscopy.

[29]  S. Giguere,et al.  Comparison of baseline removal methods for laser-induced breakdown spectroscopy of geological samples , 2016 .

[30]  Sridhar Mahadevan,et al.  Machine learning tools for mineral recognition and , 2015 .

[31]  Dennis Tasa,et al.  Mineralogy And Optical Mineralogy , 2007 .

[32]  A. Mahadevan-Jansen,et al.  Automated Method for Subtraction of Fluorescence from Biological Raman Spectra , 2003, Applied spectroscopy.

[33]  Harold J. Annegarn,et al.  Algorithm for fitting XRF, SEM and PIXE X-ray spectra backgrounds , 1996 .