Probabilistic peak detection for first-order chromatographic data.

We present a novel algorithm for probabilistic peak detection in first-order chromatographic data. Unlike conventional methods that deliver a binary answer pertaining to the expected presence or absence of a chromatographic peak, our method calculates the probability of a point being affected by such a peak. The algorithm makes use of chromatographic information (i.e. the expected width of a single peak and the standard deviation of baseline noise). As prior information of the existence of a peak in a chromatographic run, we make use of the statistical overlap theory. We formulate an exhaustive set of mutually exclusive hypotheses concerning presence or absence of different peak configurations. These models are evaluated by fitting a segment of chromatographic data by least-squares. The evaluation of these competing hypotheses can be performed as a Bayesian inferential task. We outline the potential advantages of adopting this approach for peak detection and provide several examples of both improved performance and increased flexibility afforded by our approach.

[1]  David Brynn Hibbert,et al.  An introduction to Bayesian methods for analyzing chemistry data Part II: A review of applications of Bayesian methods in chemistry , 2009 .

[2]  Glenn Shafer Scientific Reasoning: The Bayesian Approach , 2007 .

[3]  Anders Hald,et al.  On the history of maximum likelihood in relation to inverse probability and least squares , 1999 .

[4]  D. O. North,et al.  An Analysis of the factors which determine signal/noise discrimination in pulsed-carrier systems , 1963 .

[5]  Ronald L. Graham,et al.  Concrete mathematics - a foundation for computer science , 1991 .

[6]  David Brynn Hibbert,et al.  An introduction to Bayesian methods for analyzing chemistry data: Part 1: An introduction to Bayesian theory and methods , 2009 .

[7]  Hans F. M. Boelens,et al.  Quantification of chromatographic data using a matched filter: robustness towards noise model errors , 1993 .

[8]  Cavan Reilly,et al.  A Bayesian approach to the alignment of mass spectra , 2009, Bioinform..

[9]  J. R. Torres-Lapasió,et al.  Automatic program for peak detection and deconvolution of multi-overlapped chromatographic signals part II: peak model and deconvolution algorithms. , 2005, Journal of chromatography. A.

[10]  Christophe Collet,et al.  A unified framework for peak detection and alignment: application to HR-MAS 2D NMR spectroscopy , 2013, Signal Image Video Process..

[11]  Glenn Shafer,et al.  Scientific Reasoning: The Bayesian Approach (3rd ed.), Colin Howson and Peter Urbach , 2007 .

[12]  David R. Bickel,et al.  THE STRENGTH OF STATISTICAL EVIDENCE FOR COMPOSITE HYPOTHESES: INFERENCE TO THE BEST EXPLANATION , 2010 .

[13]  Joe M. Davis,et al.  Statistical theory of component overlap in multicomponent chromatograms , 1983 .

[14]  Leepika Tuli,et al.  Analysis of LC-MS data for characterizing the metabolic changes in response to radiation. , 2010, Journal of proteome research.

[15]  Thomas Hankemeier,et al.  Instrument and process independent binning and baseline correction methods for liquid chromatography-high resolution-mass spectrometry deconvolution. , 2012, Analytica chimica acta.

[16]  Simon J. Godsill,et al.  Bayesian inference for multidimensional NMR image reconstruction , 2006, 2006 14th European Signal Processing Conference.

[17]  C. A. Hastings,et al.  New algorithms for processing and peak detection in liquid chromatography/mass spectrometry data. , 2002, Rapid communications in mass spectrometry : RCM.

[18]  T. Rejtar,et al.  A universal denoising and peak picking algorithm for LC-MS based on matched filtration in the chromatographic time domain. , 2003, Analytical chemistry.

[19]  J. Knoll Estimation of the limit of detection in chromatography , 1985 .

[20]  Attila Felinger,et al.  Data Analysis and Signal Processing in Chromatography , 2011 .

[21]  B. Bakshi,et al.  Toward Bayesian chemometrics--a tutorial on some recent advances. , 2007, Analytica chimica acta.

[22]  J. Listgarten,et al.  Statistical and Computational Methods for Comparative Proteomic Profiling Using Liquid Chromatography-Tandem Mass Spectrometry , 2005, Molecular & Cellular Proteomics.

[23]  G. Vivó-Truyols,et al.  Bayesian approach for peak detection in two-dimensional chromatography. , 2012, Analytical chemistry.

[24]  Paul H. C. Eilers,et al.  Mixture models for baseline estimation , 2012 .

[25]  B. Kowalski,et al.  Theory of analytical chemistry , 1994 .

[26]  Karisa M. Pierce,et al.  A Review of Chemometrics Applied to Comprehensive Two-dimensional Separations from 2008–2010 , 2012 .

[27]  D L Massart,et al.  Automatic program for peak detection and deconvolution of multi-overlapped chromatographic signals part I: peak detection. , 2005, Journal of chromatography. A.

[28]  Rolf Danielsson,et al.  Matched filtering with background suppression for improved quality of base peak chromatograms and mass spectra in liquid chromatography - mass spectrometry , 2002 .

[29]  Bart M. ter Haar Romeny,et al.  Front-End Vision and Multi-Scale Image Analysis , 2003, Computational Imaging and Vision.

[30]  Karin Noy,et al.  Shape-Based Feature Matching Improves Protein Identification via LC-MS and Tandem MS , 2011, J. Comput. Biol..

[31]  Jacob D. Jaffe,et al.  MapQuant: Open‐source software for large‐scale protein quantification , 2006, Proteomics.

[32]  Erin E. Carlson,et al.  Targeted profiling: quantitative analysis of 1H NMR metabolomics data. , 2006, Analytical chemistry.

[33]  J. R. Torres-Lapasió,et al.  Limits of multi-linear gradient optimisation in reversed-phase liquid chromatography. , 2005, Journal of chromatography. A.

[34]  V B Di Marco,et al.  Mathematical functions for the representation of chromatographic peaks. , 2001, Journal of chromatography. A.