Normal-Gamma-Bernoulli peak detection for analysis of comprehensive two-dimensional gas chromatography mass spectrometry data

Compared to other analytical platforms, comprehensive two-dimensional gas chromatography coupled with mass spectrometry (GC×GC-MS) has much increased separation power for analysis of complex samples and thus is increasingly used in metabolomics for biomarker discovery. However, accurate peak detection remains a bottleneck for wide applications of GC×GC-MS. Therefore, the normal-exponential-Bernoulli (NEB) model is generalized by gamma distribution and a new peak detection algorithm using the normal-gamma-Bernoulli (NGB) model is developed. Unlike the NEB model, the NGB model has no closed-form analytical solution, hampering its practical use in peak detection. To circumvent this difficulty, three numerical approaches, which are fast Fourier transform (FFT), the first-order and the second-order delta methods (D1 and D2), are introduced. The applications to simulated data and two real GC×GC-MS data sets show that the NGB-D1 method performs the best in terms of both computational expense and peak detection performance.

[1]  Stephen E. Reichenbach,et al.  Information technologies for comprehensive two-dimensional gas chromatography , 2004 .

[2]  S. Gregory,et al.  Comparison of GC-MS and GC×GC-MS in the analysis of human serum samples for biomarker discovery. , 2015, Journal of proteome research.

[3]  Sandra Plancade,et al.  Generalization of the normal-exponential model: exploration of a more accurate parametrisation for the signal distribution on Illumina BeadArrays , 2011, BMC Bioinformatics.

[4]  Philip J Marriott,et al.  Development of an algorithm for peak detection in comprehensive two-dimensional chromatography. , 2007, Journal of chromatography. A.

[5]  Christina Kendziorski,et al.  On Differential Variability of Expression Ratios: Improving Statistical Inference about Gene Expression Changes from Microarray Data , 2001, J. Comput. Biol..

[6]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[7]  Robin H. Schmidt,et al.  Metabolomic analysis of the effects of chronic arsenic exposure in a mouse model of diet-induced Fatty liver disease. , 2014, Journal of proteome research.

[8]  Jiri Adamec,et al.  Development of GCxGC/TOF-MS metabolomics for use in ecotoxicological studies with invertebrates. , 2008, Aquatic toxicology.

[9]  G. Vivó-Truyols,et al.  Bayesian approach for peak detection in two-dimensional chromatography. , 2012, Analytical chemistry.

[10]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[11]  Xinmin Yin,et al.  Olanzapine Activates Hepatic Mammalian Target of Rapamycin: New Mechanistic Insight into Metabolic Dysregulation with Atypical Antipsychotic Drugs , 2013, The Journal of Pharmacology and Experimental Therapeutics.

[12]  Ming Ouyang,et al.  A NEW METHOD OF PEAK DETECTION FOR ANALYSIS OF COMPREHENSIVE TWO-DIMENSIONAL GAS CHROMATOGRAPHY MASS SPECTROMETRY DATA. , 2014, The annals of applied statistics.

[13]  Xiang Zhang,et al.  Detection of an Extended Human Volatome with Comprehensive Two-Dimensional Gas Chromatography Time-of-Flight Mass Spectrometry , 2013, PloS one.