Probabilistic assignment of formulas to mass peaks in metabolomics experiments

MOTIVATION High-accuracy mass spectrometry is a popular technology for high-throughput measurements of cellular metabolites (metabolomics). One of the major challenges is the correct identification of the observed mass peaks, including the assignment of their empirical formula, based on the measured mass. RESULTS We propose a novel probabilistic method for the assignment of empirical formulas to mass peaks in high-throughput metabolomics mass spectrometry measurements. The method incorporates information about possible biochemical transformations between the empirical formulas to assign higher probability to formulas that could be created from other metabolites in the sample. In a series of experiments, we show that the method performs well and provides greater insight than assignments based on mass alone. In addition, we extend the model to incorporate isotope information to achieve even more reliable formula identification. AVAILABILITY A supplementary document, Matlab code, data and further information are available from http://www.dcs.gla.ac.uk/inference/metsamp.

[1]  R. Breitling,et al.  Precision mapping of the metabolome. , 2006, Trends in biotechnology.

[2]  Rainer Breitling,et al.  Ab initio prediction of metabolic networks using Fourier transform mass spectrometry data , 2006, Metabolomics.

[3]  Guido Sanguinetti,et al.  MMG: a probabilistic tool to identify submodules of metabolic pathways , 2008, Bioinform..

[4]  Gerhard Kattner,et al.  Fundamentals of molecular formula assignment to ultrahigh resolution mass data of natural organic matter. , 2007, Analytical chemistry.

[5]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[6]  Oliver Fiehn,et al.  Metabolomic database annotations via query of elemental compositions: Mass accuracy is insufficient even at less than 1 ppm , 2006, BMC Bioinformatics.

[7]  Oliver Fiehn,et al.  Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry , 2007, BMC Bioinformatics.

[8]  Edward L Huttlin,et al.  Stable isotope assisted assignment of elemental compositions for metabolomics. , 2007, Analytical chemistry.

[9]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[10]  D. Kell,et al.  Metabolic profiling of serum using Ultra Performance Liquid Chromatography and the LTQ-Orbitrap mass spectrometry system. , 2008, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[11]  J. Rabinowitz,et al.  Analytical strategies for LC-MS-based targeted metabolomics. , 2008, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[12]  Elizabeth B Kujawinski,et al.  Automated analysis of electrospray ionization fourier transform ion cyclotron resonance mass spectra of natural organic matter. , 2006, Analytical chemistry.

[13]  Alexander Makarov,et al.  Dynamic range of mass accuracy in LTQ orbitrap hybrid mass spectrometer , 2006, Journal of the American Society for Mass Spectrometry.