Metabomatching: Using genetic association to identify metabolites in proton NMR spectroscopy

A metabolome-wide genome-wide association study (mGWAS) aims to discover the effects of genetic variants on metabolome phenotypes. Most mGWASes use as phenotypes concentrations of limited sets of metabolites that can be identified and quantified from spectral information. In contrast, in an untargeted mGWAS both identification and quantification are forgone and, instead, all measured metabolome features are tested for association with genetic variants. While the untargeted approach does not discard data that may have eluded identification, the interpretation of associated features remains a challenge. To address this issue, we developed metabomatching to identify the metabolites underlying significant associations observed in untargeted mGWASes on proton NMR metabolome data. Metabomatching capitalizes on genetic spiking, the concept that because metabolome features associated with a genetic variant tend to correspond to the peaks of the NMR spectrum of the underlying metabolite, genetic association can allow for identification. Applied to the untargeted mGWASes in the SHIP and CoLaus cohorts and using 180 reference NMR spectra of the urine metabolome database, metabomatching successfully identified the underlying metabolite in 14 of 19, and 8 of 9 associations, respectively. The accuracy and efficiency of our method make it a strong contender for facilitating or complementing metabolomics analyses in large cohorts, where the availability of genetic, or other data, enables our approach, but targeted quantification is limited.

[1]  Gabi Kastenmüller,et al.  Biochemical insights from population studies with genetics and metabolomics. , 2016, Archives of biochemistry and biophysics.

[2]  Christian Gieger,et al.  Mining the Unknown: A Systems Approach to Metabolite Identification Combining Genetic and Metabolic Information , 2012, PLoS genetics.

[3]  Christian Gieger,et al.  Genetics Meets Metabolomics: A Genome-Wide Association Study of Metabolite Profiles in Human Serum , 2008, PLoS genetics.

[4]  W. Rathmann,et al.  Cohort profile: the study of health in Pomerania. , 2011, International journal of epidemiology.

[5]  Xavier Correig,et al.  Focus: a robust workflow for one-dimensional NMR spectral analysis. , 2014, Analytical chemistry.

[6]  David S. Wishart,et al.  HMDB 3.0—The Human Metabolome Database in 2013 , 2012, Nucleic Acids Res..

[7]  Harish Dharuri,et al.  Insight in Genome-Wide Association of Metabolite Quantitative Traits by Exome Sequence Analyses , 2015, PLoS genetics.

[8]  Markus Perola,et al.  Genome-wide association study identifies multiple loci influencing human serum metabolite levels , 2012, Nature Genetics.

[9]  Vincent Mooser,et al.  The CoLaus study: a population-based study to investigate the epidemiology and genetic determinants of cardiovascular risk factors and metabolic syndrome , 2008, BMC cardiovascular disorders.

[10]  Michael L. Raymer,et al.  Dynamic adaptive binning: an improved quantification technique for NMR spectroscopic data , 2011, Metabolomics.

[11]  Santosh Kumar Bharti,et al.  Quantitative 1H NMR spectroscopy , 2012 .

[12]  Christian Gieger,et al.  Genome-Wide Association Study with Targeted and Non-targeted NMR Metabolomics Identifies 15 Novel Loci of Urinary Human Metabolic Individuality , 2015, PLoS genetics.

[13]  Peter Donnelly,et al.  A Genome-Wide Metabolic QTL Analysis in Europeans Implicates Two Loci Shaped by Recent Positive Selection , 2011, PLoS genetics.

[14]  Christian Gieger,et al.  Identification and MS-assisted interpretation of genetically influenced NMR signals in human plasma , 2012, Genome Medicine.

[15]  K. Strimmer,et al.  Statistical Applications in Genetics and Molecular Biology A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics , 2011 .

[16]  C. Gieger,et al.  Human metabolic individuality in biomedical and pharmaceutical research , 2011, Nature.

[17]  Arnald Alonso,et al.  Analytical Methods in Untargeted Metabolomics: State of the Art in 2015 , 2015, Front. Bioeng. Biotechnol..

[18]  Christian Gieger,et al.  A genome-wide association study of metabolic traits in human urine , 2011, Nature Genetics.

[19]  Dan C. Tulpan,et al.  MetaboHunter: an automatic approach for identification of metabolites from 1H-NMR spectra of complex mixtures , 2011, BMC Bioinformatics.

[20]  Christian Gieger,et al.  Genetic variation in metabolic phenotypes: study designs and applications , 2012, Nature Reviews Genetics.

[21]  Christoph Steinbeck,et al.  Genome-Wide Association Study of Metabolic Traits Reveals Novel Gene-Metabolite-Disease Links , 2014, PLoS genetics.

[22]  Mark Harrison,et al.  Adaptive binning: An improved binning method for metabolomics data using the undecimated wavelet transform , 2007 .

[23]  Jean-Baptiste Cazier,et al.  mQTL.NMR: an integrated suite for genetic mapping of quantitative variations of (1)H NMR-based metabolic profiles. , 2015, Analytical chemistry.

[24]  Márcia M. C. Ferreira,et al.  Optimized bucketing for NMR spectra: Three case studies , 2013 .

[25]  C. Gieger,et al.  Genetics of human metabolism: an update , 2015, Human molecular genetics.

[26]  Daniel Raftery,et al.  Quantitating Metabolites in Protein Precipitated Serum Using NMR Spectroscopy , 2014, Analytical chemistry.

[27]  Christian Gieger,et al.  A genome-wide perspective of genetic variation in human metabolism , 2010, Nature Genetics.

[28]  Zerihun T. Dame,et al.  The Human Urine Metabolome , 2013, PloS one.

[29]  J. Everett A New Paradigm for Known Metabolite Identification in Metabonomics/Metabolomics: Metabolite Identification Efficiency , 2015, Computational and structural biotechnology journal.

[30]  Miron Livny,et al.  BioMagResBank , 2007, Nucleic Acids Res..

[31]  John P. Overington,et al.  An atlas of genetic influences on human blood metabolites , 2014, Nature Genetics.