Insight in Genome-Wide Association of Metabolite Quantitative Traits by Exome Sequence Analyses

Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites were estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci in the metabolome wide significance: CPS1 with glycine (P-value  = 1.27×10−32), PRODH with proline (P-value  = 1.11×10−19), SLC16A9 with carnitine level (P-value  = 4.81×10−14) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value  = 1.65×10−19) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value  = 1.26×10−8), KCNJ16 with 3-hydroxybutyrate (P-value  = 1.65×10−8) and 2p12 locus with valine (P-value  = 3.49×10−8). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight in the genetics of intermediate phenotypes will provide additional insight into the genetics of complex traits.

[1]  R. Collins,et al.  Newly identified loci that influence lipid concentrations and risk of coronary artery disease , 2008, Nature Genetics.

[2]  Jaana M. Hartikainen,et al.  Large-scale genotyping identifies 41 new loci associated with breast cancer risk , 2013, Nature Genetics.

[3]  Jussi Paananen,et al.  Genetic Variants Associated With Glycine Metabolism and Their Role in Insulin Sensitivity and Type 2 Diabetes , 2013, Diabetes.

[4]  Yurii S. Aulchenko,et al.  A Genomic Background Based Method for Association Analysis in Related Individuals , 2007, PloS one.

[5]  W. Guan,et al.  Genome-Wide Association Study Identifies Novel Loci Associated With Concentrations of Four Plasma Phospholipid Fatty Acids in the De Novo Lipogenesis Pathway: Results From the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium , 2013, Circulation. Cardiovascular genetics.

[6]  Yun Li,et al.  Genome-wide association study of homocysteine levels in Filipinos provides evidence for CPS1 in women and a stronger MTHFR effect in young adults. , 2010, Human molecular genetics.

[7]  Tanya M. Teslovich,et al.  Biological, Clinical, and Population Relevance of 95 Loci for Blood Lipids , 2010, Nature.

[8]  Mark I. McCarthy,et al.  Genome-Wide Association Study Reveals Multiple Loci Associated with Primary Tooth Development during Infancy , 2010, PLoS genetics.

[9]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[10]  C. Hoggart,et al.  Genome-wide association analysis of metabolic traits in a birth cohort from a founder population , 2008, Nature Genetics.

[11]  J. Naylor,et al.  Mendelian inheritance in man: A catalog of human genes and genetic disorders , 1996 .

[12]  Ralf Herwig,et al.  ConsensusPathDB: toward a more complete picture of cell biology , 2010, Nucleic Acids Res..

[13]  Markus Perola,et al.  Genome-wide association study identifies multiple loci influencing human serum metabolite levels , 2012, Nature Genetics.

[14]  Andrew J. Saykin,et al.  Hippocampal Atrophy as a Quantitative Trait in a Genome-Wide Association Study Identifying Novel Susceptibility Genes for Alzheimer's Disease , 2009, PloS one.

[15]  Christian Gieger,et al.  A genome-wide association study of metabolic traits in human urine , 2011, Nature Genetics.

[16]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[17]  M. McCarthy,et al.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges , 2008, Nature Reviews Genetics.

[18]  Y. Pawitan,et al.  A Genome‐Wide Assessment of Variability in Human Serum Metabolism , 2013, Human mutation.

[19]  Shah Ebrahim,et al.  Common variants in the GDF5-UQCC region are associated with variation in human height , 2008, Nature Genetics.

[20]  Jennifer Mulle,et al.  A Genome-Wide Scan of Ashkenazi Jewish Crohn's Disease Suggests Novel Susceptibility Loci , 2012, PLoS genetics.

[21]  P Henneman,et al.  Prevalence and heritability of the metabolic syndrome and its individual components in a Dutch isolate: the Erasmus Rucphen Family study , 2008, Journal of Medical Genetics.

[22]  Jeffrey A Lieberman,et al.  Genome-Wide Pharmacogenomic Study of Neurocognition As an Indicator of Antipsychotic Treatment Response in Schizophrenia , 2011, Neuropsychopharmacology.

[23]  C. Gieger,et al.  Human metabolic individuality in biomedical and pharmaceutical research , 2011, Nature.

[24]  Christian Gieger,et al.  Loci influencing lipid levels and coronary heart disease risk in 16 European population cohorts , 2009, Nature Genetics.

[25]  Fabian J Theis,et al.  Genome-wide association analyses identify 18 new loci associated with serum urate concentrations , 2012, Nature Genetics.

[26]  L. Essioux,et al.  Investigation of single nucleotide polymorphisms and biological pathways associated with response to TNF&agr; inhibitors in patients with rheumatoid arthritis , 2012, Pharmacogenetics and genomics.

[27]  Christian Gieger,et al.  Genome-Wide Association Studies of Serum Magnesium, Potassium, and Sodium Concentrations Identify Six Loci Influencing Serum Magnesium Levels , 2010, PLoS genetics.

[28]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[29]  Yusuke Nakamura,et al.  A genome-wide association study identifies novel susceptibility genetic variation for thyrotoxic hypokalemic periodic paralysis , 2012, Journal of Human Genetics.

[30]  P. Donnelly,et al.  Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region , 2010 .

[31]  Pedro G. Ferreira,et al.  Transcriptome and genome sequencing uncovers functional variation in humans , 2013, Nature.

[32]  Lorna M. Lopez,et al.  A Meta-Analysis of Thyroid-Related Traits Reveals Novel Loci and Gender-Specific Differences in the Regulation of Thyroid Function , 2013, PLoS genetics.

[33]  David C. Wilson,et al.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease , 2012, Nature.

[34]  Yu-Cho Woo,et al.  Genome-wide association study identifies a susceptibility locus for thyrotoxic periodic paralysis at 17q24.3 , 2012, Nature Genetics.

[35]  Gary K. Chen,et al.  Correction: Identification, Replication, and Fine-Mapping of Loci Associated with Adult Height in Individuals of African Ancestry , 2011, PLoS Genetics.

[36]  R. Collins,et al.  Common variants at 30 loci contribute to polygenic dyslipidemia , 2009, Nature Genetics.

[37]  R. Kahn,et al.  The 22q11.2 deletion in children: high rate of autistic disorders and early onset of psychotic symptoms. , 2006, Journal of the American Academy of Child and Adolescent Psychiatry.

[38]  Christian Gieger,et al.  Common variants at ten loci modulate the QT interval duration in the QTSCD Study , 2009, Nature Genetics.

[39]  Anbupalam Thalamuthu,et al.  A combined analysis of genome-wide association studies in breast cancer , 2011, Breast Cancer Research and Treatment.

[40]  K. Shianna,et al.  Genomewide Association Study for Determinants of HIV-1 Acquisition and Viral Set Point in HIV-1 Serodiscordant Couples with Quantified Virus Exposure , 2011, PloS one.

[41]  Michael Jones,et al.  Novel breast cancer susceptibility locus at 9q31.2: results of a genome-wide association study. , 2011, Journal of the National Cancer Institute.

[42]  David M. Evans,et al.  Genome-Wide Association Study Identifies Four Loci Associated with Eruption of Permanent Teeth , 2011, PLoS genetics.

[43]  Paula J. Griffin,et al.  Genome-Wide Association for Abdominal Subcutaneous and Visceral Adipose Reveals a Novel Locus for Visceral Fat in Women , 2012, PLoS genetics.

[44]  Christian Gieger,et al.  Meta-Analysis of 28,141 Individuals Identifies Common Variants within Five New Loci That Influence Uric Acid Concentrations , 2009, PLoS genetics.

[45]  Uwe Völker,et al.  New loci associated with kidney function and chronic kidney disease , 2010, Nature Genetics.

[46]  Wilfred F. J. van IJcken,et al.  NARWHAL, a primary analysis pipeline for NGS data , 2012, Bioinform..

[47]  Richard A. Gibbs,et al.  Novel Genetic Loci Identified for the Pathophysiology of Childhood Obesity in the Hispanic Population , 2012, PloS one.

[48]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[49]  Michele Magrane,et al.  UniProt Knowledgebase: a hub of integrated protein data , 2011, Database J. Biol. Databases Curation.

[50]  Leena Peltonen,et al.  Variants in TF and HFE explain approximately 40% of genetic variation in serum-transferrin levels. , 2009, American journal of human genetics.

[51]  Christoph Steinbeck,et al.  Genome-Wide Association Study of Metabolic Traits Reveals Novel Gene-Metabolite-Disease Links , 2014, PLoS genetics.

[52]  C. Barnes,et al.  Genome-Wide Screen for Metabolic Syndrome Susceptibility Loci Reveals Strong Lipid Gene Contribution But No Evidence for Common Genetic Basis for Clustering of Metabolic Syndrome Traits , 2012, Circulation. Cardiovascular genetics.

[53]  K. Sirotkin,et al.  The NCBI dbGaP database of genotypes and phenotypes , 2007, Nature Genetics.

[54]  Christian Gieger,et al.  A genome-wide perspective of genetic variation in human metabolism , 2010, Nature Genetics.

[55]  Tariq Ahmad,et al.  Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47 , 2011, Nature Genetics.

[56]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[57]  Milton H. Saier,et al.  TCDB: the Transporter Classification Database for membrane transport protein analyses and information , 2005, Nucleic Acids Res..

[58]  P. Ridker,et al.  Novel Loci, Including Those Related to Crohn Disease, Psoriasis, and Inflammation, Identified in a Genome-Wide Association Study of Fibrinogen in 17 686 Women: The Women's Genome Health Study , 2009, Circulation. Cardiovascular genetics.

[59]  Niku Oksala,et al.  Novel Loci for Metabolic Networks and Multi-Tissue Expression Studies Reveal Genes for Atherosclerosis , 2012, PLoS genetics.

[60]  P. Ridker,et al.  Forty-Three Loci Associated with Plasma Lipoprotein Size, Concentration, and Cholesterol Content in Genome-Wide Analysis , 2009, PLoS genetics.

[61]  W. Willett,et al.  A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1) , 2009, Nature Genetics.

[62]  J. Sato,et al.  PRODH Polymorphisms, Cortical Volumes and Thickness in Schizophrenia , 2014, PloS one.

[63]  Peter Donnelly,et al.  A Genome-Wide Metabolic QTL Analysis in Europeans Implicates Two Loci Shaped by Recent Positive Selection , 2011, PLoS genetics.

[64]  Joseph T. Glessner,et al.  Common variants at 5q22 associate with pediatric eosinophilic esophagitis , 2010, Nature Genetics.

[65]  D. Gudbjartsson,et al.  Common variants on chromosomes 2q35 and 16q12 confer susceptibility to estrogen receptor–positive breast cancer , 2007, Nature Genetics.

[66]  Daniel J. Benjamin,et al.  The genetic architecture of economic and political preferences , 2012, Proceedings of the National Academy of Sciences.

[67]  L. Peltonen,et al.  A common variant near the KCNJ2 gene is associated with T-peak to T-end interval. , 2012, Heart rhythm.

[68]  C. Rotimi,et al.  UGT1A1 is a major locus influencing bilirubin levels in African Americans , 2011, European Journal of Human Genetics.

[69]  Donald W. Bowden,et al.  Genome-Wide Association Study of Coronary Heart Disease and Its Risk Factors in 8,090 African Americans: The NHLBI CARe Project , 2011, PLoS genetics.

[70]  Yurii S. Aulchenko,et al.  BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btm108 Genetics and population analysis GenABEL: an R library for genome-wide association analysis , 2022 .

[71]  Kyong-Ah Yoon,et al.  Prognostic implications of genetic variants in advanced non-small cell lung cancer: a genome-wide association study. , 2013, Carcinogenesis.

[72]  Ayellet V. Segrè,et al.  Hundreds of variants clustered in genomic loci and biological pathways affect human height , 2010, Nature.

[73]  Ron D. Appel,et al.  ExPASy: the proteomics server for in-depth protein knowledge and analysis , 2003, Nucleic Acids Res..

[74]  S. Cichon,et al.  Genome-Wide Association-, Replication-, and Neuroimaging Study Implicates HOMER1 in the Etiology of Major Depression , 2010, Biological Psychiatry.