Mapping the Genetic Architecture of Gene Expression in Human Liver

Genetic variants that are associated with common human diseases do not lead directly to disease, but instead act on intermediate molecular phenotypes that in turn induce changes in higher-order disease traits. Identifying the molecular phenotypes that vary in response to changes in DNA and that also associate with changes in disease traits can therefore provide the functional information required not only to identify and validate the susceptibility genes directly affected by changes in DNA, but also to understand the molecular networks in which such genes operate and how changes in these networks lead to changes in disease traits. Toward that end, we profiled more than 39,000 transcripts and genotyped 782,476 unique single nucleotide polymorphisms (SNPs) in more than 400 human liver samples to characterize the genetic architecture of gene expression in the human liver, a metabolically active tissue that is important in a number of common human diseases, including obesity, diabetes, and atherosclerosis. This genome-wide association study of gene expression detected more than 6,000 associations between SNP genotypes and liver gene expression traits, and many of the corresponding genes have already been implicated in human diseases. We demonstrate the utility of these data for elucidating the causes of common human diseases by integrating them with genotypic and expression data from other human and mouse populations. This integration provides much-needed functional support for the candidate susceptibility genes at the growing number of genetic loci that genome-wide association studies have identified as key drivers of disease.
Using an integrative genomics approach, we show that our data support RPS26, and not ERBB3, as the most likely susceptibility gene for a novel type 1 diabetes locus recently identified in a large-scale, genome-wide association study. We also identify SORT1 and CELSR2 as candidate susceptibility genes for a locus recently associated with coronary artery disease and plasma low-density lipoprotein cholesterol levels.
