Integrating Genome-Wide Association and eQTLs Studies Identifies the Genes and Gene Sets Associated with Diabetes

Aim To identify novel candidate genes and gene sets for diabetes. Methods We performed an integrative analysis of genome-wide association studies (GWAS) and expression quantitative trait loci (eQTLs) data for diabetes. Summary data was driven from a large-scale GWAS of diabetes, totally involving 58,070 individuals. eQTLs dataset included 923,021 cis-eQTL for 14,329 genes and 4,732 trans-eQTL for 2,612 genes. Integrative analysis of GWAS and eQTLs data was conducted by summary data-based Mendelian randomization (SMR). To identify the gene sets associated with diabetes, the SMR single gene analysis results were further subjected to gene set enrichment analysis (GSEA). A total of 13,311 annotated gene sets were analyzed in this study. Results SMR analysis identified 6 genes significantly associated with fasting glucose, such as C11ORF10 (p value = 6.04 × 10−8), MRPL33 (p value = 1.24 × 10−7), and FADS1 (p value = 2.39 × 10−7). Gene set analysis identified HUANG_FOXA2_TARGETS_UP (false discovery rate = 0.047) associated with fasting glucose. Conclusion Our study provides novel clues for clarifying the genetic mechanism of diabetes. This study also illustrated the good performance of SMR approach and extended it to gene set association analysis for complex diseases.

[1]  P. Poulsen,et al.  A common variation of the PTEN gene is associated with peripheral insulin resistance. , 2016, Diabetes & metabolism.

[2]  May E. Montasser,et al.  Genome-Wide Association Study of the Modified Stumvoll Insulin Sensitivity Index Identifies BCL2 and FAM19A2 as Novel Insulin Sensitivity Loci , 2016, Diabetes.

[3]  Z. Ding,et al.  Fatty acid desaturase 1 knockout mice are lean with improved glycemic control and decreased development of atheromatous plaque , 2016, Diabetes, metabolic syndrome and obesity : targets and therapy.

[4]  Donald W. Bowden,et al.  Mapping adipose and muscle tissue expression quantitative trait loci in African Americans to identify genes for type 2 diabetes and obesity , 2016, Human Genetics.

[5]  J. Auwerx,et al.  KAT2B Is Required for Pancreatic Beta Cell Adaptation to Metabolic Stress by Controlling the Unfolded Protein Response. , 2016, Cell reports.

[6]  P. Visscher,et al.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets , 2016, Nature Genetics.

[7]  Yan Wen,et al.  PAPA: a flexible tool for identifying pleiotropic pathways using genome-wide association study summaries , 2016, Bioinform..

[8]  L. Hou,et al.  Polymorphisms of rs174616 in the FADS1-FADS2 gene cluster is associated with a reduced risk of type 2 diabetes mellitus in northern Han Chinese people. , 2015, Diabetes research and clinical practice.

[9]  A. Chan,et al.  Association of TNFAIP3 and TNFRSF1A variation with multiple sclerosis in a German case–control cohort , 2015, International journal of immunogenetics.

[10]  Xuchu Hu,et al.  Analysis of global gene expression profiles suggests a role of acute inflammation in type 3C diabetes mellitus caused by pancreatic ductal adenocarcinoma , 2015, Diabetologia.

[11]  Lindsey J. Leach,et al.  Genome-wide eQTLs and heritability for gene expression traits in unrelated individuals , 2014, BMC Genomics.

[12]  M. Peters,et al.  Systematic identification of trans eQTLs as putative drivers of known disease associations , 2013, Nature Genetics.

[13]  M. Vohl,et al.  Polymorphisms in Fatty Acid Desaturase (FADS) Gene Cluster: Effects on Glycemic Controls Following an Omega-3 Polyunsaturated Fatty Acids (PUFA) Supplementation , 2013, Genes.

[14]  Philip Rosenstiel,et al.  The large non-coding RNA ANRIL, which is associated with atherosclerosis, periodontitis and several forms of cancer, regulates ADIPOR1, VAMP3 and C11ORF10. , 2013, Human molecular genetics.

[15]  Claude Bouchard,et al.  A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance , 2012, Nature Genetics.

[16]  I. Borecki,et al.  Meta‐analysis of gene‐environment interaction: joint estimation of SNP and SNP × environment regression coefficients , 2011, Genetic epidemiology.

[17]  G. Abecasis,et al.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes , 2010, Genetic epidemiology.

[18]  D. Balding,et al.  A Genome-Wide Association Study of the Metabolic Syndrome in Indian Asian Men , 2010, PloS one.

[19]  M. Tsujimoto,et al.  Adaptor protein sorting nexin 17 interacts with the scavenger receptor FEEL-1/stabilin-1 and modulates its expression on the cell surface. , 2010, Biochimica et biophysica acta.

[20]  N. Cox,et al.  Trait-Associated SNPs Are More Likely to Be eQTLs: Annotation to Enhance Discovery from GWAS , 2010, PLoS genetics.

[21]  T. Hansen,et al.  Association of Variants in the Sterol Regulatory Element-Binding Factor 1 (SREBF1) Gene With Type 2 Diabetes, Glycemia, and Insulin Resistance , 2008, Diabetes.

[22]  M. McCarthy,et al.  Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes , 2008, Nature Genetics.

[23]  Kai Wang,et al.  Pathway-based approaches for analysis of genomewide association studies. , 2007, American journal of human genetics.

[24]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Wolfgang Rathmann,et al.  Global prevalence of diabetes: estimates for the year 2000 and projections for 2030. , 2004, Diabetes care.

[26]  S. Wild,et al.  Global prevalence of diabetes: estimates for the year 2000 and projections for 2030. , 2004, Diabetes care.

[27]  E. Petretto Genetic regulation of gene expression , 2013 .

[28]  Peter Donnelly,et al.  A new multipoint method for genome-wide association studies via imputation of genotypes : Supplementary Methods , 2007 .