EPS: an empirical Bayes approach to integrating pleiotropy and tissue-specific information for prioritizing risk genes

MOTIVATION Researchers worldwide have generated a huge volume of genomic data, including thousands of genome-wide association studies (GWAS) and massive amounts of gene expression data from different tissues. How to perform a joint analysis of these data to gain new biological insights has become a critical step in understanding the etiology of complex diseases. Due to the polygenic architecture of complex diseases, the identification of risk genes remains challenging. Motivated by the shared risk genes found in complex diseases and tissue-specific gene expression patterns, we propose as an Empirical Bayes approach to integrating Pleiotropy and Tissue-Specific information (EPS) for prioritizing risk genes. RESULTS As demonstrated by extensive simulation studies, EPS greatly improves the power of identification for disease-risk genes. EPS enables rigorous hypothesis testing of pleiotropy and tissue-specific risk gene expression patterns. All of the model parameters can be adaptively estimated from the developed expectation-maximization (EM) algorithm. We applied EPS to the bipolar disorder and schizophrenia GWAS from the Psychiatric Genomics Consortium, along with the gene expression data for multiple tissues from the Genotype-Tissue Expression project. The results of the real data analysis demonstrate many advantages of EPS. AVAILABILITY AND IMPLEMENTATION The EPS software is available on https://sites.google.com/site/liujin810822 CONTACT: eeyang@hkbu.edu.hk SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Yoav Benjamini,et al.  Microarrays, Empirical Bayes and the Two-Groups Model. Comment. , 2008 .

[2]  Isaac Dialsingh,et al.  Large-scale inference: empirical Bayes methods for estimation, testing, and prediction , 2012 .

[3]  R. Tibshirani,et al.  Statistical Applications in Genetics and Molecular Biology Pre-validation and inference in microarrays , 2011 .

[4]  Subhajyoti De,et al.  Common variants near MC4R are associated with fat mass, weight and risk of obesity , 2008, Nature Genetics.

[5]  E. Dermitzakis,et al.  Tissue-Specific Effects of Genetic and Epigenetic Variation on Gene Regulation and Splicing , 2015, PLoS genetics.

[6]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[7]  Jianqing Fan,et al.  High Dimensional Classification Using Features Annealed Independence Rules. , 2007, Annals of statistics.

[8]  Wenjie Chen,et al.  GRASP v2.0: an update on the Genome-Wide Repository of Associations between SNPs and phenotypes , 2014, Nucleic Acids Res..

[9]  W. Willett,et al.  Multiple loci identified in a genome-wide association study of prostate cancer , 2008, Nature Genetics.

[10]  M. Daly,et al.  Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis , 2013, The Lancet.

[11]  Melissa P. DelBello,et al.  MRI Analysis of the Cerebellum in Bipolar Disorder: A Pilot Study , 1999, Neuropsychopharmacology.

[12]  Qian Wang,et al.  Pervasive pleiotropy between psychiatric disorders and immune disorders revealed by integrative analysis of multiple GWAS , 2015, Human Genetics.

[13]  M. Stephens,et al.  A Statistical Framework for Joint eQTL Analysis in Multiple Tissues , 2012, PLoS genetics.

[14]  C. Spencer,et al.  Biological Insights From 108 Schizophrenia-Associated Genetic Loci , 2014, Nature.

[15]  Qian Wang,et al.  Implications of pleiotropy: challenges and opportunities for mining Big Data in biomedicine , 2015, Front. Genet..

[16]  Adan Valladares-Salgado,et al.  Cross-tissue and tissue-specific eQTLs: partitioning the heritability of a complex trait. , 2014, American journal of human genetics.

[17]  M. Ritchie,et al.  Methods of integrating data to uncover genotype–phenotype interactions , 2015, Nature Reviews Genetics.

[18]  S. Purcell,et al.  Pleiotropy in complex traits: challenges and strategies , 2013, Nature Reviews Genetics.

[19]  P. Visscher,et al.  A versatile gene-based test for genome-wide association studies. , 2010, American journal of human genetics.

[20]  W. Maier,et al.  Schizophrenia and bipolar disorder: differences and overlaps , 2006, Current opinion in psychiatry.

[21]  Oliver Sieber,et al.  A genome-wide association scan of tag SNPs identifies a susceptibility variant for colorectal cancer at 8q24.21 , 2007, Nature Genetics.

[22]  Helen M. Moore Acquisition of normal tissues for the GTEx program. , 2013, Biopreservation and biobanking.

[23]  Matti Pirinen,et al.  A genome-wide association study identifies new psoriasis susceptibility loci and an interaction between HLA-C and ERAP1 , 2010, Nature Genetics.

[24]  Jiang Qian,et al.  TiGER: A database for tissue-specific gene expression and regulation , 2008, BMC Bioinformatics.

[25]  Yi Wang,et al.  Genome-wide association study identifies a susceptibility locus for schizophrenia in Han Chinese at 11p11.2 , 2011, Nature Genetics.

[26]  K. Tessner,et al.  Stress and the hypothalamic pituitary adrenal axis in the developmental course of schizophrenia. , 2008, Annual review of clinical psychology.

[27]  Daniel Shriner,et al.  Moving toward System Genetics through Multiple Trait Analysis in Genome-Wide Association Studies , 2011, Front. Gene..

[28]  Bradley Efron,et al.  Large-scale inference , 2010 .

[29]  Yi Wang,et al.  Genome-wide association study identifies a susceptibility locus for schizophrenia in Han Chinese at 11 p 11 . 2 , 2011 .

[30]  Hongyu Zhao,et al.  GPA: A Statistical Approach to Prioritizing GWAS Results by Integrating Pleiotropy and Annotation , 2014, PLoS genetics.

[31]  Anders M. Dale,et al.  Covariate-modulated local false discovery rate for genome-wide association studies , 2014, Bioinform..

[32]  Frank W. Stearns One Hundred Years of Pleiotropy: A Retrospective , 2010, Genetics.

[33]  Ipek Oguz,et al.  Brain Abnormalities in Bipolar Disorder Detected by Quantitative T1ρ Mapping , 2014, Molecular Psychiatry.

[34]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[35]  M. McCarthy,et al.  Improved detection of common variants associated with schizophrenia by leveraging pleiotropy with cardiovascular-disease risk factors. , 2013, American journal of human genetics.

[36]  A. Lusis,et al.  Systems genetics approaches to understand complex traits , 2013, Nature Reviews Genetics.

[37]  M. Daly,et al.  Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis , 2013, The Lancet.

[38]  Tariq Ahmad,et al.  Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci , 2010, Nature Genetics.

[39]  Armin Schwartzman,et al.  Empirical null and false discovery rate inference for exponential families , 2008, 0901.4007.

[40]  P. Sham,et al.  The future of association studies: gene-based analysis and replication. , 2004, American journal of human genetics.

[41]  Jianxin Shi,et al.  Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs , 2013, Nature Genetics.

[42]  Donghyung Lee,et al.  JEPEG: a summary statistics based tool for gene-level joint testing of functional variants , 2014, Bioinform..

[43]  S. MacGregor,et al.  VEGAS2: Software for More Flexible Gene-Based Testing , 2014, Twin Research and Human Genetics.

[44]  Y Wang,et al.  Genetic pleiotropy between multiple sclerosis and schizophrenia but not bipolar disorder: differential involvement of immune-related gene loci , 2014, Molecular Psychiatry.

[45]  Johnny S. H. Kwan,et al.  GATES: a rapid and powerful gene-based association test using extended Simes procedure. , 2011, American journal of human genetics.

[46]  P. Visscher,et al.  Five years of GWAS discovery. , 2012, American journal of human genetics.

[47]  R. Tibshirani,et al.  Penalized classification using Fisher's linear discriminant , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[48]  D. Schutter,et al.  The role of the cerebellum in the pathophysiology and treatment of neuropsychiatric disorders: A review , 2008, Brain Research Reviews.

[49]  Stephen M Strakowski,et al.  MRI analysis of cerebellar vermal abnormalities in bipolar disorder. , 2005, The American journal of psychiatry.

[50]  P. Visscher,et al.  Common polygenic variation contributes to risk of schizophrenia and bipolar disorder , 2009, Nature.