Analysing biological pathways in genome-wide association studies

Genome-wide association (GWA) studies have typically focused on the analysis of single markers, which often lacks the power to uncover the relatively small effect sizes conferred by most genetic variants. Recently, pathway-based approaches have been developed, which use prior biological knowledge on gene function to facilitate more powerful analysis of GWA study data sets. These approaches typically examine whether a group of related genes in the same functional pathway are jointly associated with a trait of interest. Here we review the development of pathway-based approaches for GWA studies, discuss their practical use and caveats, and suggest that pathway-based approaches may also be useful for future GWA studies with sequencing data.

[1]  M. Neurath,et al.  Antibodies to interleukin 12 abrogate established experimental colitis in mice , 1995, The Journal of experimental medicine.

[2]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[3]  Pat Levitt,et al.  Molecular Characterization of Schizophrenia Viewed by Microarray Analysis of Gene Expression in Prefrontal Cortex , 2000, Neuron.

[4]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[5]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[6]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[7]  J. Ott,et al.  Complement Factor H Polymorphism in Age-Related Macular Degeneration , 2005, Science.

[8]  A. Edwards,et al.  Complement Factor H Polymorphism and Age-Related Macular Degeneration , 2005, Science.

[9]  J. Gilbert,et al.  Complement Factor H Variant Increases the Risk of Age-Related Macular Degeneration , 2005, Science.

[10]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[11]  R. Tibshirani,et al.  On testing the significance of sets of genes , 2006, math/0610667.

[12]  Larry Wasserman,et al.  Using linkage genome scans to improve power of association in genome scans. , 2006, American journal of human genetics.

[13]  Michael A. Black,et al.  Microarray-based gene set analysis: a comparison of current methods , 2008, BMC Bioinformatics.

[14]  Kai Wang,et al.  Pathway-based approaches for analysis of genomewide association studies. , 2007, American journal of human genetics.

[15]  Hongyu Zhao,et al.  Evidence for association between multiple complement pathway genes and AMD , 2007, Genetic epidemiology.

[16]  Jill P. Mesirov,et al.  GSEA-P: a desktop application for Gene Set Enrichment Analysis , 2007, Bioinform..

[17]  Tao Wang,et al.  Improved power by use of a weighted score test for linkage disequilibrium mapping. , 2007, American journal of human genetics.

[18]  D. Maraganore,et al.  A Genomic Pathway Approach to a Complex Disease: Axon Guidance and Parkinson Disease , 2007, PLoS genetics.

[19]  M. Neurath IL-23: a master regulator in Crohn disease , 2007, Nature Medicine.

[20]  Zhen Jiang,et al.  Bioconductor Project Bioconductor Project Working Papers Year Paper Extensions to Gene Set Enrichment , 2013 .

[21]  David V Conti,et al.  Testing association between disease and multiple SNPs in a candidate gene , 2007, Genetic epidemiology.

[22]  James W Baurley,et al.  Hierarchical Bayes prioritization of marker associations from a genome‐wide association scan for further investigation , 2007, Genetic epidemiology.

[23]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[24]  Peter Bühlmann,et al.  Analyzing gene expression data in terms of gene sets: methodological issues , 2007, Bioinform..

[25]  Qi Liu,et al.  Improving gene set analysis of microarray data by SAM-GS , 2007, BMC Bioinformatics.

[26]  C. Pang,et al.  Multiple gene polymorphisms in the complement factor h gene are associated with exudative age-related macular degeneration in chinese. , 2008, Investigative ophthalmology & visual science.

[27]  Korbinian Strimmer,et al.  A general modular framework for gene set enrichment analysis , 2009, BMC Bioinformatics.

[28]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[29]  Francis S Collins,et al.  A HapMap harvest of insights into the genetics of common disease. , 2008, The Journal of clinical investigation.

[30]  D. Maraganore,et al.  Beyond Parkinson Disease: Amyotrophic Lateral Sclerosis and the Axon Guidance Pathway , 2008, PloS one.

[31]  Jean-Pierre A. Kocher,et al.  GLOSSI: a method to assess the association of genetic loci-sets with complex diseases , 2009, BMC Bioinformatics.

[32]  Wei Pan,et al.  Network-based model weighting to detect multiple loci influencing complex diseases , 2008, Human Genetics.

[33]  Kai Wang,et al.  A principal components regression approach to multilocus genetic association studies , 2008, Genetic epidemiology.

[34]  David M. Evans,et al.  Genome-wide association analysis identifies 20 loci that influence adult height , 2008, Nature Genetics.

[35]  A. Day,et al.  Local and systemic interleukin‐18 and interleukin‐18‐binding protein in children with inflammatory bowel disease , 2008, Inflammatory bowel diseases.

[36]  N. Schork,et al.  Pathway analysis of seven common diseases assessed by genome-wide association. , 2008, Genomics.

[37]  Marit Holden,et al.  GSEA-SNP: applying gene set enrichment analysis to SNP data from genome-wide association studies , 2008, Bioinform..

[38]  Chen Dong,et al.  TH17 cells in development: an updated view of their molecular identity and genetic programming , 2008, Nature Reviews Immunology.

[39]  D. Chasman On the utility of gene set methods in genomewide association studies of quantitative traits , 2008, Genetic epidemiology.

[40]  T. Frayling,et al.  A genetic link between type 2 diabetes and prostate cancer , 2008, Diabetologia.

[41]  Judy H. Cho,et al.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease , 2008, Nature Genetics.

[42]  E. G. de la Concha,et al.  Association of the STAT4 gene with increased susceptibility for some immune-mediated diseases. , 2008, Arthritis and rheumatism.

[43]  Emmanuel Barillot,et al.  BiNoM: a Cytoscape plugin for manipulating and analyzing biological networks , 2008, Bioinform..

[44]  Xihong Lin,et al.  A powerful and flexible multilocus association test for quantitative traits. , 2008, American journal of human genetics.

[45]  Jason H. Moore,et al.  Pathways-based analyses of whole-genome association study data in bipolar disorder reveal genes mediating ion channel activity and synaptic neurotransmission , 2009, Human Genetics.

[46]  A. Zhernakova,et al.  Genetic analysis of innate immunity in Crohn's disease and ulcerative colitis identifies two susceptibility loci harboring CARD9 and IL18RAP. , 2008, American journal of human genetics.

[47]  Mark I. McCarthy,et al.  Concept, Design and Implementation of a Cardiovascular Gene-Centric 50 K SNP Array for Large-Scale Genomic Association Studies , 2008, PloS one.

[48]  G. Abecasis,et al.  Genotype imputation. , 2009, Annual review of genomics and human genetics.

[49]  Pieter B. T. Neerincx,et al.  Methods for interpreting lists of affected genes obtained in a DNA microarray experiment , 2009, BMC proceedings.

[50]  Hong Wang,et al.  Prioritizing risk pathways: a novel association approach to searching for disease pathways fusing SNPs and pathways , 2009, Bioinform..

[51]  Elizabeth A. Heron,et al.  The SNP ratio test: pathway analysis of genome-wide association datasets , 2009, Bioinform..

[52]  S. Kondo,et al.  Strong Evidence of a Combination Polymorphism of the Tyrosine Kinase 2 Gene and the Signal Transducer and Activator of Transcription 3 Gene as a DNA-Based Biomarker for Susceptibility to Crohn’s Disease in the Japanese Population , 2009, Journal of Clinical Immunology.

[53]  Joseph T. Glessner,et al.  From Disease Association to Risk Assessment: An Optimistic View from Genome-Wide Association Studies on Type 1 Diabetes , 2009, PLoS genetics.

[54]  Joaquín Dopazo,et al.  Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies , 2009, Nucleic Acids Res..

[55]  H. Bickeböller,et al.  Integration of a Priori Gene Set Information into Genome-wide Association Studies , 2022 .

[56]  Judy H. Cho,et al.  Interleukin-23/Th17 pathways and inflammatory bowel disease. , 2009, Inflammatory bowel diseases.

[57]  David C. Wilson,et al.  Diverse genome-wide association studies associate the IL12/IL23 pathway with Crohn Disease. , 2009, American journal of human genetics.

[58]  Kai Wang,et al.  ATOM: a powerful gene-based association test by combining optimally weighted markers , 2009, Bioinform..

[59]  Hongyu Zhao,et al.  A pathway analysis applied to Genetic Analysis Workshop 16 genome-wide rheumatoid arthritis data , 2009, BMC proceedings.

[60]  Hans C van Houwelingen,et al.  Integration of gene ontology pathways with North American Rheumatoid Arthritis Consortium genome-wide association data via linear modeling , 2009, BMC proceedings.

[61]  P. Rosenberg,et al.  Pathway analysis by adaptive combination of P‐values , 2009, Genetic epidemiology.

[62]  M. McCarthy,et al.  Interrogating Type 2 Diabetes Genome-Wide Association Data Using a Biological Pathway-Based Approach , 2009, Diabetes.

[63]  H. Yoshida,et al.  Interleukin 27: a double‐edged sword for offense and defense , 2009, Journal of leukocyte biology.

[64]  Y. Pawitan,et al.  Strategies and issues in the detection of pathway enrichment in genome-wide association studies , 2009, Human Genetics.

[65]  Rafael A Irizarry,et al.  Gene set enrichment analysis made simple , 2009, Statistical methods in medical research.

[66]  Yuan Chen,et al.  A new permutation strategy of pathway-based approach for genome-wide association study , 2009, BMC Bioinformatics.

[67]  Frank Emmert-Streib,et al.  Unite and conquer: univariate and multivariate approaches for finding differentially expressed gene sets , 2009, Bioinform..

[68]  P. Matthews,et al.  Pathway and network-based analysis of genome-wide association studies in multiple sclerosis , 2009, Human molecular genetics.

[69]  Tero Aittokallio,et al.  Genoscape: a Cytoscape plug-in to automate the retrieval and integration of gene expression data and molecular networks , 2009, Bioinform..

[70]  H. Bickeböller,et al.  Inclusion of a priori information in genome‐wide association analysis , 2009, Genetic epidemiology.

[71]  Judy H. Cho,et al.  IL-23 and autoimmunity: new insights into the pathogenesis of inflammatory bowel disease. , 2009, Annual review of medicine.

[72]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[73]  David V Conti,et al.  Use of pathway information in molecular epidemiology , 2009, Human Genomics.

[74]  C. Wijmenga,et al.  Using genome‐wide pathway analysis to unravel the etiology of complex diseases , 2009, Genetic epidemiology.

[75]  A. Paterson,et al.  Pathway-based analysis of a genome-wide case-control association study of rheumatoid arthritis. , 2009, BMC proceedings.

[76]  Gregory R. Grant,et al.  A flexible two-stage procedure for identifying gene sets that are differentially expressed , 2009, Bioinform..

[77]  Manuel A. R. Ferreira,et al.  Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder. , 2009, American journal of human genetics.

[78]  C. Hoggart,et al.  Pathway Analysis of GWAS Provides New Insights into Genetic Susceptibility to 3 Inflammatory Diseases , 2009, PloS one.

[79]  Judy H. Cho,et al.  Pathway analysis comparison using Crohn's disease genome wide association studies , 2010, BMC Medical Genomics.

[80]  Trevor J. Hastie,et al.  Genome-wide association analysis by lasso penalized logistic regression , 2009, Bioinform..

[81]  Robert T. Schultz,et al.  Common genetic variants on 5p14.1 associate with autism spectrum disorders , 2009, Nature.

[82]  Airat Bekmetjev,et al.  Comparing gene set analysis methods on single-nucleotide polymorphism data from Genetic Analysis Workshop 16 , 2009, BMC proceedings.

[83]  E. Schadt Molecular networks as sensors and drivers of common human diseases , 2009, Nature.

[84]  Christian Gieger,et al.  Six new loci associated with body mass index highlight a neuronal influence on body weight regulation , 2009, Nature Genetics.

[85]  Peter Kraft,et al.  Complex diseases, complex genes: keeping pathways on the right track. , 2009, Epidemiology.

[86]  J. Marchini,et al.  Genotype imputation for genome-wide association studies , 2010, Nature Reviews Genetics.

[87]  Shamil R Sunyaev,et al.  Pooled association tests for rare variants in exon-resequencing studies. , 2010, American journal of human genetics.

[88]  Suhua Chang,et al.  i-GSEA4GWAS: a web server for identification of pathways/gene sets associated with traits by applying an improved gene set enrichment analysis to genome-wide association study , 2010, Nucleic Acids Res..

[89]  David V Conti,et al.  Discovery of complex pathways from observational data , 2010, Statistics in medicine.

[90]  Xia Yang,et al.  Integrating pathway analysis and genetics of gene expression for genome-wide association studies. , 2010, American journal of human genetics.

[91]  Hua Zhou,et al.  Association screening of common and rare genetic variants by penalized regression , 2010, Bioinform..

[92]  Ayellet V. Segrè,et al.  Hundreds of variants clustered in genomic loci and biological pathways affect human height , 2010, Nature.

[93]  Qi Zhou,et al.  Pathway-based genome-wide association analysis identified the importance of EphrinA-EphR pathway for femoral neck bone geometry. , 2010, Bone.

[94]  M. Xiong,et al.  Genome-wide gene and pathway analysis , 2010, European Journal of Human Genetics.

[95]  Momiao Xiong,et al.  Gene and pathway-based second-wave analysis of genome-wide association studies , 2010, European Journal of Human Genetics.

[96]  Edward Giovannucci,et al.  Diabetes and Cancer , 2010, Diabetes Care.

[97]  Lin S. Chen,et al.  Insights into colon cancer etiology via a regularized approach to gene set analysis of GWAS data. , 2010, American journal of human genetics.

[98]  Tie-Lin Yang,et al.  Pathway-Based Genome-Wide Association Analysis Identified the Importance of Regulation-of-Autophagy Pathway for Ultradistal Radius BMD , 2010, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.

[99]  P. Visscher,et al.  A versatile gene-based test for genome-wide association studies. , 2010, American journal of human genetics.

[100]  X. Wen,et al.  Gene, region and pathway level analyses in whole‐genome studies , 2009, Genetic epidemiology.

[101]  B. Fridley,et al.  Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods , 2010, PloS one.

[102]  K. Lange,et al.  Prioritizing GWAS results: A review of statistical methods and recommendations for their application. , 2010, American journal of human genetics.

[103]  Constantin Polychronakos,et al.  Comparative genetic analysis of inflammatory bowel disease and type 1 diabetes implicates multiple loci with opposite effects. , 2010, Human molecular genetics.

[104]  B. Müller-Myhsok,et al.  Evidence for STAT4 as a Common Autoimmune Gene: rs7574865 Is Associated with Colonic Crohn's Disease and Early Disease Onset , 2010, PloS one.

[105]  Simon Heath,et al.  Implication of the immune system in Alzheimer's disease: evidence from genome-wide pathway analysis. , 2010, Journal of Alzheimer's disease : JAD.

[106]  Nicole Soranzo,et al.  An Integration of Genome-Wide Association Study and Gene Expression Profiling to Prioritize the Discovery of Novel Susceptibility Loci for Osteoporosis-Related Traits , 2010, PLoS genetics.

[107]  Deanne M. Taylor,et al.  Powerful SNP-set analysis for case-control genome-wide association studies. , 2010, American journal of human genetics.

[108]  Raymond L. White,et al.  Human variation in alcohol response is influenced by variation in neuronal signaling genes. , 2010, Alcoholism, clinical and experimental research.

[109]  Wei Pan,et al.  A Data-Adaptive Sum Test for Disease Association with Multiple Common or Rare Variants , 2010, Human Heredity.

[110]  Sangsoo Kim,et al.  GSA-SNP: a general approach for gene set analysis of polymorphisms , 2010, Nucleic Acids Res..

[111]  M. Gill,et al.  Molecular pathways involved in neuronal cell adhesion and membrane scaffolding contribute to schizophrenia and bipolar disorder susceptibility , 2011, Molecular Psychiatry.

[112]  Holger Schwender,et al.  Testing SNPs and sets of SNPs for importance in association studies. , 2011, Biostatistics.

[113]  Dariusz Plewczynski,et al.  Protein-protein interaction and pathway databases, a graphical review , 2011, Briefings Bioinform..