Pathway-based analysis using reduced gene subsets in genome-wide association studies

BackgroundSingle Nucleotide Polymorphism (SNP) analysis only captures a small proportion of associated genetic variants in Genome-Wide Association Studies (GWAS) partly due to small marginal effects. Pathway level analysis incorporating prior biological information offers another way to analyze GWAS's of complex diseases, and promises to reveal the mechanisms leading to complex diseases. Biologically defined pathways are typically comprised of numerous genes. If only a subset of genes in the pathways is associated with disease then a joint analysis including all individual genes would result in a loss of power. To address this issue, we propose a pathway-based method that allows us to test for joint effects by using a pre-selected gene subset. In the proposed approach, each gene is considered as the basic unit, which reduces the number of genetic variants considered and hence reduces the degrees of freedom in the joint analysis. The proposed approach also can be used to investigate the joint effect of several genes in a candidate gene study.ResultsWe applied this new method to a published GWAS of psoriasis and identified 6 biologically plausible pathways, after adjustment for multiple testing. The pathways identified in our analysis overlap with those reported in previous studies. Further, using simulations across a range of gene numbers and effect sizes, we demonstrate that the proposed approach enjoys higher power than several other approaches to detect associated pathways.ConclusionsThe proposed method could increase the power to discover susceptibility pathways and to identify associated genes using GWAS. In our analysis of genome-wide psoriasis data, we have identified a number of relevant pathways for psoriasis.

[1]  Elizabeth A. Heron,et al.  The SNP ratio test: pathway analysis of genome-wide association datasets , 2009, Bioinform..

[2]  G. Del Prete,et al.  The concept of type-1 and type-2 helper T cells and their cytokines in humans. , 1998, International reviews of immunology.

[3]  Kai Wang,et al.  A principal components regression approach to multilocus genetic association studies , 2008, Genetic epidemiology.

[4]  Pui-Yan Kwok,et al.  Genomewide Scan Reveals Association of Psoriasis with IL-23 and NF-κB Pathways , 2008, Nature Genetics.

[5]  Laura J. Scott,et al.  Comprehensive Association Study of Type 2 Diabetes and Related Quantitative Traits With 222 Candidate Genes , 2008, Diabetes.

[6]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[7]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[8]  P. Rosenberg,et al.  Pathway analysis by adaptive combination of P‐values , 2009, Genetic epidemiology.

[9]  BMC Bioinformatics , 2005 .

[10]  Sunit P. Jariwala,et al.  The role of dendritic cells in the immunopathogenesis of psoriasis , 2007, Archives of Dermatological Research.

[11]  Jelle J. Goeman,et al.  Testing association of a pathway with survival using gene expression data , 2005, Bioinform..

[12]  C. Wijmenga,et al.  Using genome‐wide pathway analysis to unravel the etiology of complex diseases , 2009, Genetic epidemiology.

[13]  Lon R. Cardon,et al.  The complex interplay among factors that influence allelic association , 2004, Nature Reviews Genetics.

[14]  N. Schork,et al.  Generalized genomic distance-based regression methodology for multilocus association analysis. , 2006, American journal of human genetics.

[15]  N. Schork,et al.  Pathway analysis of seven common diseases assessed by genome-wide association. , 2008, Genomics.

[16]  B S Weir,et al.  Truncated product method for combining P‐values , 2002, Genetic epidemiology.

[17]  A. Ma,et al.  Failure to regulate TNF-induced NF-kappaB and cell death responses in A20-deficient mice. , 2000, Science.

[18]  Jonathan D. Licht,et al.  BRCA1 Augments Transcription by the NF-κB Transcription Factor by Binding to the Rel Domain of the p65/RelA Subunit* , 2003, Journal of Biological Chemistry.

[19]  Judy H. Cho,et al.  Comparisons of multi‐marker association methods to detect association between a candidate region and disease , 2010, Genetic epidemiology.

[20]  K. Lange,et al.  Prioritizing GWAS results: A review of statistical methods and recommendations for their application. , 2010, American journal of human genetics.

[21]  M. Xiong,et al.  Genome-wide gene and pathway analysis , 2010, European Journal of Human Genetics.

[22]  M. Eileen Dolan,et al.  A genome-wide approach to identify genetic variants that contribute to etoposide-induced cytotoxicity , 2007, Proceedings of the National Academy of Sciences.

[23]  Hong Wang,et al.  Prioritizing risk pathways: a novel association approach to searching for disease pathways fusing SNPs and pathways , 2009, Bioinform..

[24]  Paul D P Pharoah,et al.  The admixture maximum likelihood test: a novel experiment‐wise test of association between disease and multiple SNPs , 2006, Genetic epidemiology.

[25]  Joachim Selbig,et al.  pcaMethods - a bioconductor package providing PCA methods for incomplete data , 2007, Bioinform..

[26]  Kai Wang,et al.  Pathway-based approaches for analysis of genomewide association studies. , 2007, American journal of human genetics.

[27]  J. Ott,et al.  Mathematical multi-locus approaches to localizing complex human trait genes , 2003, Nature Reviews Genetics.

[28]  F. Nestle,et al.  Characterization of dermal dendritic cells in psoriasis. Autostimulation of T lymphocytes and induction of Th1 type cytokines. , 1994, The Journal of clinical investigation.

[29]  D. Chasman On the utility of gene set methods in genomewide association studies of quantitative traits , 2008, Genetic epidemiology.

[30]  I. Jolliffe Principal Component Analysis , 2002 .

[31]  Chris S. Haley,et al.  Epistasis: too often neglected in complex trait studies? , 2004, Nature Reviews Genetics.

[32]  Kai Wang,et al.  ATOM: a powerful gene-based association test by combining optimally weighted markers , 2009, Bioinform..

[33]  C. Hoggart,et al.  Pathway Analysis of GWAS Provides New Insights into Genetic Susceptibility to 3 Inflammatory Diseases , 2009, PloS one.

[34]  Anbupalam Thalamuthu,et al.  Association tests using kernel‐based measures of multi‐locus genotype similarity between individuals , 2009, Genetic epidemiology.

[35]  X. Wen,et al.  Gene, region and pathway level analyses in whole‐genome studies , 2009, Genetic epidemiology.

[36]  Frank Dudbridge,et al.  Rank truncated product of P‐values, with application to genomewide association scans , 2003, Genetic epidemiology.

[37]  Daniel J Schaid,et al.  Nonparametric tests of association of multiple genes with human disease. , 2005, American journal of human genetics.

[38]  R. Tibshirani,et al.  On testing the significance of sets of genes , 2006, math/0610667.

[39]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[40]  Brad T. Sherman,et al.  DAVID: Database for Annotation, Visualization, and Integrated Discovery , 2003, Genome Biology.

[41]  Frank Dudbridge,et al.  Efficient computation of significance levels for multiple associations in large studies of correlated data, including genomewide association studies. , 2004, American journal of human genetics.

[42]  Kenneth M. Murphy,et al.  Dendritic cell regulation of TH1-TH2 development , 2000, Nature Immunology.

[43]  B. Nickoloff,et al.  The cytokine network in psoriasis. , 1991, Archives of dermatology.