Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies

Genome-wide association studies have become a popular strategy to find associations of genes to traits of interest. Despite the high-resolution available today to carry out genotyping studies, the success of its application in real studies has been limited by the testing strategy used. As an alternative to brute force solutions involving the use of very large cohorts, we propose the use of the Gene Set Analysis (GSA), a different analysis strategy based on testing the association of modules of functionally related genes. We show here how the Gene Set-based Analysis of Polymorphisms (GeSBAP), which is a simple implementation of the GSA strategy for the analysis of genome-wide association studies, provides a significant increase in the power testing for this type of studies. GeSBAP is freely available at http://bioinfo.cipf.es/gesbap/

[1]  Susumu Goto,et al.  The KEGG resource for deciphering the genome , 2004, Nucleic Acids Res..

[2]  Kai Wang,et al.  Pathway-based approaches for analysis of genomewide association studies. , 2007, American journal of human genetics.

[3]  N. Katsanis,et al.  Human genetics and disease: Beyond Mendel: an evolving view of human genetic disease transmission , 2002, Nature Reviews Genetics.

[4]  Marit Holden,et al.  GSEA-SNP: applying gene set enrichment analysis to SNP data from genome-wide association studies , 2008, Bioinform..

[5]  N. Schork,et al.  Pathway analysis of seven common diseases assessed by genome-wide association. , 2008, Genomics.

[6]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[7]  M. McCarthy,et al.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges , 2008, Nature Reviews Genetics.

[8]  Joaquín Dopazo,et al.  Formulating and testing hypotheses in functional genomics , 2009, Artif. Intell. Medicine.

[9]  K. Hemminki,et al.  Association of genetic variants in the Rho guanine nucleotide exchange factor AKAP 13 with familial breast cancer , 2006 .

[10]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[11]  J. Dopazo Functional interpretation of microarray experiments. , 2006, Omics : a journal of integrative biology.

[12]  Joaquín Dopazo,et al.  From genes to functional classes in the study of biological systems , 2007, BMC Bioinformatics.

[13]  Peter Bühlmann,et al.  Analyzing gene expression data in terms of gene sets: methodological issues , 2007, Bioinform..

[14]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[15]  Suzanne Schubbert,et al.  Hyperactive Ras in developmental disorders and cancer , 2007, Nature Reviews Cancer.

[16]  Alfons Meindl,et al.  Association of genetic variants in the Rho guanine nucleotide exchange factor AKAP13 with familial breast cancer. , 2006, Carcinogenesis.

[17]  Joaquín Dopazo,et al.  Babelomics: advanced functional profiling of transcriptomics, proteomics and genomics experiments , 2008, Nucleic Acids Res..

[18]  Joaquín Dopazo,et al.  Discovering molecular functions significantly related to phenotypes by combining gene expression data and biological information , 2005, Bioinform..

[19]  Hong Wang,et al.  Prioritizing risk pathways: a novel association approach to searching for disease pathways fusing SNPs and pathways , 2009, Bioinform..

[20]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[21]  Jason H. Moore,et al.  Pathways-based analyses of whole-genome association study data in bipolar disorder reveal genes mediating ion channel activity and synaptic neurotransmission , 2009, Human Genetics.

[22]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[23]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[24]  W. Willett,et al.  A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer , 2007, Nature Genetics.

[25]  Leonid Kruglyak,et al.  The road to genome-wide association studies , 2008, Nature Reviews Genetics.

[26]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[27]  Christian Gieger,et al.  Loci influencing lipid levels and coronary heart disease risk in 16 European population cohorts , 2009, Nature Genetics.