GiANT: gene set uncertainty in enrichment analysis

UNLABELLED Over the past years growing knowledge about biological processes and pathways revealed complex interaction networks involving many genes. In order to understand these networks, analysis of differential expression has continuously moved from single genes towards the study of gene sets. Various approaches for the assessment of gene sets have been developed in the context of gene set analysis (GSA). These approaches are bridging the gap between raw measurements and semantically meaningful terms.We present a novel approach for assessing uncertainty in the definition of gene sets. This is an essential step when new gene sets are constructed from domain knowledge or given gene sets are suspected to be affected by uncertainty. Quantification of uncertainty is implemented in the R-package GiANT. We also included widely used GSA methods, embedded in a generic framework that can readily be extended by custom methods. The package provides an easy to use front end and allows for fast parallelization. AVAILABILITY AND IMPLEMENTATION The package GiANT is available on CRAN. CONTACTS hans.kestler@leibniz-fli.de or hans.kestler@uni-ulm.de.

[1]  May D. Wang,et al.  GoMiner: a resource for biological interpretation of genomic and proteomic data , 2003, Genome Biology.

[2]  E. Birney,et al.  Reactome: a knowledgebase of biological pathways , 2004, Nucleic Acids Research.

[3]  I. Nookaew,et al.  Enriching the gene set analysis of genome-wide data by incorporating directionality of gene expression and combining statistical hypotheses and methods , 2013, Nucleic acids research.

[4]  David Bryant,et al.  DAVID Bioinformatics Resources: expanded annotation database and novel algorithms to better extract biology from large gene lists , 2007, Nucleic Acids Res..

[5]  Peter Bühlmann,et al.  Analyzing gene expression data in terms of gene sets: methodological issues , 2007, Bioinform..

[6]  Alexander R. Pico,et al.  WikiPathways: Pathway Editing for the People , 2008, PLoS biology.

[7]  Pierre L'Ecuyer,et al.  An Object-Oriented Random-Number Package with Many Long Streams and Substreams , 2002, Oper. Res..

[8]  H. Kestler,et al.  Disruption of Trp53 in livers of mice induces formation of carcinomas with bilineal differentiation. , 2012, Gastroenterology.

[9]  Florentino Fernández Riverola,et al.  WhichGenes: a web-based tool for gathering, building, storing and exporting gene sets with application in gene set enrichment analysis , 2009, Nucleic Acids Res..

[10]  Henryk Maciejewski,et al.  Gene set analysis methods: statistical models and methodological differences , 2013, Briefings Bioinform..

[11]  Korbinian Strimmer,et al.  BMC Bioinformatics BioMed Central Methodology article A general modular framework for gene set enrichment analysis , 2009 .

[12]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[13]  Sam Griffiths-Jones,et al.  Bias in microRNA functional enrichment analysis , 2015, Bioinform..

[14]  Michael Schneider,et al.  Retraction for Dixson et al., Identification of gene ontologies linked to prefrontal–hippocampal functional coupling in the human brain , 2014, Proceedings of the National Academy of Sciences.

[15]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[16]  P. Pavlidis,et al.  Pitfalls in the application of gene-set analysis to genetics studies. , 2014, Trends in genetics : TIG.

[17]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[18]  Jürgen Sühnel,et al.  AgeFactDB—the JenAge Ageing Factor Database—towards data integration in ageing research , 2013, Nucleic Acids Res..