Erratum to: ‘CompGO: an R package for comparing and visualizing Gene Ontology enrichment differences between DNA binding experiments’

Background: Gene ontology (GO) enrichment is commonly used for inferring biological meaning from systems biology experiments. However, determining differential GO and pathway enrichment between DNA-binding experiments or using the GO structure to classify experiments has received little attention. Results: Herein, we present a bioinformatics tool, CompGO, for identifying Differentially Enriched Gene Ontologies, called DiEGOs, and pathways, through the use of a z-score derivation of log odds ratios, and visualizing these differences at GO and pathway level. Through public experimental data focused on the cardiac transcription factor NKX2-5, we illustrate the problems associated with comparing GO enrichments between experiments using a simple overlap approach. Conclusions: We have developed an R/Bioconductor package, CompGO, which implements a new statistic normally used in epidemiological studies for performing comparative GO analyses and visualizing comparisons from . BED data containing genomic coordinates as well as gene lists as inputs. We justify the statistic through inclusion of experimental data and compare to the commonly used overlap method. CompGO is freely available as a R/Bioconductor package enabling easy integration into existing pipelines and is available at: http://www.bioconductor.org/packages/release/bioc/ html/CompGO.html packages/release/bioc/html/CompGO.html

[1]  Ashley J. Waardenberg,et al.  NKX2-5 mutations causative for congenital heart disease retain functionality and are directed to hundreds of targets , 2015, eLife.

[2]  Robert Gentleman,et al.  rtracklayer: an R package for interfacing with genome browsers , 2009, Bioinform..

[3]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[4]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[5]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[6]  Ni Li,et al.  Gene Ontology Annotations and Resources , 2012, Nucleic Acids Res..

[7]  Milena B. Furtado,et al.  Functional Characterization of a Novel Mutation in NKX2-5 Associated With Congenital Heart Disease and Adult-Onset Cardiomyopathy , 2013, Circulation. Cardiovascular genetics.

[8]  Peter C. Hollenhorst,et al.  Expression profiles frame the promoter specificity dilemma of the ETS family of transcription factors. , 2004, Nucleic acids research.

[9]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Ashley J. Waardenberg,et al.  CompGO: an R package for comparing and visualizing Gene Ontology enrichment differences between DNA binding experiments , 2015, BMC Bioinformatics.

[11]  M. Levandowsky,et al.  Distance between Sets , 1971, Nature.

[12]  D. Adelson,et al.  Comparative GO: A Web Application for Comparative Gene Ontology and Gene Ontology-Based Gene Selection in Bacteria , 2013, PloS one.

[13]  Bing Zhang,et al.  Functional annotation of differentially regulated gene set using WebGestalt: a gene set predictive of response to ipilimumab in tumor biopsies. , 2014, Methods in molecular biology.

[14]  Juliea Morris,et al.  Statistics in Medicine: Calculating confidence intervals for relative risks (odds ratios) and standardised ratios and rates , 1988, British medical journal.

[15]  S P Azen,et al.  OBTAINING CONFIDENCE INTERVALS FOR THE RISK RATIO IN COHORT STUDIES , 1978 .

[16]  Cory Y. McLean,et al.  GREAT improves functional interpretation of cis-regulatory regions , 2010, Nature Biotechnology.

[17]  Gary D Bader,et al.  Enrichment Map: A Network-Based Method for Gene-Set Enrichment Visualization and Interpretation , 2010, PloS one.

[18]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[19]  Aibin He,et al.  Co-occupancy by multiple cardiac transcription factors identifies transcriptional enhancers active in heart , 2011, Proceedings of the National Academy of Sciences.

[20]  J. Seidman,et al.  Mutations in the cardiac transcription factor NKX2.5 affect diverse cardiac developmental pathways. , 1999, The Journal of clinical investigation.

[21]  BMC Bioinformatics , 2005 .

[22]  Cristóbal Fresno,et al.  RDAVIDWebService: a versatile R interface to DAVID , 2013, Bioinform..

[23]  Qi Zheng,et al.  GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis , 2008, Nucleic Acids Res..