CirGO: an alternative circular way of visualising gene ontology terms

BackgroundPrioritisation of gene ontology terms from differential gene expression analyses in a two-dimensional format remains a challenge with exponentially growing data volumes. Typically, gene ontology terms are represented as tree-maps that enclose all data into defined space. However, large datasets make this type of visualisation appear cluttered and busy, and often not informative as some labels are omitted due space limits, especially when published in two-dimensional (2D) figures.ResultsHere we present an open source CirGO (Circular Gene Ontology) software that visualises non-redundant two-level hierarchically structured ontology terms from gene expression data in a 2D space. Gene ontology terms based on statistical significance were summarised with a semantic similarity algorithm and grouped by hierarchical clustering. This software visualises the most enriched gene ontology terms in an informative, comprehensive and intuitive format that is achieved by organising data from the most relevant to the least, as well as the appropriate use of colours and supporting information. Additionally, CirGO is an easy to use software that supports researchers with little computational background to present their gene ontology data in a publication ready format.ConclusionsOur easy to use open source CirGO Python software package provides biologists with a succinct presentation of terms and functions that are most represented in a specific gene expression data set in a visually appealing 2D format (e.g. for reporting research results in scientific articles). CirGO is freely available at https://github.com/IrinaVKuznetsova/CirGO.git.

[1]  Aleksandra Filipovska,et al.  Hierarchical RNA Processing Is Required for Mitochondrial Ribosome Assembly. , 2016, Cell reports.

[2]  The Gene Ontology Consortium Expansion of the Gene Ontology knowledgebase and resources , 2016, Nucleic Acids Res..

[3]  Aedín C. Culhane,et al.  RamiGO: an R/Bioconductor package providing an AmiGO Visualize interface , 2013, Bioinform..

[4]  May D. Wang,et al.  GoMiner: a resource for biological interpretation of genomic and proteomic data , 2003, Genome Biology.

[5]  Christophe Dessimoz,et al.  The Gene Ontology Handbook , 2017, Methods in Molecular Biology.

[6]  David H. Laidlaw,et al.  Colorgorical: Creating discriminable and preferable color palettes for information visualization , 2017, IEEE Transactions on Visualization and Computer Graphics.

[7]  Waclaw Kusnierczyk,et al.  Taxonomy-based partitioning of the Gene Ontology , 2008, J. Biomed. Informatics.

[8]  David Sánchez,et al.  A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain , 2014, J. Biomed. Informatics.

[9]  Israel Steinfeld,et al.  BMC Bioinformatics BioMed Central , 2008 .

[10]  Alexander D. Diehl,et al.  Access to immunology through the Gene Ontology , 2008, Immunology.

[11]  Sebastian S. Bauer,et al.  Gene-Category Analysis. , 2017, Methods in molecular biology.

[12]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[13]  Juliana Costa-Silva,et al.  RNA-Seq differential expression analysis: An extended review and a software tool , 2017, PloS one.

[14]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[15]  Chris Mungall,et al.  AmiGO: online access to ontology and annotation data , 2008, Bioinform..

[16]  D. Corey,et al.  RNA sequencing: platform selection, experimental design, and data interpretation. , 2012, Nucleic acid therapeutics.

[17]  Matko Bosnjak,et al.  REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms , 2011, PloS one.

[18]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[19]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[20]  Wacław Kunierczyk,et al.  Taxonomy-based partitioning of the Gene Ontology , 2008 .

[21]  Manuel Soler,et al.  Diversity, Loss, and Gain of Malaria Parasites in a Globally Invasive Bird , 2011, PloS one.

[22]  K. Dolinski,et al.  Use and misuse of the gene ontology annotations , 2008, Nature Reviews Genetics.

[23]  Artur Lugmayr,et al.  Cognitive big data: survey and review on big data research and its implications. What is really "new" in big data? , 2017, J. Knowl. Manag..

[24]  I. Goodhead,et al.  Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution , 2008, Nature.

[25]  The Gene Ontology Consortium,et al.  Expansion of the Gene Ontology knowledgebase and resources , 2016, Nucleic Acids Res..

[26]  M. Gerstein,et al.  The Transcriptional Landscape of the Yeast Genome Defined by RNA Sequencing , 2008, Science.

[27]  Ben Shneiderman,et al.  Tree visualization with tree-maps: 2-d space-filling approach , 1992, TOGS.

[28]  Luis Mauricio Bini,et al.  Predicting temporal variation in zooplankton beta diversity is challenging , 2017, PloS one.

[29]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[30]  Phillip W. Lord,et al.  Semantic Similarity in Biomedical Ontologies , 2009, PLoS Comput. Biol..