NeVOmics: An Enrichment Tool for Gene Ontology and Functional Network Analysis and Visualization of Data from OMICs Technologies

The increasing number of OMICs studies demands bioinformatic tools that aid in the analysis of large sets of genes or proteins to understand their roles in the cell and establish functional networks and pathways. In the last decade, over-representation or enrichment tools have played a successful role in the functional analysis of large gene/protein lists, which is evidenced by thousands of publications citing these tools. However, in most cases the results of these analyses are long lists of biological terms associated to proteins that are difficult to digest and interpret. Here we present NeVOmics, Network-based Visualization for Omics, a functional enrichment analysis tool that identifies statistically over-represented biological terms within a given gene/protein set. This tool provides a hypergeometric distribution test to calculate significantly enriched biological terms, and facilitates analysis on cluster distribution and relationship of proteins to processes and pathways. NeVOmics is adapted to use updated information from the two main annotation databases: Gene Ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG). NeVOmics compares favorably to other Gene Ontology and enrichment tools regarding coverage in the identification of biological terms. NeVOmics can also build different network-based graphical representations from the enrichment results, which makes it an integrative tool that greatly facilitates interpretation of results obtained by OMICs approaches. NeVOmics is freely accessible at https://github.com/bioinfproject/bioinfo/.

[1]  Qi Zheng,et al.  GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis , 2008, Nucleic Acids Res..

[2]  R. Kellner,et al.  Proteomics. Concepts and perspectives , 2000, Fresenius' journal of analytical chemistry.

[3]  Hedi Peterson,et al.  g:Profiler—a web server for functional interpretation of gene lists (2016 update) , 2016, Nucleic Acids Res..

[4]  Roland Eils,et al.  circlize implements and enhances circular visualization in R , 2014, Bioinform..

[5]  Bing Zhang,et al.  WebGestalt: an integrated system for exploring gene sets in various biological contexts , 2005, Nucleic Acids Res..

[6]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[7]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[8]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[9]  Andreas Schmidt,et al.  Bioinformatic analysis of proteomics data , 2014, BMC Systems Biology.

[10]  M. O. oude Egbrink,et al.  Exploration of the platelet proteome in patients with early-stage cancer. , 2018, Journal of proteomics.

[11]  Daniel L. Hartl,et al.  GeneMerge - Post-genomic Analysis, Data Mining, and Hypothesis Testing , 2003, Bioinform..

[12]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[13]  Alexander Lex,et al.  UpSetR: an R package for the visualization of intersecting sets and their properties , 2017, bioRxiv.

[14]  Lin Gao,et al.  Biological network analysis: insights into structure and functions. , 2012, Briefings in functional genomics.

[15]  H. Tipney,et al.  An introduction to effective use of enrichment analysis software , 2010, Human Genomics.

[16]  Israel Steinfeld,et al.  BMC Bioinformatics BioMed Central , 2008 .

[17]  Laurie J. Gay,et al.  Contribution of platelets to tumour metastasis , 2011, Nature Reviews Cancer.

[18]  Prudence Mutowo-Meullenet,et al.  The GOA database: Gene Ontology annotation updates for 2015 , 2014, Nucleic Acids Res..

[19]  C. Chapple,et al.  Transcriptome Analysis of Four Arabidopsis thaliana Mediator Tail Mutants Reveals Overlapping and Unique Functions in Gene Regulation , 2018, G3: Genes, Genomes, Genetics.

[20]  Bensu Karahalil,et al.  Overview of Systems Biology and Omics Technologies. , 2016, Current medicinal chemistry.

[21]  Yasset Perez-Riverol,et al.  Bioinformatics tools for the functional interpretation of quantitative proteomics results. , 2014, Current topics in medicinal chemistry.

[22]  Emily Dimmer,et al.  The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology , 2004, Nucleic Acids Res..

[23]  Minoru Kanehisa,et al.  KEGG as a reference resource for gene and protein annotation , 2015, Nucleic Acids Res..

[24]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[25]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[26]  Atul J. Butte,et al.  Ten Years of Pathway Analysis: Current Approaches and Outstanding Challenges , 2012, PLoS Comput. Biol..