Gene Ontology Terms Visualization with Dynamic Distance-Graph and Similarity Measures (S)

In the biological field, having a visual and interactive representation of data is useful, particularly when there is a need to investigate a large amount of multilevel data. It is advantageous to communicate this knowledge intuitively because it helps the users to see the dynamic structure in which the correct connections are interacting and extrapolated. In this work, we propose a human-interaction system to view similarity data based on the functions of the Gene Ontology (Cellular Component, Molecular Function, and Biological Process) for Alzheimer’s and Parkinson’s disease proteins/genes. The similarity data was built with the Lin and Wang measures for all three areas of gene ontology. We clustered data with the K-means algorithm and then we have suggested a dynamic and interactive view based on SigmaJS with the aim of allowing customization in the interactive mode of the analysis workflow by users. In this way we have obtained a more immediate visualization to capture the most relevant information within the three vocabularies of Gene Ontology. This facilitates to obtain an omic view and the possibility of carrying out a multilevel analysis with more details which is much more useful in order to better understand the knowledge of the end user.

[1]  Ted Pedersen,et al.  Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text , 2013, J. Biomed. Informatics.

[2]  Qian Zhao,et al.  Exploratory Gene Ontology Analysis with Interactive Visualization , 2018, Scientific Reports.

[3]  L. Dekang,et al.  Extracting collocations from text corpora , 1998 .

[4]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[5]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[6]  Damian Szklarczyk,et al.  The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible , 2016, Nucleic Acids Res..

[7]  Midori A. Harris,et al.  The Gene Ontology project , 2005 .

[8]  Daisuke Kihara,et al.  NaviGO: interactive tool for visualization and functional similarity and coherence analysis with gene ontology , 2017, BMC Bioinformatics.

[9]  Juan Miguel García-Gómez,et al.  BIOINFORMATICS APPLICATIONS NOTE Sequence analysis Manipulation of FASTQ data with Galaxy , 2005 .

[10]  Muhammad Arif Similarity-Dissimilarity Plot for Visualization of High Dimensional Data in Biomedical Pattern Classification , 2010, Journal of Medical Systems.

[11]  Marie-Claude Potier,et al.  Classification and basic pathology of Alzheimer disease , 2009, Acta Neuropathologica.

[12]  Yibo Wu,et al.  GOSemSim: an R package for measuring semantic similarity among GO terms and gene products , 2010, Bioinform..

[13]  A. Xie,et al.  Shared Mechanisms of Neurodegeneration in Alzheimer's Disease and Parkinson's Disease , 2014, BioMed research international.

[14]  David W. Henderson,et al.  Venn Diagrams for More than Four Classes , 1963 .

[15]  Philip S. Yu,et al.  A new method to measure the semantic similarity of GO terms , 2007, Bioinform..

[16]  T. Veenstra Omics in Systems Biology: Current Progress and Future Outlook , 2020, Proteomics.

[17]  Bang Wong,et al.  Visualizing biological data—now and in the future , 2010, Nature Methods.

[18]  Rachael P. Huntley,et al.  QuickGO: a web-based tool for Gene Ontology searching , 2009, Bioinform..

[19]  S. Bruley des Varannes,et al.  Parkinson disease , 2011, Neurology.

[20]  Manu Goyal,et al.  Artificial Intelligence-Based Image Classification for Diagnosis of Skin Cancer: Challenges and Opportunities. , 2019 .

[21]  Israel Steinfeld,et al.  BMC Bioinformatics BioMed Central , 2008 .

[22]  M. Jacomy,et al.  ForceAtlas2, a Continuous Graph Layout Algorithm for Handy Network Visualization Designed for the Gephi Software , 2014, PloS one.

[23]  J. Loor,et al.  What are omics sciences , 2017 .

[24]  Shannon L. Risacher,et al.  Network approaches to systems biology analysis of complex disease: integrative methods for multi-omics data , 2017, Briefings Bioinform..