PADD: Dynamic Distance-Graph based on Similarity Measures for GO Terms Visualization of Alzheimer and Parkinson diseases

In the biological field, a visual and interactive representation of data is useful, particularly when a large amount of multilevel data must be investigated. Communicating this knowledge intuitively is advantageous because it helps users perceive the dynamic structure in which the correct connections are present and from which they can be extrapolated. In this work, we propose a human-interaction system for viewing similarity data based on the functions of the Gene Ontology (Cellular Component, Molecular Function, and Biological Process) of the proteins/genes for Alzheimer disease and Parkinson disease. The similarity data were built with the Lin and Wang measures for all three areas of Gene Ontology. We clustered the data with the K-means algorithm to demonstrate that the information derived from the data can be only partial when traditional display methods are used. We then propose a dynamic and interactive view based on SigmaJS, with the aim of allowing users to customize the analysis workflow interactively. To this end, we developed a first prototype that provides a more immediate visualization and captures the most relevant information within the three vocabularies of Gene Ontology. This facilitates the creation of an omic view and enables a more detailed multilevel analysis, which is more valuable for end users' understanding of the knowledge.
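As a rough illustration of the distance-graph idea, the following minimal sketch computes a Lin-style similarity from information-content (IC) values and converts pairwise similarities into a node/edge structure of the kind a graph renderer could load. It is not the prototype's implementation: the function names, the `threshold` parameter, the `distance = 1 - similarity` conversion, the output layout, and the term labels `T1` to `T3` with their scores are all assumptions made for this example.

```python
from itertools import combinations


def lin_similarity(ic_a, ic_b, ic_mica):
    """Lin similarity: 2 * IC(MICA) / (IC(a) + IC(b)).

    ic_mica is the information content of the most informative
    common ancestor of the two terms.
    """
    denom = ic_a + ic_b
    return 0.0 if denom == 0 else 2.0 * ic_mica / denom


def distance_graph(terms, similarity, threshold=0.5):
    """Build nodes and edges from pairwise similarities.

    Assumption for this sketch: an edge is kept when the similarity
    reaches the threshold, and its length is 1 - similarity.
    """
    nodes = [{"id": t, "label": t} for t in terms]
    edges = []
    for a, b in combinations(terms, 2):
        s = similarity[(a, b)]
        if s >= threshold:
            edges.append(
                {"source": a, "target": b, "distance": round(1.0 - s, 6)}
            )
    return {"nodes": nodes, "edges": edges}


# Toy input: hypothetical term labels and made-up similarity scores.
terms = ["T1", "T2", "T3"]
similarity = {("T1", "T2"): 0.8, ("T1", "T3"): 0.2, ("T2", "T3"): 0.5}
graph = distance_graph(terms, similarity, threshold=0.5)
```

Raising `threshold` removes weaker links and leaves a sparser graph, one simple way a user could tune how much of the similarity data is displayed.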
