Linked Data Based Multi-omics Integration and Visualization for Cancer Decision Networks

Visualization of Gene Expression (GE) is a challenging task since the number of genes and their associations are difficult to predict in various set of biological studies. GE could be used to understand tissue-gene-protein relationships. Currently, Heatmaps is the standard visualization technique to depict GE data. However, Heatmaps only covers the cluster of highly dense regions. It does not provide the Interaction, Functional Annotation and pooled understanding from higher to lower expression. In the present paper, we propose a graph-based technique - based on color encoding from higher to lower expression map, along with the functional annotation. This visualization technique is highly interactive (HeatMaps are mainly static maps). The visualization system here explains the association between overlapping genes with and without tissues types. Traditional visualization techniques (viz-Heatmaps) generally explain each of the association in distinct maps. For example, overlapping genes and their interactions, based on co-expression and expression cut off are three distinct Heatmaps. We demonstrate the usability using ortholog study of GE and visualize GE using GExpressionMap. We further compare and benchmark our approach with the existing visualization techniques. It also reduces the task to cluster the expressed gene networks further to understand the over/under expression. Further, it provides the interaction based on co-expression network which itself creates co-expression clusters. GExpressionMap provides a unique graph-based visualization for GE data with their functional annotation and associated interaction among the DEGs (Differentially Expressed Genes).

[1]  D. Koller,et al.  GeneXPress : A Visualization and Statistical Analysis Tool for Gene Expression and Sequence Data , .

[2]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[3]  Robin D Dowell,et al.  The similarity of gene expression between human and mouse tissues , 2011, Genome Biology.

[4]  Leng Han,et al.  Gene co-expression network analysis reveals common system-level properties of prognostic genes across cancer types , 2014, Nature Communications.

[5]  Kay Nieselt,et al.  Mayday - integrative analytics for expression data , 2010, BMC Bioinformatics.

[6]  Judith A. Blake,et al.  MGD: the Mouse Genome Database , 2003, Nucleic Acids Res..

[7]  Michael Burch,et al.  BiCluster Viewer: A Visualization Tool for Analyzing Gene Expression Data , 2011, ISVC.

[8]  Robert E. W. Hancock,et al.  INVEX—a web-based tool for integrative visualization of expression data , 2013, Bioinform..

[9]  Satoru Miyano,et al.  ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles , 2006, Bioinform..

[10]  Paul Stothard,et al.  Gene co-expression network analysis identifies porcine genes associated with variation in Salmonella shedding , 2014, BMC Genomics.

[11]  S. Mocellin,et al.  RNA interference: learning gene knock-down from cell physiology , 2004, Journal of Translational Medicine.

[12]  Jaak Vilo,et al.  ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap , 2015, Nucleic Acids Res..

[13]  E. Wang,et al.  Analysis and design of RNA sequencing experiments for identifying isoform regulation , 2010, Nature Methods.

[14]  C. Mungall,et al.  Gene Ontology Consortium : going forward The Gene Ontology , 2015 .

[15]  Arsen O. Batagov,et al.  HOXD-AS1 is a novel lncRNA encoded in HOXD cluster and a marker of neuroblastoma progression revealed via integrative analysis of noncoding transcriptome , 2014, BMC Genomics.

[16]  Kay Nieselt,et al.  Mayday-a microarray data analysis workbench , 2006, Bioinform..

[17]  Dietrich Rebholz-Schuhmann,et al.  Linked Functional Annotation For Differentially Expressed Gene (DEG) Demonstrated using Illumina Body Map 2.0 , 2015, SWAT4LS.

[18]  Dietrich Rebholz-Schuhmann,et al.  Towards precision medicine: discovering novel gynecological cancer biomarkers and pathways using linked data , 2017, Journal of Biomedical Semantics.

[19]  Chao Wu,et al.  Integrating gene expression and protein-protein interaction network to prioritize cancer-associated genes , 2012, BMC Bioinformatics.

[20]  Dietrich Rebholz-Schuhmann,et al.  Drug Dosage Balancing Using Large Scale Multi-omics Datasets , 2016, DMAH@VLDB.

[21]  Momiao Xiong,et al.  Canonical correlation analysis for RNA-seq co-expression networks , 2013, Nucleic acids research.

[22]  Javier León,et al.  Gene expression regulation and cancer , 2006, Clinical & translational oncology : official publication of the Federation of Spanish Oncology Societies and of the National Cancer Institute of Mexico.

[23]  Ka Yee Yeung,et al.  Principal component analysis for clustering gene expression data , 2001, Bioinform..

[24]  Stefan Decker,et al.  SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes , 2014, SWAT4LS.

[25]  José Luís Oliveira,et al.  TrigNER: automatically optimized biomedical event trigger recognition on scientific documents , 2014, Source Code for Biology and Medicine.

[26]  Derek J. Van Booven,et al.  HeatmapGenerator: high performance RNAseq and microarray visualization software suite to examine differential gene expression levels using an R and C++ hybrid computational pipeline , 2014, Source Code for Biology and Medicine.

[27]  Jörg Schultz,et al.  Genome Expression Pathway Analysis Tool – Analysis and visualization of microarray gene expression data under genomic, proteomic and metabolic context , 2007, BMC Bioinformatics.

[28]  Aidong Zhang,et al.  Interactive visualization and analysis for gene expression data , 2002, Proceedings of the 35th Annual Hawaii International Conference on System Sciences.

[29]  M. Bhasin,et al.  Determination of system level alterations in host transcriptome due to Zika virus (ZIKV) Infection in retinal pigment epithelium , 2018, Scientific Reports.