Caleydo: Design and evaluation of a visual analysis framework for gene expression data in its biological context

The goal of our work is to support experts in generating hypotheses about the roles of genes in diseases. For a deeper understanding of the complex interdependencies between genes, it is important to bring gene expression data (measurements) into context with pathways. Pathways, which are models of biological processes, are available in online databases. In these databases, large networks are decomposed into small sub-graphs for better manageability. This simplification results in a loss of context, as pathways are interconnected and genes can occur in multiple instances scattered over the network. Our main goal is therefore to present all relevant information, i.e., gene expression data, the relations between expression and pathways, and the relations between multiple pathways, in a simple yet effective way. To achieve this, we employ two different multiple-view approaches. Traditional multiple views are used for large datasets or highly interactive visualizations, while a 2.5D technique is employed to support seamless navigation of multiple pathways while simultaneously linking to the expression of the contained genes. This approach facilitates the understanding of the interconnection of pathways and enables a non-distracting relation to gene expression data. We evaluated Caleydo with a group of users from the life science community. Users were asked to perform three tasks: pathway exploration, gene expression analysis, and information comparison with and without visual links, which had to be conducted under four different conditions. The evaluation results show that the system can considerably improve the process of understanding the complex network of pathways and the individual effects of gene expression regulation. In particular, the quality of the available contextual information and the spatial organization were rated as good for the presented 2.5D setup.
