Comparative Analysis of Multidimensional, Quantitative Data

When analyzing multidimensional, quantitative data, the comparison of two or more groups of dimensions is a common task. Typical sources of such data are experiments in biology, physics or engineering, which are conducted in different configurations and use replicates to ensure statistically significant results. One common way to analyze this data is to filter it using statistical methods and then run clustering algorithms to group similar values. The clustering results can be visualized using heat maps, which show differences between groups as changes in color. However, in cases where groups of dimensions have an a priori meaning, it is not desirable to cluster all dimensions combined, since a clustering algorithm can fragment continuous blocks of records. Furthermore, identifying relevant elements in heat maps becomes more difficult as the number of dimensions increases. To aid in such situations, we have developed Matchmaker, a visualization technique that allows researchers to arbitrarily arrange and compare multiple groups of dimensions at the same time. We create separate groups of dimensions which can be clustered individually, and place them in an arrangement of heat maps reminiscent of parallel coordinates. To identify relations, we render bundled curves and ribbons between related records in different groups. We then allow interactive drill-downs using enlarged detail views of the data, which enable in-depth comparisons of clusters between groups. To reduce visual clutter, we minimize crossings between the views. This paper concludes with two case studies. The first demonstrates the value of our technique for the comparison of clustering algorithms. In the second, biologists use our system to investigate why certain strains of mice develop liver disease while others remain healthy, informally showing the efficacy of our system when analyzing multidimensional data containing distinct groups of dimensions.
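As an illustration of the linking step, the following minimal sketch assumes that each group of dimensions has already been clustered separately, so that every record carries one cluster label per group. It is not Matchmaker's implementation; the function names `ribbon_widths` and `count_crossings` are hypothetical. The first function counts how many records each pair of clusters shares, which could serve as the width of a ribbon between two heat maps. The second counts how many pairwise connections cross for a given record ordering in two neighboring views, the quantity a layout would try to keep small.

```python
from collections import Counter
from itertools import combinations


def ribbon_widths(labels_a, labels_b):
    """Count the records shared by each (cluster in A, cluster in B) pair.

    labels_a[i] and labels_b[i] are the cluster labels of record i in
    two separately clustered groups of dimensions (assumed inputs).
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both groups must label the same records")
    return Counter(zip(labels_a, labels_b))


def count_crossings(order_a, order_b):
    """Count pairs of record connections that cross between two views.

    order_a and order_b list the same record ids from top to bottom as
    they appear in two neighboring heat maps.
    """
    position_in_b = {record: i for i, record in enumerate(order_b)}
    sequence = [position_in_b[record] for record in order_a]
    # Two connections cross exactly when their order is inverted in B.
    return sum(
        1
        for i, j in combinations(range(len(sequence)), 2)
        if sequence[i] > sequence[j]
    )


# Five records, clustered independently in two groups of dimensions.
widths = ribbon_widths([0, 0, 1, 1, 1], [0, 1, 1, 1, 0])
crossings = count_crossings(["g1", "g2", "g3"], ["g3", "g1", "g2"])
```

The quadratic crossing count is adequate for a sketch; a merge-sort-based inversion count would be the usual choice for large record sets.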
