Dendrogramix: A hybrid tree-matrix visualization technique to support interactive exploration of dendrograms

Clustering is often a first step when trying to make sense of a large data set. A wide family of cluster analysis algorithms, namely hierarchical clustering algorithms, does not provide a partition of the data set but a hierarchy of clusters organized in a binary tree, known as a dendrogram. The dendrogram has a classical node-link representation used by experts for various tasks like: to decide which subtrees are actual clusters (e.g., by cutting the dendrogram at a given depth); to give those clusters a name by inspecting their content; etc. We present Dendrogramix, a hybrid tree-matrix interactive visualization of dendrograms that superimposes the relationship between individual objects on to the hierarchy of clusters. Dendrogramix enables users to do tasks which involve both clusters and individual objects that are impracticable with the classical representation, like: to explain why a particular objects belongs to a particular cluster; to elicit and understand uncommon patterns (e.g., objects that could have been classified in a totally different cluster); etc. Those sensemaking tasks are supported by a consistent set of interaction techniques that facilitates the exploration of large clustering results.

[1]  Mark H. Chignell,et al.  Elastic hierarchies: combining treemaps and node-link diagrams , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[2]  Michael J. McGuffin,et al.  Quantifying the Space-Efficiency of 2D Graphical Representations of Trees , 2010, Inf. Vis..

[3]  Michael J. McGuffin,et al.  TreeMatrix: A Hybrid Visualization of Compound Graphs , 2012, Comput. Graph. Forum.

[4]  Ben Shneiderman,et al.  Direct Manipulation: A Step Beyond Programming Languages , 1983, Computer.

[5]  François Guimbretière,et al.  Techniques , 2011, Laboratory Investigation.

[6]  Chris North,et al.  A comparison of benchmark task and insight evaluation methods for information visualization , 2011, Inf. Vis..

[7]  Jin Chen,et al.  Constructing Overview + Detail Dendrogram-Matrix Views , 2009, IEEE Transactions on Visualization and Computer Graphics.

[8]  Jean-Daniel Fekete,et al.  NodeTrix: a Hybrid Visualization of Social Networks , 2007, IEEE Transactions on Visualization and Computer Graphics.

[9]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[10]  Erik D. Demaine,et al.  K-ary Clustering with Optimal Leaf Ordering for Gene Expression Data , 2002, WABI.

[11]  Jean-Daniel Fekete,et al.  Hierarchical Aggregation for Information Visualization: Overview, Techniques, and Design Guidelines , 2010, IEEE Transactions on Visualization and Computer Graphics.

[12]  Eric Lecolinet,et al.  Browsing Zoomable Treemaps: Structure-Aware Multi-Scale Navigation Techniques , 2007, IEEE Transactions on Visualization and Computer Graphics.

[13]  P. Sneath The application of computers to taxonomy. , 1957, Journal of general microbiology.

[14]  J. Gower,et al.  Expressing complex relationships in two dimensions , 1981 .

[15]  Chris North,et al.  An insight-based methodology for evaluating bioinformatics visualizations , 2005, IEEE Transactions on Visualization and Computer Graphics.

[16]  Niklas Elmqvist,et al.  Exploring the design space of composite visualization , 2012, 2012 IEEE Pacific Visualization Symposium.

[17]  Jean-Daniel Fekete,et al.  NodeTrix: Hybrid Representation for Analyzing Social Networks , 2007, ArXiv.

[18]  Gilles Bisson,et al.  Improving Visualization of Large Hierarchical Clustering , 2012, 2012 16th International Conference on Information Visualisation.

[19]  Jean-Daniel Fekete,et al.  MatLink: Enhanced Matrix Visualization for Analyzing Social Networks , 2007, INTERACT.

[20]  Leland Wilkinson,et al.  The History of the Cluster Heat Map , 2009 .