Interactive hierarchical displays: a general framework for visualization and exploration of large multivariate data sets

Abstract Numerous multivariate visualization techniques and systems have been developed in the past three decades to visually analyze and explore multivariate data being produced daily in application areas ranging from stock markets to the earth and space sciences. However, traditional multivariate visualization techniques typically do not scale well to large multivariate data sets, with the latter becoming more and more common nowadays. This paper proposes a general framework for interactive hierarchical displays (IHDs) to tackle the clutter problem faced by traditional multivariate visualization techniques when analyzing large data sets. The underlying principle of this framework is to develop a multi-resolution view of the data via hierarchical clustering, and to use hierarchical variations of traditional multivariate visualization techniques to convey aggregation information about the resulting clusters. Users can then explore their desired focus region at different levels of detail, using our suite of navigation and filtering tools. We describe this IHD framework and its full implementation on four traditional multivariate visualization techniques, namely, parallel coordinates (Inselberg and Dimsdale, Proceedings of Visualization (1990) 361; Wegman, J. Amer. Statist. Assoc. 411(85) (1990) 664), star glyphs (Siegel et al., Surgery 72 (1972) 126), scatterplot matrices (Cleveland and McGill, Dynamics Graphics for Statistics (1988)), and dimensional stacking (LeBlanc et al., Proceedings of Visualization 90 (1995) 271), as implemented in the XmdvTool system (Ward, Proceedings of Visualization 94 (1994) 326; Martin and Ward, Proceedings of Visualization 95 (1995) 271; Fua et al., Proceedings of Visualization 99 (1999) 43; Proceedings of Information Visualization 99 (1999) 58). We also describe an empirical evaluation that verified the effectiveness of the interactive hierarchical displays.

[1]  Edward J. Wegman,et al.  High Dimensional Clustering Using Parallel Coordinates and the Grand Tour , 1997 .

[2]  Matthew O. Ward,et al.  Exploring N-dimensional databases , 1990, Proceedings of the First IEEE Conference on Visualization: Visualization `90.

[3]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[4]  D. F. Andrews,et al.  PLOTS OF HIGH-DIMENSIONAL DATA , 1972 .

[5]  Matthew O. Ward,et al.  High Dimensional Brushing for Interactive Exploration of Multivariate Data , 1995, Proceedings Visualization '95.

[6]  Alfred Inselberg,et al.  Parallel coordinates for visualizing multi-dimensional geometry , 1987 .

[7]  ShneidermanBen Tree visualization with tree-maps , 1992 .

[8]  Matthew O. Ward,et al.  XmdvTool: integrating multiple methods for visualizing multivariate data , 1994, Proceedings Visualization '94.

[9]  Herman Chernoff,et al.  The Use of Faces to Represent Points in k- Dimensional Space Graphically , 1973 .

[10]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[11]  Sougata Mukherjea,et al.  Glyphmaker: creating customized visualizations of complex data , 1994, Computer.

[12]  Pak Chung Wong,et al.  Multiresolution multidimensional wavelet brushing , 1996, Proceedings of Seventh Annual IEEE Visualization '96.

[13]  Ben Shneiderman,et al.  Tree visualization with tree-maps: 2-d space-filling approach , 1992, TOGS.

[14]  H. P. Friedman,et al.  The surgical implications of physiologic patterns in myocardial infarction shock. , 1972, Surgery.

[15]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[16]  Matthew O. Ward,et al.  Hierarchical parallel coordinates for exploration of large datasets , 1999, Proceedings Visualization '99 (Cat. No.99CB37067).

[17]  Graham J. Wills,et al.  An interactive view for hierarchical clustering , 1998, Proceedings IEEE Symposium on Information Visualization (Cat. No.98TB100258).

[18]  Hans-Peter Kriegel,et al.  Recursive pattern: a technique for visualizing very large amounts of data , 1995, Proceedings Visualization '95.

[19]  Matthew O. Ward,et al.  Scalable Visual Hierarchy Exploration , 2000, DEXA.

[20]  Alfred Inselberg,et al.  Parallel coordinates: a tool for visualizing multi-dimensional geometry , 1990, Proceedings of the First IEEE Conference on Visualization: Visualization `90.

[21]  M. E. McGill,et al.  Dynamic Graphics for Statistics. , 1990 .

[22]  Matthew O. Ward,et al.  Structure-Based Brushes: A Mechanism for Navigating Hierarchically Organized Data and Information Spaces , 2000, IEEE Trans. Vis. Comput. Graph..

[23]  Matthew O. Ward,et al.  Navigating hierarchies with structure-based brushes , 1999, Proceedings 1999 IEEE Symposium on Information Visualization (InfoVis'99).

[24]  E. Wegman Hyperdimensional Data Analysis Using Parallel Coordinates , 1990 .