Exploratory Visualization of Multivariate Data with Variable Quality

Real-world data is known to be imperfect, suffering from various forms of defects such as sensor variability, estimation errors, uncertainty, human errors in data entry, and gaps in data gathering. Analysis conducted on variable quality data can lead to inaccurate or incorrect results. An effective visualization system must make users aware of the quality of their data by explicitly conveying not only the actual data content, but also its quality attributes. While some research has been conducted on visualizing uncertainty in spatio-temporal data and univariate data, little work has been reported on extending this capability into multivariate data visualization. In this paper we describe our approach to the problem of visually exploring multivariate data with variable quality. As a foundation, we propose a general approach to defining quality measures for tabular data, in which data may experience quality problems at three granularities: individual data values, complete records, and specific dimensions. We then present two approaches to visual mapping of quality information into display space. In particular, one solution embeds the quality measures as explicit values into the original dataset by regarding value quality and record quality as new data dimensions. The other solution is to superimpose the quality information within the data visualizations using additional visual variables. We also report on user studies conducted to assess alternate mappings of quality attributes to visual variables for the second method. In addition, we describe case studies that expose some of the advantages and disadvantages of these two approaches

[1]  Penny Rheingans,et al.  Procedural annotation of uncertain information , 2000, Proceedings Visualization 2000. VIS 2000 (Cat. No.00CH37145).

[2]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[3]  John Stasko,et al.  BEST PAPER: A Knowledge Task-Based Framework for Design and Evaluation of Information Visualizations , 2004 .

[4]  D. J. Newman,et al.  UCI Repository of Machine Learning Database , 1998 .

[5]  Matthew O. Ward,et al.  Visual Hierarchical Dimension Reduction for Exploration of High Dimensional Datasets , 2003, VisSym.

[6]  Jock D. Mackinlay,et al.  Visualizing data with bounded uncertainty , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[7]  Alex T. Pang,et al.  Glyphs for Visualizing Uncertainty in Vector Fields , 1996, IEEE Trans. Vis. Comput. Graph..

[8]  Shiping Huang,et al.  Exploratory Visualization of Data with Variable Quality , 2005 .

[9]  B. Marx The Visual Display of Quantitative Information , 1985 .

[10]  P. Fayers,et al.  The Visual Display of Quantitative Information , 1990 .

[11]  Kristin A. Cook,et al.  Illuminating the Path: The Research and Development Agenda for Visual Analytics , 2005 .

[12]  M. Kate Beard,et al.  NCGIA Research Initiative 7 Visualization of Spatial Data Quality: Scientific Report for the Specialist Meeting (91-26) , 1991 .

[13]  Barry N. Taylor,et al.  Guidelines for Evaluating and Expressing the Uncertainty of Nist Measurement Results , 2017 .

[14]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[15]  Alex Pang Visualizing Uncertainty in Geo-spatial Data , 2001 .

[16]  Chris R. Johnson,et al.  NHI-NSF Visualization Research Challenges Report , 2005 .

[17]  Alex T. Pang,et al.  Approaches to uncertainty visualization , 1996, The Visual Computer.

[18]  Gary J. Hunter,et al.  New Tools For Handling Spatial Data Quality : Moving from Academic Concepts to Practical Reality , 1999 .

[19]  Ross Brown Animated visual vibrations as an uncertainty visualisation technique , 2004, GRAPHITE '04.

[20]  John L.P. Thompson,et al.  Missing data , 2004 .

[21]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[22]  Deborah F. Swayne,et al.  Missing Data in Interactive High-Dimensional Data Visualization , 1998 .

[23]  Alan M. MacEachren,et al.  VISUALIZING UNCERTAIN INFORMATION , 1992 .

[24]  Penny Rheingans,et al.  NIH-NSF visualization research challenges report summary , 2006, IEEE Computer Graphics and Applications.

[25]  Matthew O. Ward,et al.  High Dimensional Brushing for Interactive Exploration of Multivariate Data , 1995, Proceedings Visualization '95.

[26]  Heike Hofmann,et al.  Interactive Graphics for Data Sets with Missing Values—MANET , 1996 .