Towards Perceptual Optimization of the Visual Design of Scatterplots

Designing a good scatterplot can be difficult for non-experts in visualization, because they need to decide on many parameters, such as marker size and opacity, aspect ratio, color, and rendering order. This paper contributes to research exploring the use of perceptual models and quality metrics to set such parameters automatically for enhanced visual quality of a scatterplot. A key consideration in this paper is the construction of a cost function to capture several relevant aspects of the human visual system, examining a scatterplot design for some data analysis task. We show how the cost function can be used in an optimizer to search for the optimal visual design for a user’s dataset and task objectives (e.g., “reliable linear correlation estimation is more important than class separation”). The approach is extensible to different analysis tasks. To test its performance in a realistic setting, we pre-calibrated it for correlation estimation, class separation, and outlier detection. The optimizer was able to produce designs that achieved a level of speed and success comparable to that of those using human-designed presets (e.g., in R or MATLAB). Case studies demonstrate that the approach can adapt a design to the data, to reveal patterns without user intervention.

[1]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[2]  Steven Franconeri,et al.  Perception of Average Value in Multiclass Scatterplots , 2013, IEEE Transactions on Visualization and Computer Graphics.

[3]  Jing Li,et al.  Judging Correlation from Scatterplots and Parallel Coordinate Plots , 2010, Inf. Vis..

[4]  Jeffrey Heer,et al.  Beyond Weber's Law: A Second Look at Ranking Visualizations of Correlation , 2016, IEEE Transactions on Visualization and Computer Graphics.

[5]  Joshua M. Lewis,et al.  Human Cluster Evaluation and Formal Quality Measures: A Comparative Study , 2012, CogSci.

[6]  Alessandro Acquisti,et al.  Beyond the Turk: An Empirical Comparison of Alternative Platforms for Crowdsourcing Online Behavioral Research , 2016 .

[7]  Joachim Meyer,et al.  Correlation estimates as perceptual judgments. , 1997 .

[8]  Melanie Tory,et al.  Human factors in visualization research , 2004, IEEE Transactions on Visualization and Computer Graphics.

[9]  Tamara Munzner,et al.  Visualization Analysis and Design , 2014, A.K. Peters visualization series.

[10]  Michael Gleicher,et al.  Splatterplots: Overcoming Overdraw in Scatter Plots , 2013, IEEE Transactions on Visualization and Computer Graphics.

[11]  Jeffrey Heer,et al.  Crowdsourcing graphical perception: using mechanical turk to assess visualization design , 2010, CHI.

[12]  Michaël Aupetit,et al.  SepMe: 2002 New visual separation measures , 2016, 2016 IEEE Pacific Visualization Symposium (PacificVis).

[13]  R. Grossman,et al.  Graph-theoretic scagnostics , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[14]  S. Johansson,et al.  Interactive Dimensionality Reduction Through User-defined Combinations of Quality Metrics , 2009, IEEE Transactions on Visualization and Computer Graphics.

[15]  Antti Oulasvirta,et al.  MenuOptimizer: interactive optimization of menu systems , 2013, UIST.

[16]  Emmanuelle Gouillart,et al.  scikit-image: image processing in Python , 2014, PeerJ.

[17]  George W. Fitzmaurice,et al.  Dynamic Opacity Optimization for Scatter Plots , 2015, CHI.

[18]  Tamara Munzner,et al.  Empirical Guidance on Scatterplot and Dimension Reduction Technique Choices , 2013, IEEE Transactions on Visualization and Computer Graphics.

[19]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Ronald A. Rensink,et al.  The Perception of Correlation in Scatterplots , 2010, Comput. Graph. Forum.

[21]  Mark Last Automated Detection of Outliers in Real-World Data , 2001 .

[22]  Tamara Munzner,et al.  Visualizing dimensionally-reduced data: interviews with analysts and a characterization of task sequences , 2014, BELIV.

[23]  Enrico Bertini,et al.  Quality Metrics in High-Dimensional Data Visualization: An Overview and Systematization , 2011, IEEE Transactions on Visualization and Computer Graphics.

[24]  R. Marler,et al.  The weighted sum method for multi-objective optimization: new insights , 2010 .

[25]  P. Kellman,et al.  A theory of visual interpolation in object perception , 1991, Cognitive Psychology.

[26]  Hadley Wickham,et al.  A Layered Grammar of Graphics , 2010 .

[27]  Marcus A. Magnor,et al.  Combining automated analysis and visualization techniques for effective exploration of high-dimensional data , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[28]  Antti Oulasvirta,et al.  User Interface Design with Combinatorial Optimization , 2017, Computer.

[29]  Daniel A. Keim,et al.  Pixnostics: Towards Measuring the Value of Visualization , 2006, 2006 IEEE Symposium On Visual Analytics Science And Technology.

[30]  Rosane Minghim,et al.  Perception-Based Evaluation of Projection Methods for Multidimensional Data Visualization , 2015, IEEE Transactions on Visualization and Computer Graphics.

[31]  Aaron Hertzmann,et al.  Learning Layouts for Single-PageGraphic Designs , 2014, IEEE Transactions on Visualization and Computer Graphics.

[32]  K. Senthamarai Kannan,et al.  Multiple Linear Regression Models in Outlier Detection , 2012 .

[33]  Michaël Aupetit,et al.  Data‐driven Evaluation of Visual Quality Measures , 2015, Comput. Graph. Forum.

[34]  Kun Zhou,et al.  Visual Abstraction and Exploration of Multi-class Scatterplots , 2014, IEEE Transactions on Visualization and Computer Graphics.

[35]  Michael Bögl,et al.  Metaheuristic Search Concepts , 2010 .

[36]  Luca Maria Gambardella,et al.  Ant Algorithms for Discrete Optimization , 1999, Artificial Life.

[37]  Steven Franconeri,et al.  Ranking Visualizations of Correlation Using Weber's Law , 2014, IEEE Transactions on Visualization and Computer Graphics.

[38]  Tamara Munzner,et al.  A Taxonomy of Visual Cluster Separation Factors , 2012, Comput. Graph. Forum.

[39]  Pierre Dragicevic,et al.  Fair Statistical Communication in HCI , 2016 .

[40]  Jacob Cohen The earth is round (p < .05) , 1994 .

[41]  P. Mahalanobis On the generalized distance in statistics , 1936 .

[42]  Martin Reddy,et al.  Perceptually Optimized 3D Graphics , 2001, IEEE Computer Graphics and Applications.

[43]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[44]  W. Cleveland,et al.  Variables on Scatterplots Look More Highly Correlated When the Scales Are Increased , 1982, Science.