Scatterplots: Tasks, Data, and Designs

Traditional scatterplots fail to scale as the complexity and amount of data increases. In response, there exist many design options that modify or expand the traditional scatterplot design to meet these larger scales. This breadth of design options creates challenges for designers and practitioners who must select appropriate designs for particular analysis goals. In this paper, we help designers in making design choices for scatterplot visualizations. We survey the literature to catalog scatterplot-specific analysis tasks. We look at how data characteristics influence design decisions. We then survey scatterplot-like designs to understand the range of design options. Building upon these three organizations, we connect data characteristics, analysis tasks, and design choices in order to generate challenges, open questions, and example best practices for the effective design of scatterplots.

[1]  Tamara Munzner,et al.  A Multi-Level Typology of Abstract Visualization Tasks , 2013, IEEE Transactions on Visualization and Computer Graphics.

[2]  Alan J. Dix,et al.  A Taxonomy of Clutter Reduction for Information Visualisation , 2007, IEEE Transactions on Visualization and Computer Graphics.

[3]  Jean-Daniel Fekete,et al.  Task taxonomy for graph visualization , 2006, BELIV '06.

[4]  Alexander Wolff,et al.  Selecting the Aspect Ratio of a Scatter Plot Based on Its Delaunay Triangulation , 2013, IEEE Transactions on Visualization and Computer Graphics.

[5]  Silvia Miksch,et al.  Task Cube: A three-dimensional conceptual space of user tasks in visualization design and evaluation , 2016, Inf. Vis..

[6]  Dirk J. Lehmann,et al.  Visualnostics: Visual Guidance Pictograms for Analyzing Projections of High‐dimensional Data , 2015, Comput. Graph. Forum.

[7]  Michael Gleicher,et al.  Splatterplots: Overcoming Overdraw in Scatter Plots , 2013, IEEE Transactions on Visualization and Computer Graphics.

[8]  Steven Franconeri,et al.  Perception of Average Value in Multiclass Scatterplots , 2013, IEEE Transactions on Visualization and Computer Graphics.

[9]  Ronald A. Rensink,et al.  The Perception of Correlation in Scatterplots , 2010, Comput. Graph. Forum.

[10]  Alex Endert,et al.  InterAxis: Steering Scatterplot Axes via Observation-Level Interaction , 2016, IEEE Transactions on Visualization and Computer Graphics.

[11]  Robert Kincaid,et al.  MassVis: Visual analysis of protein complexes using mass spectrometry , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[12]  Christophe Hurter,et al.  MoleView: An Attribute and Structure-Based Semantic Lens for Large Element-Based Plots , 2011, IEEE Transactions on Visualization and Computer Graphics.

[13]  Joseph A. Cottam,et al.  Overplotting: Unified solutions under Abstract Rendering , 2013, 2013 IEEE International Conference on Big Data.

[14]  Tamara Munzner,et al.  Visualizing dimensionally-reduced data: interviews with analysts and a characterization of task sequences , 2014, BELIV.

[15]  Kwan-Liu Ma,et al.  Flow-based scatterplots for sensitivity analysis , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[16]  Issei Fujishiro,et al.  The elements of graphing data , 2005, The Visual Computer.

[17]  Leland Wilkinson,et al.  Stacking Graphic Elements to Avoid Over-Plotting , 2010, IEEE Transactions on Visualization and Computer Graphics.

[18]  Stephen M. Casner,et al.  Task-analytic approach to the automated design of graphic presentations , 1991, TOGS.

[19]  John P. Lewis,et al.  Eurographics/ Ieee-vgtc Symposium on Visualization 2009 Selecting Good Views of High-dimensional Data Using Class Consistency , 2022 .

[20]  Daniel R. Montello,et al.  Testing the First Law of Cognitive Geography on Point-Display Spatializations , 2003, COSIT.

[21]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[22]  D. W. Scott Kernel Density Estimators , 2008 .

[23]  Daniel J. Denis,et al.  The early origins and development of the scatterplot. , 2005, Journal of the history of the behavioral sciences.

[24]  Vidya Setlur,et al.  An Engineering Model for Color Difference as a Function of Size , 2014, CIC.

[25]  Carla E. Brodley,et al.  Dis-function: Learning distance functions interactively , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[26]  Jeffrey Heer,et al.  Multi-Scale Banking to 45 Degrees , 2006, IEEE Transactions on Visualization and Computer Graphics.

[27]  Daniel A. Keim,et al.  Generalized Scatter Plots , 2010, Inf. Vis..

[28]  Leland Wilkinson,et al.  The Grammar of Graphics (Statistics and Computing) , 2005 .

[29]  Raghu Machiraju,et al.  Visualizing Multidimensional Data with Glyph SPLOMs , 2014, Comput. Graph. Forum.

[30]  Jeffrey Heer,et al.  D³ Data-Driven Documents , 2011, IEEE Transactions on Visualization and Computer Graphics.

[31]  Jean-Daniel Fekete,et al.  Hierarchical Aggregation for Information Visualization: Overview, Techniques, and Design Guidelines , 2010, IEEE Transactions on Visualization and Computer Graphics.

[32]  Tamara Munzner,et al.  Spatialization Design: Comparing Points and Landscapes , 2007, IEEE Transactions on Visualization and Computer Graphics.

[33]  Giuseppe Santucci,et al.  Give Chance a Chance: Modeling Density to Enhance Scatter Plot Quality through Random Data Sampling , 2006, Inf. Vis..

[34]  Sebastian Grottel,et al.  Enhancing Scatterplots with Multi‐Dimensional Focal Blur , 2016, Comput. Graph. Forum.

[35]  Michael Gleicher,et al.  What Shakespeare Taught Us About Text Visualization , 2012 .

[36]  Tamara Munzner,et al.  Visualization analysis & design , 2015 .

[37]  Robert Kosara,et al.  Laws of Attraction: From Perceptual Forces to Conceptual Similarity , 2010, IEEE Transactions on Visualization and Computer Graphics.

[38]  Jing Li,et al.  Judging Correlation from Scatterplots and Parallel Coordinate Plots , 2010, Inf. Vis..

[39]  Jing Li,et al.  A model of symbol size discrimination in scatterplots , 2010, CHI.

[40]  Robert L. Grossman,et al.  Graph-Theoretic Scagnostics , 2005, INFOVIS.

[41]  Michael Gleicher,et al.  Task-Driven Comparison of Topic Models , 2016, IEEE Transactions on Visualization and Computer Graphics.

[42]  John T. Stasko,et al.  VisIRR: Visual analytics for information retrieval and recommendation with large-scale document data , 2014, 2014 IEEE Conference on Visual Analytics Science and Technology (VAST).

[43]  Dirk J. Lehmann,et al.  Orthographic Star Coordinates , 2013, IEEE Transactions on Visualization and Computer Graphics.

[44]  David H. Laidlaw,et al.  The relation between visualization size, grouping, and user performance , 2014, IEEE Transactions on Visualization and Computer Graphics.

[45]  Xiaoru Yuan,et al.  Dimension Projection Matrix/Tree: Interactive Subspace Visual Exploration and Analysis of High Dimensional Data , 2013, IEEE Transactions on Visualization and Computer Graphics.

[46]  William S. Cleveland,et al.  The Shape Parameter of a Two-Variable Graph , 1988 .

[47]  Pat Hanrahan,et al.  Arc Length-Based Aspect Ratio Selection , 2011, IEEE Transactions on Visualization and Computer Graphics.

[48]  Jeffrey Heer,et al.  Multi-Scale Banking to 45º , 2006 .

[49]  Robert E. Roth,et al.  An Empirically-Derived Taxonomy of Interaction Primitives for Interactive Cartography and Geovisualization , 2013, IEEE Transactions on Visualization and Computer Graphics.

[50]  Jeffrey Heer,et al.  SpanningAspectRatioBank Easing FunctionS ArrayIn ColorIn Date Interpolator MatrixInterpola NumObjecPointI Rectang ISchedu Parallel Pause Scheduler Sequen Transition Transitioner Transiti Tween Co DelimGraphMLCon IData JSONCon DataField DataSc Dat DataSource Data DataUtil DirtySprite LineS RectSprite , 2011 .

[51]  Jock D. Mackinlay,et al.  Automating the design of graphical presentations of relational information , 1986, TOGS.

[52]  Tamara Munzner,et al.  A Taxonomy of Visual Cluster Separation Factors , 2012, Comput. Graph. Forum.

[53]  Enrico Bertini,et al.  Quality Metrics in High-Dimensional Data Visualization: An Overview and Systematization , 2011, IEEE Transactions on Visualization and Computer Graphics.

[54]  Heidrun Schumann,et al.  A Design Space of Visualization Tasks , 2013, IEEE Transactions on Visualization and Computer Graphics.

[55]  Daniel Weiskopf,et al.  Continuous Scatterplots , 2008, IEEE Transactions on Visualization and Computer Graphics.

[56]  Daniel A. Keim,et al.  Visual quality metrics and human perception: an initial study on 2D projections of large multidimensional data , 2010, AVI.

[57]  Jing Li,et al.  A model of symbol lightness discrimination in sparse scatterplots , 2010, 2010 IEEE Pacific Visualization Symposium (PacificVis).

[58]  Elisabeth S. Nelson How Maps Work: Representation, Visualization, and Design , 1996 .

[59]  Eser Kandogan,et al.  Visualizing multi-dimensional clusters, trends, and outliers using star coordinates , 2001, KDD '01.

[60]  Tamara Munzner,et al.  Empirical Guidance on Scatterplot and Dimension Reduction Technique Choices , 2013, IEEE Transactions on Visualization and Computer Graphics.

[61]  Leland Wilkinson,et al.  ScagExplorer: Exploring Scatterplots by Their Scagnostics , 2014, 2014 IEEE Pacific Visualization Symposium.

[62]  Dana K. Urribarri,et al.  Prediction of data visibility in two-dimensional scatterplots , 2017, Inf. Vis..

[63]  Daniel B. Carr,et al.  Scatterplot matrix techniques for large N , 1986 .

[64]  Madison Elliott,et al.  Interference in the Perception of Two-Population Scatterplots. , 2015, Journal of vision.

[65]  Kun Zhou,et al.  Visual Abstraction and Exploration of Multi-class Scatterplots , 2014, IEEE Transactions on Visualization and Computer Graphics.

[66]  James R. Eagan,et al.  Low-level components of analytic activity in information visualization , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[67]  Heidrun Schumann,et al.  Interactive Lenses for Visualization: An Extended Survey , 2017, Comput. Graph. Forum.

[68]  Ronald A. Rensink The nature of correlation perception in scatterplots , 2016, Psychonomic bulletin & review.

[69]  Matthew O. Ward,et al.  Measuring Data Abstraction Quality in Multiresolution Visualizations , 2006, IEEE Transactions on Visualization and Computer Graphics.

[70]  M. Sheelagh T. Carpendale,et al.  Bubble Sets: Revealing Set Relations with Isocontours over Existing Visualizations , 2009, IEEE Transactions on Visualization and Computer Graphics.

[71]  M. Pawlak,et al.  Kernel density estimators from quantized data , 1997 .

[72]  Lisa A. Best,et al.  Perceiving Relationships: A Physiological Examination of the Perception of Scatterplots , 2006, Diagrams.

[73]  James T. Enns,et al.  High-speed visual estimation using preattentive processing , 1996, TCHI.

[74]  George W. Fitzmaurice,et al.  Dynamic Opacity Optimization for Scatter Plots , 2015, CHI.

[75]  Qi Han,et al.  DocuCompass: Effective exploration of document landscapes , 2016, 2016 IEEE Conference on Visual Analytics Science and Technology (VAST).

[76]  Robert McGill,et al.  The Many Faces of a Scatterplot , 1984 .

[77]  Jean-Daniel Fekete,et al.  Interactive information visualization of a million items , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[78]  Alex Endert,et al.  Visualization by Demonstration: An Interaction Paradigm for Visual Data Exploration , 2017, IEEE Transactions on Visualization and Computer Graphics.