Transforming Scagnostics to Reveal Hidden Features

Scagnostics (Scatterplot Diagnostics) were developed by Wilkinson et al. based on an idea of Paul and John Tukey, in order to discern meaningful patterns in large collections of scatterplots. The Tukeys' original idea was intended to overcome the impediments involved in examining large scatterplot matrices (multiplicity of plots and lack of detail). Wilkinson's implementation enabled for the first time scagnostics computations on many points as well as many plots. Unfortunately, scagnostics are sensitive to scale transformations. We illustrate the extent of this sensitivity and show how it is possible to pair statistical transformations with scagnostics to enable discovery of hidden structures in data that are not discernible in untransformed visualizations.

[1]  William S. Cleveland,et al.  A Model for Studying Display Methods of Statistical Graphics , 1993 .

[2]  Leland Wilkinson,et al.  ScagExplorer: Exploring Scatterplots by Their Scagnostics , 2014, 2014 IEEE Pacific Visualization Symposium.

[3]  Tobias Schreck,et al.  Visual analytics of time dependent 2D point clouds , 2009, CGI.

[4]  Pat Hanrahan,et al.  Arc Length-Based Aspect Ratio Selection , 2011, IEEE Transactions on Visualization and Computer Graphics.

[5]  F. J. Anscombe,et al.  THE TRANSFORMATION OF POISSON, BINOMIAL AND NEGATIVE-BINOMIAL DATA , 1948 .

[6]  Olivier Bau,et al.  Representation-Independent In-Place Magnification with Sigma Lenses , 2010, IEEE Transactions on Visualization and Computer Graphics.

[7]  Ben Shneiderman,et al.  A Rank-by-Feature Framework for Interactive Exploration of Multidimensional Data , 2005, Inf. Vis..

[8]  William S. Cleveland,et al.  The Shape Parameter of a Two-Variable Graph , 1988 .

[9]  Lijie Fu,et al.  Implementation of Three-dimensional Scagnostics , 2009 .

[10]  Ben Shneiderman,et al.  The big picture for big data: visualization. , 2014, Science.

[11]  Rory A. Fisher,et al.  The general sampling distribution of the multiple correlation coefficient , 1928 .

[12]  Steven Skiena,et al.  The Algorithm Design Manual , 2020, Texts in Computer Science.

[13]  Robert L. Grossman,et al.  High-Dimensional Visual Analytics: Interactive Exploration Guided by Pairwise Views of Point Distributions , 2006, IEEE Transactions on Visualization and Computer Graphics.

[14]  Alexander Wolff,et al.  Selecting the Aspect Ratio of a Scatter Plot Based on Its Delaunay Triangulation , 2013, IEEE Transactions on Visualization and Computer Graphics.

[15]  Rick L. Edgeman,et al.  LISP-STAT: An Object-Oriented Environment for Statistical Computing and Dynamic Graphics , 1992 .

[16]  Matthew O. Ward,et al.  XmdvTool: visual interactive data exploration and trend discovery of high-dimensional data sets , 2002, SIGMOD '02.

[17]  Emmanuel Pietriga,et al.  Sigma lenses: focus-context transitions combining space, time and translucence , 2008, CHI.

[18]  Jeffrey Heer,et al.  Multi-Scale Banking to 45 Degrees , 2006, IEEE Transactions on Visualization and Computer Graphics.

[19]  G. W. Furnas,et al.  Generalized fisheye views , 1986, CHI '86.

[20]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[21]  J. Berkson Application of the Logistic Function to Bio-Assay , 1944 .

[22]  David G. Kirkpatrick,et al.  On the shape of a set of points in the plane , 1983, IEEE Trans. Inf. Theory.

[23]  Benjamin B. Bederson,et al.  A review of overview+detail, zooming, and focus+context interfaces , 2009, CSUR.

[24]  Marcus A. Magnor,et al.  Perception-based visual quality measures , 2011, 2011 IEEE Conference on Visual Analytics Science and Technology (VAST).

[25]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[26]  Andreas Buja,et al.  Xgobi: Interactive Dynamic Graphics In The X Window System With A Link To S , 1991 .

[27]  T Allison,et al.  Sleep in mammals: ecological and constitutional correlates. , 1976, Science.

[28]  Michael D. Lee,et al.  The perceptual organization of point constellations , 2009 .

[29]  S. John,et al.  New York City Dataset , 2015 .

[30]  J. J. Gilbert,et al.  Sleep in Mammals : Ecological and Constitutional Correlates , 2007 .

[31]  Marcus A. Magnor,et al.  Automated Analytical Methods to Support Visual Exploration of High-Dimensional Data , 2011, IEEE Transactions on Visualization and Computer Graphics.

[32]  Daniel A. Keim,et al.  Pixnostics: Towards Measuring the Value of Visualization , 2006, 2006 IEEE Symposium On Visual Analytics Science And Technology.

[33]  Kwan-Liu Ma,et al.  Big-Data Visualization , 2013, IEEE Computer Graphics and Applications.

[34]  J. Tukey On the Comparative Anatomy of Transformations , 1957 .

[35]  John P. Lewis,et al.  Eurographics/ Ieee-vgtc Symposium on Visualization 2009 Selecting Good Views of High-dimensional Data Using Class Consistency , 2022 .

[36]  Paul H. Harvey,et al.  Sleep in mammals , 1988, Animal Behaviour.

[37]  Robert Kosara,et al.  Pargnostics: Screen-Space Metrics for Parallel Coordinates , 2010, IEEE Transactions on Visualization and Computer Graphics.