Graph-theoretic scagnostics

We introduce Tukey and Tukey scagnostics and develop graph-theoretic methods for implementing their procedure on large datasets.

[1]  J. Hartigan Printer graphics for clustering , 1975 .

[2]  Mathew D. Penrose,et al.  Extremes for the minimal spanning tree on normally distributed points , 1998, Advances in Applied Probability.

[3]  Issei Fujishiro,et al.  The elements of graphing data , 2005, The Visual Computer.

[4]  David G. Kirkpatrick,et al.  On the shape of a set of points in the plane , 1983, IEEE Trans. Inf. Theory.

[5]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[6]  Mark Bailey,et al.  The Grammar of Graphics , 2007, Technometrics.

[7]  J. Steele Growth Rates of Euclidean Minimal Spanning Trees With Power Weighted Edges , 1988 .

[8]  David W. Scott,et al.  Multivariate Density Estimation: Theory, Practice, and Visualization , 1992, Wiley Series in Probability and Statistics.

[9]  John W. Tukey,et al.  Exploratory Data Analysis. , 1979 .

[10]  Alan M. MacEachren,et al.  Exploring high-D spaces with multiform matrices and small multiples , 2003, IEEE Symposium on Information Visualization 2003 (IEEE Cat. No.03TH8714).

[11]  Daniel B. Carr,et al.  Scatterplot matrix techniques for large N , 1986 .

[12]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[13]  Werner Stuetzle,et al.  Estimating the Cluster Tree of a Density by Analyzing the Minimal Spanning Tree of a Sample , 2003, J. Classif..

[14]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[15]  D. W. Scott,et al.  Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .

[16]  David Eppstein,et al.  Spanning Trees and Spanners , 2000, Handbook of Computational Geometry.

[17]  Steven Skiena,et al.  The Algorithm Design Manual , 2020, Texts in Computer Science.

[18]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[19]  J. Friedman,et al.  Graph-Theoretic Measures of Multivariate Association and Prediction , 1983 .

[20]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[21]  Andreas Buja,et al.  Computing and Graphics in Statistics. , 1992 .

[22]  Matthew Brand,et al.  Continuous nonlinear dimensionality reduction by kernel Eigenmaps , 2003, IJCAI.

[23]  Joseph O'Rourke,et al.  Computational geometry in C (2nd ed.) , 1998 .

[24]  J. Gower,et al.  Minimum Spanning Trees and Single Linkage Cluster Analysis , 1969 .

[25]  Michael Friendly,et al.  Effect ordering for data displays , 2003, Comput. Stat. Data Anal..

[26]  W. Relative Neighborhood Graphs and Their Relatives , 2004 .

[27]  J. Hartigan,et al.  The runt test for multimodality , 1992 .

[28]  Joseph O'Rourke,et al.  Computational Geometry in C. , 1995 .

[29]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[30]  T. Hastie,et al.  Principal Curves , 2007 .

[31]  J. Michael Steele,et al.  Sums of Squares of Edge Lengths and Spacefilling Curve Heuristics for the Traveling Salesman Problem , 1994, SIAM J. Discret. Math..

[32]  Josef Kittler,et al.  A survey of the hough transform , 1988, Comput. Vis. Graph. Image Process..

[33]  Godfried T. Toussaint,et al.  The relative neighbourhood graph of a finite planar set , 1980, Pattern Recognit..

[34]  Jerome H. Friedman,et al.  John W. Tukey's work on interactive graphics , 2002 .

[35]  A. Mazure,et al.  The use of minimal spanning tree to characterize the 2D cluster galaxy distribution , 1998 .

[36]  Steven Skiena,et al.  On Minimum-Area Hulls , 1998, Algorithmica.

[37]  Bryan L. Shader,et al.  Sphere-of-influence graphs using the sup-norm , 2000 .

[38]  D. Kirkpatrick,et al.  A Framework for Computational Morphology , 1985 .

[39]  Matthew O. Ward,et al.  Clutter Reduction in Multi-Dimensional Data Visualization Using Dimension Reordering , 2004, IEEE Symposium on Information Visualization.

[40]  R. Sokal,et al.  A New Statistical Approach to Geographic Variation Analysis , 1969 .

[41]  Ben Shneiderman,et al.  A Rank-by-Feature Framework for Unsupervised Multidimensional Data Exploration Using Low Dimensional Projections , 2004, IEEE Symposium on Information Visualization.

[42]  K. Gabriel,et al.  The biplot graphic display of matrices with application to principal component analysis , 1971 .