Visualization of High-dimensional Data via Orthogonal Curves

Computers are still much less useful than the ability of the human eye for pattern matching. This ability can be used quite straightforwardly to identify structure in a data set when it is two or three dimensional. With data sets with more than 3 dimensions some kind of transformation is always necessary. In this paper we review in depth and present and extension of one of these mechanisms: Andrews' curves. With the Andrews' curves we use a curve to represent each data point. A human can run his eye along a set of curves (representing the members of the data set) and identify particular regions of the curves which are optimal for identifying clusters in the data set. Of interest in this context, is our extension in which a moving three-dimensional image is created in which we can see clouds of data points moving as we move along the curves; in a very real sense, the data which dance together are members of the same cluster.

[1]  Richard A. Becker,et al.  Brushing scatterplots , 1987 .

[2]  Andreas Buja,et al.  Grand tour methods: an outline , 1986 .

[3]  Marcus Gallagher,et al.  Multi-layer Perceptron Error Surfaces: Visualization, Structure and Modelling , 2000 .

[4]  Edward J. Wegman,et al.  On some mathematics for visualizing high dimensional data , 2002 .

[5]  Edward A. Rietman,et al.  Dynamic images of plasma processes: Use of Fourier blobs for endpoint detection during plasma etching of patterned wafers , 1998 .

[6]  Ravindra Khattree,et al.  Andrews plots for multivariate data: some new suggestions and applications , 2002 .

[7]  Edward A. Rietman,et al.  A study on /spl Rfr//sup m//spl rarr//spl Rfr//sup 1/ maps: application to a 0.16-/spl mu/m via etch process endpoint , 2000 .

[8]  D. F. Andrews,et al.  PLOTS OF HIGH-DIMENSIONAL DATA , 1972 .

[9]  Agnes M. Herzberg,et al.  An introduction to wavelets with applications to Andrews' plots , 1995 .

[10]  J. Gower,et al.  Methods for statistical data analysis of multivariate observations , 1977, A Wiley publication in applied statistics.

[11]  W. Hacke,et al.  A bivariate version of Andrews plots , 1991, IEEE Transactions on Biomedical Engineering.

[12]  K. Vijayan,et al.  360: Significance Tests in Plots of Multi-Dimensional Data in Two Dimensions , 1974 .

[13]  P. Embrechts,et al.  Variations of Andrews' plots , 1991 .

[14]  Neil Spencer,et al.  Investigating Data with Andrews Plots , 2003 .

[15]  John Frank Murphy Methods for collection and processing of gene expression data , 2005 .

[16]  Daniel Asimov,et al.  The grand tour: a tool for viewing multidimensional data , 1985 .

[17]  Edward J. Wegman,et al.  Image grand tour , 1998, Defense, Security, and Sensing.

[18]  Andreas Buja,et al.  Interactive data visualization using focusing and linking , 1991, Proceeding Visualization '91.

[19]  S. R Kulkarni,et al.  Use, of andrews' function plot technique to construct control curves for multivariate process , 1984 .

[20]  Jürgen Symanzik,et al.  New Applications of the Image Grand Tour , 2002 .

[21]  Agnes M. Herzberg,et al.  An investigation of Andrews' plots to detect period and outliers in time series data , 1986 .