Parallel Coordinates: Visualization, Exploration and Classification of High-Dimensional Data

A dataset with M items has 2^M subsets, any one of which may be the one we really want. With a good data display, our own remarkable pattern-recognition abilities can not only sort through this combinatorial explosion but also extract insights from the visual patterns. These are the core reasons for data visualization. With parallel coordinates (abbreviated ∥-coords), the search for multivariate relations in high-dimensional datasets is transformed into a 2-D pattern recognition problem. In this chapter, the guidelines and strategy for knowledge discovery using parallel coordinates are illustrated on various real datasets, one with 400 variables from a manufacturing process. A geometric classification algorithm based on ∥-coords is presented and applied to complex datasets. It has low computational complexity and provides the classification rule explicitly and visually. The minimal set of variables required to state the rule is found, and the variables are ordered by their predictive value. A visual economic model of a real country is constructed and analyzed to illustrate how multivariate relations can be modeled using hypersurfaces. The overview at the end provides a basic summary of ∥-coords and a preview of what is to come: the distillation of relational information into patterns that eliminate the need for polygonal lines altogether.
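As a rough illustration of how an N-dimensional item becomes a 2-D pattern, the sketch below maps each data row to the vertices of a polygonal line crossing N equally spaced vertical axes. It is a minimal sketch under stated assumptions, not the chapter's implementation: the function names (`to_polyline`, `dataset_to_polylines`), the per-axis min-max scaling, and the sample rows are all hypothetical choices made for this example.

```python
def to_polyline(point, mins, maxs, spacing=1.0):
    """Map one N-dimensional point to N (x, y) polyline vertices.

    Axis i is placed at x = i * spacing (assumed equal spacing), and each
    value is min-max scaled to [0, 1] along its own axis (assumed scaling).
    """
    vertices = []
    for i, (value, lo, hi) in enumerate(zip(point, mins, maxs)):
        span = hi - lo
        # A constant variable has no range; place it mid-axis by convention.
        y = 0.5 if span == 0 else (value - lo) / span
        vertices.append((i * spacing, y))
    return vertices


def dataset_to_polylines(rows):
    """Convert every row of a dataset to its polygonal line."""
    columns = list(zip(*rows))
    mins = [min(col) for col in columns]
    maxs = [max(col) for col in columns]
    return [to_polyline(row, mins, maxs) for row in rows]


# Hypothetical 4-variable dataset with three items.
rows = [
    (1.0, 10.0, 100.0, 5.0),
    (2.0, 20.0, 50.0, 5.0),
    (3.0, 30.0, 0.0, 5.0),
]
polylines = dataset_to_polylines(rows)
```

Each resulting vertex list can be passed to any 2-D line-drawing routine. In this toy data the first two variables rise together while the third falls, so the line segments between the second and third axes cross, which is the kind of visual pattern the abstract refers to.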
