Evaluation of Projection Algorithms

A number of linear and nonlinear mapping algorithms for the projection of patterns from a high-dimensional space to two dimensions are available. These two-dimensional representations allow quick visual observation of a data set. A combination of two popular mapping algorithms-Sammon's mean-square error technique and the triangulation method-is proposed to overcome the limitations in the individual algorithms. Some factors which describe the goodness of a projection are described, and a comparison is made of six of these algorithms by running them on four data sets. The results obtained support the use of the proposed algorithm.

[1]  Heinrich Niemann,et al.  A Fast-Converging Algorithm for Nonlinear Mapping of High-Dimensional Data to a Plane , 1979, IEEE Transactions on Computers.

[2]  J. Kruskal Nonmetric multidimensional scaling: A numerical method , 1964 .

[3]  Geoffrey H. Ball,et al.  Data analysis in the social sciences: what about the details? , 1965, AFIPS '65 (Fall, part I).

[4]  Roger N. Shepard,et al.  Multidimensional scaling : theory and applications in the behavioral sciences , 1974 .

[5]  John W. Sammon,et al.  An Optimal Set of Discriminant Vectors , 1975, IEEE Transactions on Computers.

[6]  Brian Everitt,et al.  Graphical Techniques for Multivariate Data. , 1978 .

[7]  Bruce J. Schachter A nonlinear mapping algorithm for large data sets , 1978 .

[8]  C. E. Pykett Improving the efficiency of Sammon's nonlinear mapping by using clustering archetypes , 1978 .

[9]  Herman Chernoff GRAPHICAL REPRESENTATIONS AS A DISCIPLINE , 1978 .

[10]  Richard C. T. Lee,et al.  A Heuristic Relaxation Method for Nonlinear Mapping in Cluster Analysis , 1973, IEEE Trans. Syst. Man Cybern..

[11]  Anil K. Jain,et al.  An Intrinsic Dimensionality Estimator from Near-Neighbor Information , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Michael R. Anderberg,et al.  Cluster Analysis for Applications , 1973 .

[13]  Herman Chernoff,et al.  The Use of Faces to Represent Points in k- Dimensional Space Graphically , 1973 .

[14]  Thomas W. Calvert,et al.  Nonorthogonal Projections for Feature Extraction in Pattern Recognition , 1969, IEEE Transactions on Computers.

[15]  Heinrich Niemann,et al.  Linear and nonlinear mapping of patterns , 1980, Pattern Recognit..

[16]  John W. Tukey,et al.  A Projection Pursuit Algorithm for Exploratory Data Analysis , 1974, IEEE Transactions on Computers.

[17]  Manabu Ichino,et al.  Suboptimum Linear Feature Selection in Multiclass Problem , 1974, IEEE Trans. Syst. Man Cybern..

[18]  Anil K. Jain,et al.  Clustering Methodologies in Exploratory Data Analysis , 1980, Adv. Comput..

[19]  B. S. Everitt,et al.  Visual Techniques for Representing Multivariate Data , 1975 .

[20]  Anil K. Jain,et al.  Feature definition in pattern recognition with small sample size , 1978, Pattern Recognit..

[21]  G. Rivard Direct fast Fourier transform of bivariate functions , 1977 .

[22]  Keinosuke Fukunaga,et al.  The optimum nonlinear features for a scatter criterion in discriminant analysis , 1977, IEEE Trans. Inf. Theory.

[23]  Richard C. T. Lee,et al.  A Triangulation Method for the Sequential Mapping of Points from N-Space to Two-Space , 1977, IEEE Transactions on Computers.

[24]  Josef Kittler,et al.  Mathematics Methods of Feature Selection in Pattern Recognition , 1975, Int. J. Man Mach. Stud..

[25]  Bruce A. Eisenstein,et al.  A Declustering Criterion for Feature Extraction in Pattern Recognition , 1978, IEEE Transactions on Computers.

[26]  W. Krzanowski Some Exact Percentage Points of a Statistic Useful in Analysis of Variance and Principal Component Analysis , 1979 .

[27]  Joseph B. Kruskal Comments on "A Nonlinear Mapping for Data Structure Analysis" , 1971, IEEE Trans. Computers.

[28]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[29]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[30]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[31]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.