Two-Way Multidimensional Scaling: A Review

Multidimensional scaling (MDS) is a technique for extracting a set of independent variables from one or more proximity matrices. MDS is applied in a wide range of areas, including visualization, pattern analysis, data preprocessing, scale development, cybernetics, and localization. The aim of this paper is to help share innovations across disciplines. We survey and synthesize MDS methods from psychometrics, statistics, and computing. We define classical MDS and distance-based MDS, and then introduce the basic MDS formulations and functions. We survey MDS techniques designed for nonlinear data and describe distance-based MDS in terms of minimizing the energy function of a spring system. We describe completely nonmetric MDS techniques for ordinal input data and show how MDS solutions can be compared using ordinal neighborhood information. We describe optimization methods for fitting MDS models, covering both continuous and combinatorial techniques. We give several illustrative applications of MDS from cybernetics, air traffic control, molecular chemistry, robotics, and network localization, and link these applications to the techniques described in the earlier sections of the paper. Finally, we list a wide range of currently available MDS software and discuss possible future work in the area.
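To make the classical MDS mentioned above concrete, the following is a minimal sketch of the standard double-centering and eigendecomposition procedure, assuming a symmetric matrix of Euclidean distances as input. The function names `classical_mds` and `pairwise_distances` and the parameter `k` are illustrative choices, not notation from the paper, and the sketch is not meant to stand in for any of the specific algorithms the review covers.

```python
import numpy as np


def pairwise_distances(X):
    """Euclidean distance matrix between the rows of X (illustrative helper)."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def classical_mds(D, k=2):
    """Embed n objects in k dimensions from an n x n distance matrix D.

    Assumes D is symmetric with a zero diagonal. Negative eigenvalues,
    which arise when D is not Euclidean, are clipped to zero here; that
    is one simple convention among several.
    """
    D = np.asarray(D, dtype=float)
    n = D.shape[0]
    # Centering matrix J = I - (1/n) 11^T
    J = np.eye(n) - np.ones((n, n)) / n
    # Double-centered matrix of squared distances
    B = -0.5 * J @ (D ** 2) @ J
    # B is symmetric, so eigh applies; it returns eigenvalues in ascending order
    eigvals, eigvecs = np.linalg.eigh(B)
    top = np.argsort(eigvals)[::-1][:k]
    lam = np.clip(eigvals[top], 0.0, None)
    # Coordinates are the leading eigenvectors scaled by sqrt(eigenvalue)
    return eigvecs[:, top] * np.sqrt(lam)


if __name__ == "__main__":
    points = np.array([[0.0, 0.0], [3.0, 0.0], [0.0, 4.0], [3.0, 4.0]])
    D = pairwise_distances(points)
    X = classical_mds(D, k=2)
    # The recovered configuration matches the input up to rotation,
    # reflection, and translation, so we compare distances, not coordinates.
    print(np.allclose(pairwise_distances(X), D))
```

Because the solution is determined only up to rigid motions, comparing the distances of the recovered configuration against the input distances is a more meaningful check than comparing coordinates directly.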

[1]  Joseph Woelfel,et al.  Multidimensional scaling in Riemann space , 1982 .

[2]  Jarkko Venna,et al.  Local multidimensional scaling , 2006, Neural Networks.

[3]  J. Douglas Carroll,et al.  Development of an Agreement Metric Based Upon the RAND Index for the Evaluation of Dimensionality Reduction Techniques, with Applications to Mapping Customer Data , 2007, MLDM.

[4]  Amaury Lendasse,et al.  A robust nonlinear projection method , 2000 .

[5]  Michael J. Brusco,et al.  Using Quadratic Assignment Methods to Generate Initial Permutations for Least-Squares Unidimensional Scaling of Symmetric Proximity Matrices , 2000, J. Classif..

[6]  Tamara Munzner,et al.  Steerable, Progressive Multidimensional Scaling , 2004, IEEE Symposium on Information Visualization.

[7]  Christos Faloutsos,et al.  FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets , 1995, SIGMOD '95.

[8]  Bernhard Schölkopf,et al.  Kernel Principal Component Analysis , 1997, ICANN.

[9]  Hujun Yin,et al.  ViSOM - a novel method for multivariate data projection and structure visualization , 2002, IEEE Trans. Neural Networks.

[10]  J. Douglas Carroll,et al.  DEMScale: Large Scale MDS Accounting for a Ridge Operator and Demographic Variables , 2009, IDA.

[11]  A. Householder,et al.  Discussion of a set of points in terms of their mutual distances , 1938 .

[12]  Michael J. Brusco,et al.  A Binary Integer Program to Maximize the Agreement Between Partitions , 2008, J. Classif..

[13]  Dimitris K. Agrafiotis,et al.  Multidimensional scaling and visualization of large molecular similarity tables , 2001, J. Comput. Chem..

[14]  Alexander M. Bronstein,et al.  Multigrid multidimensional scaling , 2006, Numer. Linear Algebra Appl..

[15]  Dominique Brodbeck,et al.  Combining topological clustering and multidimensional scaling for visualizing large data sets , 1998 .

[16]  Joshua B. Tenenbaum,et al.  Sparse multidimensional scaling using land-mark points , 2004 .

[17]  Hujun Yin,et al.  On multidimensional scaling and the embedding of self-organising maps , 2008, Neural Networks.

[18]  Kaizhong Zhang,et al.  MetricMap: an embedding technique for processing distance-based queries in metric spaces , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[19]  M. Greenacre Correspondence analysis in practice , 1993 .

[20]  Ulrik Brandes,et al.  Eigensolver Methods for Progressive Multidimensional Scaling of Large Data , 2006, GD.

[21]  Victor E. McGee,et al.  THE MULTIDIMENSIONAL ANALYSIS OF ‘ELASTIC’ DISTANCES , 1966 .

[22]  Kai Zhang,et al.  Density-Weighted Nyström Method for Computing Large Kernel Eigensystems , 2009, Neural Comput..

[23]  Andrew Webb,et al.  A Kernel Approach to Metric Multidimensional Scaling , 2002, SSPR/SPR.

[24]  L. Hubert,et al.  Comparing partitions , 1985 .

[25]  James T. Kwok,et al.  Density-Weighted Nystrm Method for Computing Large Kernel Eigensystems , 2009, Neural Computation.

[26]  Willem J. Heiser,et al.  Constrained Multidimensional Scaling, Including Confirmation , 1983 .

[27]  Golan Yona,et al.  Distributional Scaling: An Algorithm for Structure-Preserving Embedding of Metric and Nonmetric Spaces , 2004, J. Mach. Learn. Res..

[28]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[29]  Takayuki Saito,et al.  MULTIDIMENSIONAL SCALING OF ASYMMETRIC PROXIMITY: MODEL AND METHOD , 1990 .

[30]  Willem J. Heiser,et al.  Global Optimization in Any Minkowski Metric: A Permutation-Translation Simulated Annealing Algorithm for Multidimensional Scaling , 2007, J. Classif..

[31]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. I. , 1962 .

[32]  Michael J. Brusco A Simulated Annealing Heuristic for Unidimensional and Multidimensional (City-Block) Scaling of Symmetric Proximity Matrices , 2001, J. Classif..

[33]  Roberto Rocci,et al.  Rotation Techniques in Asymmetric Multidimensional Scaling , 2002 .

[34]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[35]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[36]  Bharat K. Bhargava,et al.  Visualization of wormholes in sensor networks , 2004, WiSe '04.

[37]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[38]  Patrick Mair,et al.  Multidimensional Scaling Using Majorization: SMACOF in R , 2008 .

[39]  Heiga Zen,et al.  On the Use of Kernel PCA for Feature Extraction in Speech Recognition , 2003, IEICE Trans. Inf. Syst..

[40]  C. Coombs A theory of data. , 1965, Psychology Review.

[41]  P. Groenen,et al.  Global Optimization in Least-Squares Multidimensional Scaling by Distance Smoothing , 1999 .

[42]  J. Douglas Carroll,et al.  PARAMAP vs. Isomap: A Comparison of Two Nonlinear Mapping Algorithms , 2006, J. Classif..

[43]  Frank Rehm,et al.  Visual data analysis in air traffic management , 2007 .

[44]  John Platt,et al.  FastMap, MetricMap, and Landmark MDS are all Nystrom Algorithms , 2005, AISTATS.

[45]  Frank Klawonn,et al.  MDSpolar: A New Approach for Dimension Reduction to Visualize High Dimensional Data , 2005, IDA.

[46]  Michel van de Velden Optimal Scaling of Paired Comparison Data , 2004, J. Classif..

[47]  R. Mooney,et al.  Impact of Similarity Measures on Web-page Clustering , 2000 .

[48]  J. Snyder Flattening the Earth: Two Thousand Years of Map Projections , 1994 .

[49]  P. Groenen,et al.  The majorization approach to multidimensional scaling for Minkowski distances , 1995 .

[50]  Wojciech Basalaj,et al.  Incremental multidimensional scaling method for database visualization , 1999, Electronic Imaging.

[51]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[52]  Dominique M. Hanssens,et al.  Modeling Asymmetric Competition , 1988 .

[53]  Matthew Chalmers,et al.  A hybrid layout algorithm for sub-quadratic multidimensional scaling , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[54]  J. Leeuw,et al.  The Gifi system of descriptive multivariate analysis , 1998 .

[55]  Christopher K. I. Williams On a Connection between Kernel PCA and Metric Multidimensional Scaling , 2004, Machine Learning.

[56]  P. Kroonenberg Applied Multiway Data Analysis , 2008 .

[57]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[58]  Antoine Naud An Accurate MDS-Based Algorithm for the Visualization of Large Multidimensional Datasets , 2006, ICAISC.

[59]  Michel Verleysen,et al.  DD-HDS: A Method for Visualization and Exploration of High-Dimensional Data , 2007, IEEE Transactions on Neural Networks.

[60]  A. Buja,et al.  Local Multidimensional Scaling for Nonlinear Dimension Reduction, Graph Drawing, and Proximity Analysis , 2009 .

[61]  Michael J. Brusco,et al.  Optimal Least-Squares Unidimensional Scaling: Improved Branch-and-Bound Procedures and Comparison to Dynamic Programming , 2005 .

[62]  W. Torgerson Multidimensional scaling: I. Theory and method , 1952 .

[63]  Andreas Buja,et al.  Visualization Methodology for Multidimensional Scaling , 2002, J. Classif..

[64]  Joshua B. Tenenbaum,et al.  Global Versus Local Methods in Nonlinear Dimensionality Reduction , 2002, NIPS.

[65]  Guy Melançon,et al.  Multiscale hybrid MDS , 2004, Proceedings. Eighth International Conference on Information Visualisation, 2004. IV 2004..

[66]  Andrzej Cichocki,et al.  Kernel PCA for Feature Extraction and De-Noising in Nonlinear Regression , 2001, Neural Computing & Applications.

[67]  Rudolf Mathar,et al.  Least Squares Multidimensional Scaling with Transformed Distances , 1996 .

[68]  Daijin Kim,et al.  Bayesian Multidimensional Scaling for Multi-Robot Localization , 2008, 2008 IEEE International Conference on Networking, Sensing and Control.

[69]  C. Krumhansl Concerning the Applicability of Geometric Models to Similarity Data : The Interrelationship Between Similarity and Spatial Density , 2005 .

[70]  L. Guttman A general nonmetric technique for finding the smallest coordinate space for a configuration of points , 1968 .

[71]  Shizuhiko Nishisato,et al.  Optimal scaling of paired comparison and rank order data: An alternative to Guttman's formulation , 1978 .

[72]  André Skupin,et al.  From metaphor to method: cartographic perspectives on information visualization , 2000, IEEE Symposium on Information Visualization 2000. INFOVIS 2000. Proceedings.

[73]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[74]  Bernhard Schölkopf,et al.  The Kernel Trick for Distances , 2000, NIPS.

[75]  Andreas Noack,et al.  An Energy Model for Visual Graph Clustering , 2003, GD.

[76]  Ingwer Borg,et al.  Measuring the Similarity of MDS Configurations. , 1985, Multivariate behavioral research.

[77]  Jesper W. Schneider,et al.  Matrix comparison, Part 1: Motivation and important issues for measuring the resemblance between proximity measures or ordination results , 2007 .

[78]  Jonathan Goldstein,et al.  When Is ''Nearest Neighbor'' Meaningful? , 1999, ICDT.

[79]  Joseph B. Kruskal Comments on "A Nonlinear Mapping for Data Structure Analysis" , 1971, IEEE Trans. Computers.

[80]  David J. Kriegman,et al.  Generalized Non-metric Multidimensional Scaling , 2007, AISTATS.

[81]  Phipps Arabie,et al.  Was euclid an unnecessarily sophisticated psychologist? , 1991 .

[82]  Alexander M. Bronstein,et al.  Fast Multidimensional Scaling using Vector Extrapolation , 2008 .

[83]  Heungsun Hwang,et al.  Correspondence Analysis, Multiple Correspondence Analysis and Recent Developments , 2008 .

[84]  Bernard Fertil,et al.  RankVisu: Mapping from the neighborhood network , 2009, Neurocomputing.

[85]  Karl F. MacDorman,et al.  Human emotion and the uncanny valley: A GLM, MDS, and Isomap analysis of robot video ratings , 2008, 2008 3rd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[86]  K. Kim,et al.  Face recognition using kernel principal component analysis , 2002, IEEE Signal Process. Lett..

[87]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[88]  D. Defays A short note on a method of seriation , 1978 .

[89]  Marc Olano,et al.  Glimmer: Multilevel MDS on the GPU , 2009, IEEE Transactions on Visualization and Computer Graphics.

[90]  Zehdreh Allen-Lafayette,et al.  Flattening the Earth, Two Thousand Years of Map Projections , 1998 .

[91]  P. Wesseling An Introduction to Multigrid Methods , 1992 .

[92]  J. Leeuw Applications of Convex Analysis to Multidimensional Scaling , 2000 .

[93]  Richard A. Tapia,et al.  An Approach to Parallelizing Isotonic Regression , 1996 .

[94]  Matthew Chalmers,et al.  A linear iteration time layout algorithm for visualising high-dimensional data , 1996, Proceedings of Seventh Annual IEEE Visualization '96.

[95]  Ying Zhang,et al.  Localization from connectivity in sensor networks , 2004, IEEE Transactions on Parallel and Distributed Systems.

[96]  Forrest W. Young,et al.  Nonmetric individual differences multidimensional scaling: An alternating least squares method with optimal scaling features , 1977 .

[97]  David Harel,et al.  Drawing graphs nicely using simulated annealing , 1996, TOGS.

[98]  E. Holman Completely nonmetric multidimensional scaling , 1978 .

[99]  Pia Borlund,et al.  Matrix comparison, Part 2: Measuring the resemblance between proximity measures or ordination results by use of the mantel and procrustes statistics , 2007, J. Assoc. Inf. Sci. Technol..

[100]  A. Naud,et al.  Visualization of high-dimensional data using an association of multidimensional scaling to clustering , 2004, IEEE Conference on Cybernetics and Intelligent Systems, 2004..

[101]  Phipps Arabie,et al.  Combinatorial Data Analysis: Optimization by Dynamic Programming , 1987 .

[102]  Louis Guttman,et al.  An Approach for Quantifying Paired Comparisons and Rank Order , 1946 .

[103]  Zhi-Hua Zhou,et al.  Supervised nonlinear dimensionality reduction for visualization and classification , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[104]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[105]  Heeyoul Choi,et al.  Robust kernel Isomap , 2007, Pattern Recognit..

[106]  Michel Wedel,et al.  Implications for Asymmetry, Nonproportionality, and Heterogeneity in Brand Switching from Piece-wise Exponential Mixture Hazard Models , 1995 .

[107]  J. Douglas Carroll,et al.  Nonlinear Mapping Using a Hybrid of PARAMAP and Isomap Approaches , 2010 .

[108]  Yoshio Takane,et al.  Multidimensional successive categories scaling: A maximum likelihood method , 1981 .

[109]  A. Sidi,et al.  Extrapolation methods for vector sequences , 1987 .

[110]  Michel Verleysen,et al.  Nonlinear projection with curvilinear distances: Isomap versus curvilinear distance analysis , 2004, Neurocomputing.

[111]  Michael J. Watts,et al.  A Decade of Kasabov's Evolving Connectionist Systems: A Review , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[112]  V. Pliner Metric unidimensional scaling and global optimization , 1996 .

[113]  Ajay K. Manrai,et al.  A New Multidimensional Scaling Methodology for the Analysis of Asymmetric Proximity Data in Marketing Research , 1992 .

[114]  J. Kruskal Nonmetric multidimensional scaling: A numerical method , 1964 .

[115]  J. Ramsay Some Statistical Approaches to Multidimensional Scaling Data , 1982 .

[116]  A. Tversky Features of Similarity , 1977 .

[117]  Joseph L. Zinnes,et al.  Theory and Methods of Scaling. , 1958 .

[118]  Ronald A. Cole,et al.  Spoken Letter Recognition , 1990, HLT.

[119]  Akinori Okada,et al.  Nonmetric Multidimensional Scaling of Asymmetric Proximities , 1987 .

[120]  Satoru Kawai,et al.  An Algorithm for Drawing General Undirected Graphs , 1989, Inf. Process. Lett..

[121]  Anil K. Jain,et al.  Incremental nonlinear dimensionality reduction by manifold learning , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[122]  Phipps Arabie,et al.  Chapter 3 – Multidimensional Scaling , 1998 .

[123]  Michel Verleysen,et al.  A robust non-linear projection method , 2000, ESANN.

[124]  Kin-Nam Lau,et al.  A Nonlinear Programming Approach to Metric Unidimensional Scaling , 1998 .

[125]  Matthew Chalmers,et al.  Fast Multidimensional Scaling Through Sampling, Springs and Interpolation , 2003, Inf. Vis..

[126]  P. Groenen,et al.  The tunneling method for global optimization in multidimensional scaling , 1996 .

[127]  J. Douglas Carroll,et al.  Is the Distance Compression Effect Overstated? Some Theory and Experimentation , 2009, MLDM.

[128]  Evangelia Simantiraki,et al.  Unidimensional scaling: A linear programming approach minimizing absolute deviations , 1996 .

[129]  Lawrence Hubert,et al.  Linear and circular unidimensional scaling for symmetric proximity matrices , 1997 .

[130]  J. Leeuw Convergence of the majorization method for multidimensional scaling , 1988 .

[131]  A. Buja,et al.  Energy / Stress Functions for Dimension Reduction and Graph Drawing : Power Laws and Their Clustering Properties , 2009 .

[132]  Maja J. Mataric,et al.  A spatio-temporal extension to Isomap nonlinear dimension reduction , 2004, ICML.

[133]  Rein Kuusik,et al.  Pattern Discovery Using Seriation and Matrix Reordering : A Unified View, Extensions and an Application to Inventory Management.Mustrite avastamine kasutades järjestamist ning maatriksi ümberkorrastamist: unifitseeritud vaade, edasiarendused ning rakendus ladude juhtimises , 2008 .

[134]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[135]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[136]  Charu C. Aggarwal,et al.  On the Surprising Behavior of Distance Metrics in High Dimensional Spaces , 2001, ICDT.

[137]  Michael W. Trosset Multidimensional Scaling Algorithms for Large Data Sets , 2005 .

[138]  L. McMillan,et al.  A Fast Approximation to Multidimensional Scaling , 2006 .

[139]  Jeanny Hérault,et al.  Curvilinear component analysis: a self-organizing neural network for nonlinear mapping of data sets , 1997, IEEE Trans. Neural Networks.

[140]  Weichao Wang,et al.  Interactive Wormhole Detection and Evaluation , 2007, Inf. Vis..

[141]  P. Arabie,et al.  Three-Way Scaling and Clustering. , 1991 .

[142]  Willem J. Heiser,et al.  A Permutation-Translation Simulated Annealing Algorithm for L1 and L2 Unidimensional Scaling , 2005, J. Classif..

[143]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[144]  Jarkko Venna,et al.  Trustworthiness and metrics in visualizing similarity of gene expression , 2003, BMC Bioinformatics.

[145]  Peter Eades,et al.  A Heuristic for Graph Drawing , 1984 .

[146]  Charles E. Heckler,et al.  Correspondence Analysis and Data Coding With Java and R , 2007, Technometrics.

[147]  Peter Salamon,et al.  Facts, Conjectures, and Improvements for Simulated Annealing , 1987 .

[148]  Deborah F. Swayne,et al.  Data Visualization With Multidimensional Scaling , 2008 .

[149]  Shizuhiko Nishisato,et al.  Multidimensional Nonlinear Descriptive Analysis , 2006 .

[150]  S. Saburi,et al.  A maximum likelihood method for an asymmetric MDS model , 2008, Comput. Stat. Data Anal..

[151]  G. A. Miller,et al.  An Analysis of Perceptual Confusions Among Some English Consonants , 1955 .

[152]  Marc Strickert,et al.  High-Throughput Multi-dimensional Scaling (HiT-MDS) for cDNA-Array Expression Data , 2005, ICANN.

[153]  Chao Wang,et al.  Improving the robustness of ISOMAP by de-noising , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[154]  Willem J. Heiser,et al.  City-Block Scaling: Smoothing Strategies for Avoiding Local Minima , 1998 .

[155]  M. Trosset A New Formulation of the Nonmetric Strain Problem in Multidimensional Scaling , 1998 .

[156]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.