Mapping Networks and Trees with Multidimensional Scaling of Proximities

Network methodology typically has two separate stages: (1) constructing a graph from relational data, and (2) drawing the graph on a map to comprehend its structure. Multidimensional scaling (MDS) is discussed as a distance-driven graph drawing method. It is shown that popular drawing methods in computer science that minimize the potential energy of a spring model are equivalent to a simple form of MDS without optimal transformation of the graphical distance. They share a weighted least squares loss function (Kruskal’s stress). The best way to minimize Stress (Guttman’s algorithm) is shown to be a particular force-directed updating scheme in terms of a spring model. With several analyses of two examples (a simple graph and an additive tree), it is shown that using shortest path distances in the graph as input to MDS gives better drawing results than just using its adjacency matrix. Inclusion of data weights in Stress to emphasize good fit of small distances turns out not to be essential. It may even be detrimental to correct representation of line length in a weighted graph.

[1]  Akinori Okada,et al.  Two-Dimensional Centrality of a Social Network , 2007, GfKl.

[2]  Peter Eades,et al.  A Heuristic for Graph Drawing , 1984 .

[3]  Yehuda Koren,et al.  Graph Drawing by Stress Majorization , 2004, GD.

[4]  J. Kruskal Nonmetric multidimensional scaling: A numerical method , 1964 .

[5]  Joseph L. Zinnes,et al.  Theory and Methods of Scaling. , 1958 .

[6]  Stephen G. Kobourov,et al.  Graph Layouts by t‐SNE , 2017, Comput. Graph. Forum.

[7]  A. Okada Using Additive Conjoint Measurement Analysis of Social Network Data , 2003 .

[8]  W. Heiser Fitting Graphs and Trees with Multidimensional Scaling Methods , 1998 .

[9]  Jonathan D. Cohen,et al.  Drawing graphs to convey proximity: an incremental arrangement method , 1997, TCHI.

[10]  Frank L. Klingberg,et al.  Studies in measurement of the relations among sovereign states , 1941 .

[11]  L. Guttman A general nonmetric technique for finding the smallest coordinate space for a configuration of points , 1968 .

[12]  C. Osgood,et al.  The Measurement of Meaning , 1958 .

[13]  A. Buja,et al.  Local Multidimensional Scaling for Nonlinear Dimension Reduction, Graph Drawing, and Proximity Analysis , 2009 .

[14]  J. Douglas Carroll,et al.  Some Multidimensional Scaling and Related Procedures Devised at Bell Laboratories, With Ecological Applications , 1987 .

[15]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[16]  Satoru Kawai,et al.  An Algorithm for Drawing General Undirected Graphs , 1989, Inf. Process. Lett..

[17]  J. Gower,et al.  Minimum Spanning Trees and Single Linkage Cluster Analysis , 1969 .

[18]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[19]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[20]  R. Prim Shortest connection networks and some generalizations , 1957 .

[21]  F W Snyder,et al.  AFFECTIVE MEANING SYSTEMS: A MULTIVARIATE APPROACH. , 1970, Multivariate behavioral research.

[22]  Roger N. Shepard,et al.  Additive clustering: Representation of similarities as combinations of discrete overlapping properties. , 1979 .

[23]  Louis Guttman,et al.  The Relative Associational Contiguity of Occupations in an Urban Setting , 1966 .

[24]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[25]  R N Shepard,et al.  Multidimensional Scaling, Tree-Fitting, and Clustering , 1980, Science.

[26]  A. Buja,et al.  Inequalities and Positive-Definite Functions Arising from a Problem in Multidimensional Scaling , 1994 .

[27]  L. Freeman Spheres, cubes and boxes: Graph dimensionality and network structure , 1983 .

[28]  Wolfgang Gaul,et al.  COMMUNITY STRUCTURE DISCOVERY IN DIRECTED GRAPHS BY ASYMMETRIC CLUSTERING , 2013 .

[29]  J. Leeuw Applications of Convex Analysis to Multidimensional Scaling , 2000 .

[30]  J. N. R. Jeffers,et al.  Graphical Models in Applied Multivariate Statistics. , 1990 .

[31]  Francis Cailliez,et al.  The analytical solution of the additive constant problem , 1983 .

[32]  N. Henley A psychological study of the semantics of animal terms , 1969 .

[33]  W. Heiser,et al.  Feature selection in feature network models: finding predictive subsets of features with the Positive Lasso. , 2008, The British journal of mathematical and statistical psychology.

[34]  Katherine Faust,et al.  Predicting the structure of a communications network from recalled data , 1982 .

[35]  C. Osgood,et al.  A blind analysis of a case of multiple personality using the semantic differential. , 1954, Journal of abnormal psychology.

[36]  J. Meulman The integration of multidimensional scaling and multivariate analysis with optimal transformations , 1992 .

[37]  Ulrik Brandes,et al.  An Experimental Study on Distance-Based Graph Drawing , 2009, GD.

[38]  Sacha Epskamp,et al.  The Gaussian Graphical Model in Cross-Sectional and Time-Series Data , 2016, Multivariate behavioral research.

[39]  A. Tversky,et al.  Additive similarity trees , 1977 .

[40]  A. Householder,et al.  Discussion of a set of points in terms of their mutual distances , 1938 .

[41]  Victor E. McGee,et al.  THE MULTIDIMENSIONAL ANALYSIS OF ‘ELASTIC’ DISTANCES , 1966 .

[42]  Patrick J. F. Groenen,et al.  Multidimensional scaling by majorization: A review , 2016 .

[43]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[44]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[45]  E. Holman The relation between hierarchical and euclidean models for psychological distances , 1972 .

[46]  Willem J. Heiser,et al.  Hierarchical trees can be perfectly scaled in one dimension , 1988 .

[47]  G. Soete A least squares algorithm for fitting additive trees to proximity data , 1983 .

[48]  Edward O. Laumann,et al.  NEW DIRECTIONS IN THE STUDY OF COMMUNITY ELITES , 1973 .

[49]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[50]  J. Leeuw,et al.  Upper bounds for Kruskal's stress , 1984 .

[51]  Lawrence Hubert,et al.  The Structural Representation of Proximity Matrices with MATLAB , 2006 .