A mathematical programming approach to fitting general graphs

We present an algorithm for fitting general graphs to proximity data. The algorithm utilizes a mathematical programming procedure based on a penalty function approach to impose additivity constraints upon parameters. For a user-specified number of links, the algorithm seeks to provide the connected network that gives the least-squares approximation to the proximity data with the specified number of links, allowing for linear transformations of the data. The network distance is the minimum-path-length metric for connected graphs. As a limiting case, the algorithm provides a tree where each node corresponds to an object, if the number of links is set equal to the number of objects minus one. A Monte Carlo investigation indicates that the resulting networks tend to fall within one percentage point of the least-squares solution in terms of the variance accounted for, but do not always attain this global optimum. The network model is discussed in relation to ordinal network representations (Klauer 1989) and NETSCAL (Hutchinson 1989), and applied to several well-known data sets.

[1]  M. J. D. Powell,et al.  Restart procedures for the conjugate gradient method , 1977, Math. Program..

[2]  R. Shepard,et al.  Monotone mapping of similarities into a general metric space , 1974 .

[3]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[4]  Roger N. Shepard,et al.  Additive clustering: Representation of similarities as combinations of discrete overlapping properties. , 1979 .

[5]  J. Cunningham,et al.  Free trees and bidirectional trees as representations of psychological distance , 1978 .

[6]  G. Soete A least squares algorithm for fitting additive trees to proximity data , 1983 .

[7]  P. Arabie,et al.  Mapclus: A mathematical programming approach to fitting the adclus model , 1980 .

[8]  K. C. Klauer Ordinal network representation: Representing proximities by graphs , 1989 .

[9]  A. Tversky,et al.  Additive similarity trees , 1977 .

[10]  W. DeSarbo,et al.  The representation of three-way proximity data by single and multiple tree structure models , 1984 .

[11]  A. Tversky,et al.  Similarity, Separability, and the Triangle Inequality , 1982 .

[12]  A. Tversky Features of Similarity , 1977 .

[13]  F. Restle A metric and an ordering on sets , 1959 .

[14]  I. Borg,et al.  Dimensional models for the perception of rectangles , 1983, Perception & psychophysics.

[15]  A. D. Hall,et al.  The PORT Mathematical Subroutine Library , 1978, TOMS.

[16]  J. W. Hutchinson Netscal: A network scaling algorithm for nonsymmetric proximity data , 1989 .

[17]  Phipps Arabie,et al.  Three-way scaling and clustering. , 1987 .

[18]  J. Carroll,et al.  Spatial, non-spatial and hybrid models for scaling , 1976 .

[19]  A. Tversky,et al.  Extended similarity trees , 1986 .

[20]  Moonja P. Kim,et al.  The Method of Sorting as a Data-Gathering Procedure in Multivariate Research. , 1975, Multivariate behavioral research.