Approximate Evaluation Techniques for the Single-Link and Complete-Link Hierarchical Clustering Procedures

Abstract A technique is presented for testing the hypothesis that a hierarchical sequence of partitions constructed by the single-link or complete-link clustering method could have been obtained because of “noise.” Two rank orderings of the object pairs are compared. One of the orderings is obtained from the initial proximity values; the second is derived from the levels at which an object pair first appears within a single subset within the hierarchy. The hypothesis that the given set of proximity values have been assigned randomly is tested by referring the Goodman-Kruskal rank correlation y statistic to an approximate permutation distribution.

[1]  R. Shepard Stimulus and response generalization: tests of a model relating generalization to distance in psychological space. , 1958, Journal of experimental psychology.

[2]  Louis L. McQuitty,et al.  Hierarchical Linkage Analysis for the Isolation of Types , 1960 .

[3]  R. Sokal,et al.  THE COMPARISON OF DENDROGRAMS BY OBJECTIVE METHODS , 1962 .

[4]  R. Sokal,et al.  Principles of numerical taxonomy , 1965 .

[5]  F. Baker,et al.  Monte Carlo F-II: A Computer Program for Analysis of Variance F-Tests By Means of Permutation , 1966 .

[6]  G. N. Lance,et al.  A General Theory of Classificatory Sorting Strategies: 1. Hierarchical Systems , 1967, Comput. J..

[7]  J. Hartigan REPRESENTATION OF SIMILARITY MATRICES BY TREES , 1967 .

[8]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[9]  J. Farris On the Cophenetic Correlation Coefficient , 1969 .

[10]  George A. Miller,et al.  A psychological method to investigate verbal concepts , 1969 .

[11]  W. J. Conover,et al.  Practical Nonparametric Statistics , 1972 .

[12]  G. N. Lance,et al.  Controversy Concerning the Criteria for Taxonometric Strategies , 1971, Computer/law journal.

[13]  L. Hubert Some extensions of Johnson's hierarchical clustering algorithms , 1972 .

[14]  John C. Ogilvie,et al.  Evaluation of hierarchical grouping techniques; a preliminary study , 1972, Comput. J..

[15]  R. F. Ling A Probability Theory of Cluster Analysis , 1973 .

[16]  L. Hubert Monotone invariant clustering procedures , 1973 .

[17]  S. Boorman,et al.  Metrics on spaces of finite trees , 1973 .

[18]  E. Clark,et al.  The Growth of Word Meaning , 1961 .

[19]  M. Kendall,et al.  Rank Correlation Methods , 1949 .

[20]  F. Baker Stability of Two Hierarchical Grouping Techniques Case I: Sensitivity to Data Errors , 1974 .