The comparison and fitting of given classification schemes

Abstract A permutation procedure is described for statistically comparing a given classification scheme, characterized as a hierarchically organized collection of subsets, to either a proximity matrix or a second classification scheme defined on the same basic set of objects. To prevent a bias that may result when an optimization method is used to construct the classification structures being considered, it is assumed that the desired comparisons involve classifications and/or proximities that have been derived from separate sources. In addition to an extensive number of examples used to clarify the major points in the presentation of the inference strategy, several heuristic optimization techniques are also introduced and illustrated. These latter procedures attempt to “fit” the form of a target classification structure by relabeling the rows and simultaneously the columns of a given proximity matrix to maximize the correspondence between the fixed target and the relabeled proximity matrix.

[1]  L. Hubert,et al.  Quadratic assignment as a general data analysis strategy. , 1976 .

[2]  L. Hubert,et al.  Analyzing Distinctive Features , 1977 .

[3]  Lawrence Hubert,et al.  Evaluating Object Set Partitions: Free-Sort Analysis and Some Generalizations. , 1976 .

[4]  L. Hubert,et al.  Measuring the Power of Hierarchical Cluster Analysis , 1975 .

[5]  J. M. Bevan,et al.  Rank Correlation Methods , 1949 .

[6]  P. E. Johnson,et al.  Some psychological aspects of subject-matter structure. , 1967, Journal of educational psychology.

[7]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[8]  J DEESE,et al.  On the structure of associative meaning. , 1962, Psychological review.

[9]  L. Hubert,et al.  A Graph-Theoretic Approach to Goodness-of-Fit in Complete-Link Hierarchical Clustering , 1976 .

[10]  F. Baker Stability of Two Hierarchical Grouping Techniques Case I: Sensitivity to Data Errors , 1974 .

[11]  Lawrence Hubert,et al.  Data Analysis by Single-Link and Complete-Link Hierarchical Clustering , 1976 .

[12]  Lawrence Hubert Nominal scale response agreement as a generalized correlation , 1977 .

[13]  Lawrence Hubert A Note on the Restriction of Range for Pearson Product-Moment Correlation Coefficients , 1972 .

[14]  N. Mantel The detection of disease clustering and a generalized regression approach. , 1967, Cancer research.

[15]  L. Hubert Approximate Evaluation Techniques for the Single-Link and Complete-Link Hierarchical Clustering Procedures , 1974 .

[16]  Byron J. T. Morgan,et al.  Acoustic confusion of digits in memory and recognition , 1973 .

[17]  Amnon Rapoport,et al.  Structures in the subjective lexicon , 1971 .

[18]  F. Rohlf Methods of Comparing Classifications , 1974 .