Grouping and Association in Contingency Tables: An Exploratory Canonical Correlation Approach

Abstract The criteria of homogeneity and structure were proposed by Goodman (1981a) for determining whether certain rows or columns of a contingency table should be grouped. A data-based procedure (using the canonical form of bivariate distributions) is presented in this article to guide exploratory analysis to determine which rows or columns of a table may be grouped. This procedure facilitates the application of the homogeneity criterion. Relationships between the proposed method of grouping and the structural criterion are discussed as well as simultaneous inference for grouped tables. The grouping method is extended to multiway tables. The use of canonical forms as a model exploratory tool is addressed. Examples are discussed in detail.

[1]  Leo A. Goodman,et al.  Criteria for Determining Whether Certain Categories in a Cross-Classification Table Should Be Combined, with Special Reference to Occupational Categories in an Occupational Mobility Table , 1981, American Journal of Sociology.

[2]  Gilbert Saporta,et al.  L'analyse des données , 1981 .

[3]  J. Aitchison,et al.  Maximum-Likelihood Estimation of Parameters Subject to Restraints , 1958 .

[4]  Z. Gilula On the Analysis of Heterogeneity Among Populations , 1985 .

[5]  Leo A. Goodman,et al.  Association Models and Canonical Correlation in the Analysis of Cross-Classifications Having Ordered Categories , 1981 .

[6]  Shelby J. Haberman,et al.  Tests for Independence in Two-Way Contingency Tables Based on Canonical Correlation and on Linear-By-Linear Interaction , 1981 .

[7]  L. A. Goodman The Analysis of Cross-Classified Data Having Ordered and/or Unordered Categories: Association Models, Correlation Models, and Asymmetry Models for Contingency Tables With or Without Missing Entries , 1985 .

[8]  K. Gabriel,et al.  Simultaneous Test Procedures for Multiple Comparisons on Categorical Data , 1966 .

[9]  L. A. Goodman The Analysis of Cross-Classified Data: Independence, Quasi-Independence, and Interactions in Contingency Tables with or without Missing Entries , 1968 .

[10]  Irving John Good,et al.  The Estimation of Probabilities: An Essay on Modern Bayesian Methods , 1965 .

[11]  M. O'Neill,et al.  ASYMPTOTIC DISTRIBUTIONS OF THE CANONICAL CORRELATIONS FROM CONTINGENCY TABLES , 1978 .

[12]  Leo A. Goodman,et al.  A Simple Simultaneous Test Procedure for Quasi‐Independence in Contingency Tables , 1971 .

[13]  Clifford C. Clogg,et al.  Some Models for the Analysis of Association in Multiway Cross-Classifications Having Ordered Categories , 1982 .

[14]  F. Yates The analysis of contingency tables with groupings based on quantitative characters. , 1948, Biometrika.

[15]  Zvi Gilula,et al.  On some similarities between canonical correlation models and latent class models for two-way contingency tables , 1984 .

[16]  Stephen E. Fienberg,et al.  The analysis of cross-classified categorical data , 1980 .

[17]  E. J. Williams USE OF SCORES FOR THE ANALYSIS OF ASSOCIATION IN CONTINGENCY TABLES , 1952 .

[18]  S. Haberman,et al.  Canonical Analysis of Contingency Tables by Maximum Likelihood , 1986 .

[19]  Albert E. Beaton The Influence of Education and Ability on Salary and Attitudes , 1975 .

[20]  R. Fisher THE PRECISION OF DISCRIMINANT FUNCTIONS , 1940 .

[21]  Otis Dudley Duncan,et al.  Partitioning polytomous variables in multiway contigency analysis , 1975 .

[22]  J. Kruskal Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics , 1977 .

[23]  Z. Gilula A note on the analysis of association in cross-classifications having ordered categories , 1982 .

[24]  Shelby J. Haberman,et al.  Log-Linear Models for Frequency Tables with Ordered Classifications , 1974 .

[25]  L. A. Goodman Simple Models for the Analysis of Association in Cross-Classifications Having Ordered Categories , 1979 .

[26]  G. Dallal A Simple Interaction Model for Two-Dimensional Contingency Tables , 1982 .

[27]  M. Hill Correspondence Analysis: A Neglected Multivariate Method , 1974 .

[28]  A. Krieger,et al.  The Decomposability and Monotonicity of Pearson's Chi-Square for Collapsed Contingency Tables with Applications , 1983 .

[29]  H. O. Lancaster The chi-squared distribution , 1971 .