It is straightforward to analyze data from a single multinomial table. Specifically, for the analysis of a two-way categorical table, the common chi-squared test of independence between the two variables and maximum likelihood estimators of the cell probabilities are readily available. When the counts in the two-way categorical table are formed from familial data (clusters of correlated data), the common chi-squared test no longer applies. We note that there are several approximate adjustments to the common chi-squared test. For example, Choi and McHugh (Choi, J.W. and McHugh, R.B., 1989, A reduction factor in goodness-of-fit and independence tests for clustered and weighted observations. Biometrics, 45, 979–996.) showed how to adjust the chi-squared statistic for clustered and weighted data. However, our main contribution is the construction and analysis of a Bayesian model which removes all analytical approximations. This is an extension of a standard multinomial-Dirichlet model to include the intraclass correlation associated with the individuals within a cluster. We have used a key formula described by Altham (Altham, P.M., 1976, Discrete variable analysis for individuals grouped into families. Biometrika, 63, 263–269.) to incorporate the intraclass correlation. This intraclass correlation varies with the size of the cluster, but we assume that it is the same for all clusters of the same size for the same variable. We use Markov chain Monte Carlo methods to fit our model, and to make posterior inference about the intraclass correlations and the cell probabilities. Also, using Monte Carlo integration with a binomial importance function, we obtain the Bayes factor for a test of no association. To demonstrate the performance of the alternative test and estimation procedure, we have used data on activity limitation status and age from the National Health Interview Survey and a simulation study.
[1]
A. Scott,et al.
The Analysis of Categorical Data from Complex Sample Surveys: Chi-Squared Tests for Goodness of Fit and Independence in Two-Way Tables
,
1981
.
[2]
S. Chib,et al.
Marginal Likelihood From the Metropolis–Hastings Output
,
2001
.
[3]
I. Fellegi.
Approximate Tests of Independence and Goodness of Fit Based on Stratified Multistage Samples
,
1980
.
[4]
Joel E. Cohen,et al.
The Distribution of the Chi-Squared Statistic under Clustered Sampling from Contingency Tables
,
1976
.
[5]
E. Bedrick.
Adjusted chi-squared tests for cross-classified tables of survey data
,
1983
.
[6]
Stephen E. Fienberg,et al.
The Use of Chi‐Squared Statistics for Categorical Data Problems
,
1979
.
[7]
Calyampudi Radhakrishna Rao,et al.
Linear Statistical Inference and its Applications
,
1967
.
[8]
P. Altham.
Discrete variable analysis for individuals grouped into families
,
1976
.
[9]
A. Scott,et al.
Chi-squared Tests with Survey Data
,
1980
.
[10]
Calyampudi R. Rao,et al.
Linear Statistical Inference and Its Applications.
,
1975
.
[11]
R. McHugh,et al.
A reduction factor in goodness-of-fit and independence tests for clustered and weighted observations.
,
1989,
Biometrics.
[12]
S. Brier.
Analysis of contingency tables under cluster sampling
,
1980
.
[13]
A. Scott,et al.
On Chi-Squared Tests for Multiway Contingency Tables with Cell Proportions Estimated from Survey Data
,
1984
.