On the Application of Symmetric Dirichlet Distributions and their Mixtures to Contingency Tables

This paper is a continuation of a paper in the Annals of Statistics (1976), 4 1159-1189 where, among other things, a Bayesian approach to testing independence in contingency tables was developed. Our first purpose now, after allowing for an improvement in the previous theory (which also has repercussions on earlier work on the multinomial), is to give extensive numerical results for twodimensional tables, both sparse and nonsparse. We deal with the statistics X2, A (the likelihood-ratio statistic), a slight transformation G of the Type II likelihood ratio, and the number of repeats within cells. The latter has approximately a Poisson distribution for sparse tables. Some of the "asymptotic" distributions are surprisingly good down to exceedingly small tail-area probabilities, as in the previous "mixed Dirichlet" approach to multinomial distributions (J. Roy. Statist. Soc. B, 1967, 29 399-431; J. Amer. Statist. Assoc. 1974, 69 711-720). The approach leads to a quantitative measure of the amount of evidence concerning independence provided by the marginal totals, and this amount is found to be small when neither the row totals nor the column totals are very "rough" and the two sets of totals are not both very flat. For Model 3 (all margins fixed), the relationship is examined between the Bayes factor against independence and its tail-area probability.

[1]  Harold Jeffreys,et al.  Further significance tests , 1936, Mathematical Proceedings of the Cambridge Philosophical Society.

[2]  R Fisher,et al.  Design of Experiments , 1936 .

[3]  H. Jeffreys The tests for sampling differences and contingency , 1937 .

[4]  S. S. Wilks The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses , 1938 .

[5]  ON TESTS FOR HOMOGENEITY , 1938 .

[6]  H. Jeffreys,et al.  The Theory of Probability , 1896 .

[7]  J. B. S. Haldane,et al.  THE MEAN AND VARIANCE OF χ2, WHEN USED AS A TEST OF HOMOGENEITY, WHEN EXPECTATIONS ARE SMALL , 1940 .

[8]  Frank Sandon,et al.  The Advanced Theory of Statistics. II , 1947, The Mathematical Gazette.

[9]  G. Barnard Significance tests for 2 X 2 tables. , 1947, Biometrika.

[10]  Wilfred Perks,et al.  Some observations on inverse probability including a new indifference rule , 1947 .

[11]  E. S. Pearson,et al.  The choice of statistical tests illustrated on the interpretation of data classed in a 2 X 2 table. , 1947, Biometrika.

[12]  L. Stein,et al.  Probability and the Weighing of Evidence , 1950 .

[13]  A test for heterogeneity of proportions. , 1951, Annals of eugenics.

[14]  L. J. Savage,et al.  Probability and the weighing of evidence , 1951 .

[15]  H. Chernoff On the Distribution of the Likelihood Ratio , 1954 .

[16]  B. Woolf ON ESTIMATING THE RELATION BETWEEN BLOOD GROUP AND DISEASE , 1955, Annals of human genetics.

[17]  S. G. Soal,et al.  Modern Experiments in Telepathy , 1955 .

[18]  S. Mitra,et al.  AN INTRODUCTION TO SOME NON-PARAMETRIC GENERALIZATIONS OF ANALYSIS OF VARIANCE AND MULTIVARIATE ANALYSIS , 1956 .

[19]  I. Good On the Estimation of Small Frequencies in Contingency Tables , 1956 .

[20]  R. A. Fisher,et al.  Statistical methods and scientific inference. , 1957 .

[21]  I. Good Saddle-point Methods for the Multinomial Distribution , 1957 .

[22]  D. E. Barton The Matching Distributions: Poisson Limiting Forms and Derived Methods of Approximation , 1958 .

[23]  R. Eck Non-Randomness in Amino-Acid ‘Alleles’ , 1961, Nature.

[24]  Jack Levine,et al.  Minimum periods, modulo $p$, of first-order Bell exponential integers. , 1962 .

[25]  M. Kendall,et al.  The advanced theory of statistics , 1945 .

[26]  I. Good Maximum Entropy for Hypothesis Formulation, Especially for Multidimensional Contingency Tables , 1963 .

[27]  M. W. Birch Maximum Likelihood in Three-Way Contingency Tables , 1963 .

[28]  Hansraj Gupta,et al.  A combinatorial distribution problem , 1966 .

[29]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[30]  G. A. Barnard,et al.  The Bayesian Controversy in Statistical Inference , 1967 .

[31]  I. Good A Bayesian Significance Test for Multinomial Distributions , 1967 .

[32]  F. James Rohlf,et al.  Biometry: The Principles and Practice of Statistics in Biological Research , 1969 .

[33]  P. R. Stein,et al.  ENUMERATION OF STOCHASTIC MATRICES WITH INTEGER ELEMENTS. , 1970 .

[34]  R. Fisher Statistical Methods for Research Workers , 1971 .

[35]  G. Baikunth Nath,et al.  Note on the combinatorial formula for nhr , 1972 .

[36]  William O. J. Moser,et al.  Arrays with fixed row and column sums , 1973, Discret. Math..

[37]  N. Sloane A Handbook Of Integer Sequences , 1973 .

[38]  J. F. Crook,et al.  The Bayes/Non-Bayes Compromise and the Multinomial Distribution , 1974 .

[39]  Murray Aitkin,et al.  Simultaneous Inference and the Choice of Variable Subsets in Multiple Regression , 1974 .

[40]  J. Dickey,et al.  Bayes factors for independence in contingency tables , 1974 .

[41]  Stephen E. Fienberg,et al.  Discrete Multivariate Analysis: Theory and Practice , 1976 .

[42]  I. Good The Bayes Factor Against Equiprobability of a Multinomial Population Assuming a Symmetric Dirichlet Prior , 1975 .

[43]  P. Holland,et al.  Discrete Multivariate Analysis. , 1976 .

[44]  Mitchell H. Gail,et al.  Counting the Number of r×c Contingency Tables with Fixed Margins , 1977 .

[45]  I. J. Good,et al.  The enumeration of arrays and a generalization related to contingency tables , 1977, Discret. Math..

[46]  George A. Milliken,et al.  A Nonrandomized Unconditional Test for Comparing Two Proportions in 2×2 Contingency Tables , 1977 .

[47]  R. L. Plackett,et al.  The marginal totals of a 2×2 table , 1977 .

[48]  J. Reeds,et al.  Compound Multinomial Likelihood Functions are Unimodal: Proof of a Conjecture of I. J. Good , 1977 .