Markov chain Monte Carlo exact tests for incomplete two-way contingency tables

We consider testing the quasi-independence hypothesis for two-way contingency tables which contain some structural zero cells. For sparse contingency tables where the large sample approximation is not adequate, the Markov chain Monte Carlo exact tests are powerful tools. To construct a connected chain over the two-way contingency tables with fixed sufficient statistics and an arbitrary configuration of structural zero cells, an algebraic algorithm proposed by Diaconis and Sturmfels [Diaconis, P. and Sturmfels, B. (1998). The Annals of statistics, 26, pp. 363–397.] can be used. However, their algorithm does not seem to be a satisfactory answer, because the Markov basis produced by the algorithm often contains many redundant elements and is hard to interpret. We derive an explicit characterization of a minimal Markov basis, prove its uniqueness, and present an algorithm for obtaining the unique minimal basis. A computational example and the discussion on further basis reduction for the case of positive sufficient statistics are also given.

[1]  N. Vidmar Effects of decision alternatives on the verdicts and social perceptions of simulated jurors. , 1972, Journal of personality and social psychology.

[2]  Jonathan J. Forster,et al.  Monte Carlo exact conditional tests for log-linear and logistic models , 1996 .

[3]  L. A. Goodman The Analysis of Cross-Classified Data: Independence, Quasi-Independence, and Interactions in Contingency Tables with or without Missing Entries , 1968 .

[4]  Brian D. Ripley,et al.  Stochastic Simulation , 2005 .

[5]  E. Thompson,et al.  Performing the exact test of Hardy-Weinberg proportion for multiple alleles. , 1992, Biometrics.

[6]  A. Takemura,et al.  Minimal Basis for a Connected Markov Chain over 3 × 3 ×K Contingency Tables with Fixed Two‐Dimensional Marginals , 2003 .

[7]  S. Fienberg,et al.  Incomplete two-dimensional contingency tables. , 1969, Biometrics.

[8]  J. A. Harris ON THE SELECTIVE ELIMINATION OCCURRING DURING THE DEVELOPMENT OF THE FRUITS OF STAPHYLEA , 1910 .

[9]  Hidefumi Ohsugi,et al.  Koszul Bipartite Graphs , 1999 .

[10]  Stephen E. Fienberg,et al.  Discrete Multivariate Analysis: Theory and Practice , 1976 .

[11]  A. Agresti [A Survey of Exact Inference for Contingency Tables]: Rejoinder , 1992 .

[12]  Diane Hernek Random generation of 2×n contingency tables , 1998, Random Struct. Algorithms.

[13]  Scott L. Hershberger,et al.  Incomplete Contingency Tables , 2005 .

[14]  Fabio Rapallo Algebraic Markov Bases and MCMC for Two‐Way Contingency Tables , 2003 .

[15]  J. Besag,et al.  Generalized Monte Carlo significance tests , 1989 .

[16]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[17]  Diane Hernek Random generation of 2 × n contingency tables , 1998 .

[18]  John W. McDonald,et al.  Exact conditional tests of quasi-independence for triangular contingency tables: estimating attained significance levels , 1995 .

[19]  J. Forster,et al.  Monte Carlo exact tests for square contingency tables , 1996 .

[20]  Martin E. Dyer,et al.  Polynomial-time counting and sampling of two-rowed contingency tables , 2000, Theor. Comput. Sci..

[21]  A. Takemura,et al.  Some characterizations of minimal Markov basis for sampling from discrete conditional distributions , 2004 .

[22]  P. Diaconis,et al.  Algebraic algorithms for sampling from conditional distributions , 1998 .

[23]  Harrison C. White,et al.  An Anatomy Of Kinship , 1963 .

[24]  Tarakchandra Das The purums : an old Kuki tribe of Manipur , 1945 .