Counting Tables Using the Double-Saddlepoint Approximation

A double-saddlepoint approximation is proposed for the number of contingency tables with counts satisfying certain linear constraints. Computation of the approximation involves fitting a generalized linear model for geometric responses which can be accomplished almost instantaneously using the iterated weighted least squares algorithm. The approximation is far superior to other analytical approximations that have been proposed, and is shown to be highly accurate in a range of examples, including some for which analytical approximations were previously unavailable. A similar approximation is proposed for tables consisting of only zeros and ones based on a logistic regression model. A higher order adjustment to the basic double saddlepoint further improves the accuracy of the approximation in almost all cases. Computer code for implementing methods described in the article is provided as supplemental material.

[1]  Mitchell H. Gail,et al.  Counting the Number of r×c Contingency Tables with Fixed Margins , 1977 .

[2]  Ruriko Yoshida,et al.  Barvinok's rational functions: algorithms and applications to optimization, statistics, and algebra , 2004 .

[3]  James M. Dickey,et al.  Discussion: Testing for Independence in a Two-Way Table: New Interpretations of the Chi-Square Statistic , 1985 .

[4]  S. Sullivant,et al.  Sequential importance sampling for multiway tables , 2006, math/0605615.

[5]  H. Daniels Saddlepoint Approximations in Statistics , 1954 .

[6]  I. J. Good,et al.  The enumeration of arrays and a generalization related to contingency tables , 1977, Discret. Math..

[7]  N D Holmquist,et al.  Variability in classification of carcinoma in situ of the uterine cervix. , 1967, Archives of pathology.

[8]  J. Stoyanov Saddlepoint Approximations with Applications , 2008 .

[9]  Alexander I. Barvinok,et al.  A Polynomial Time Algorithm for Counting Integral Points in Polyhedra when the Dimension Is Fixed , 1993, FOCS.

[10]  D. Edwards,et al.  A fast procedure for model search in multidimensional contingency tables , 1985 .

[11]  Michael Hout,et al.  Association and Heterogeneity: Structural Models of Similarities and Differences , 1987 .

[12]  Lee K. Jones,et al.  On uniform generation of two-way tables with fixed margins and the conditional volume test of Diaconis and Efron , 1996 .

[13]  A. Agresti An introduction to categorical data analysis , 1997 .

[14]  Yuguo Chen,et al.  Sequential Monte Carlo Methods for Statistical Analysis of Tables , 2005 .

[15]  Edward A. Bender,et al.  The asymptotic number of non-negative integer matrices with given row and column sums , 1974, Discret. Math..

[16]  P. McCullagh Tensor Methods in Statistics , 1987 .

[17]  Matthias Beck,et al.  Counting Lattice Points by Means of the Residue Theorem , 2000 .

[18]  A. Barvinok A polynomial time algorithm for counting integral points in polyhedra when the dimension is fixed , 1994 .

[19]  Jesús A. De Loera,et al.  Effective lattice point counting in rational convex polytopes , 2004, J. Symb. Comput..

[20]  A. Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[21]  Peter McCullagh,et al.  Laplace Approximation of High Dimensional Integrals , 1995 .