Exact Hypothesis Tests for Log-linear Models with exactLoglinTest

This manuscript gives an overview of exact goodness-of-fit testing for log-linear models using the R package exactLoglinTest. The package evaluates the fit of Poisson log-linear models by conditioning on the minimal sufficient statistics, which removes the nuisance parameters. P values are then estimated from the resulting conditional distribution by Monte Carlo. Specifically, the package combines a sequentially rounded normal approximation with importance sampling to approximate probabilities under the conditional distribution. This approach usually yields a high percentage of valid samples. When it does not, a Metropolis-Hastings algorithm that makes more localized jumps within the reference set can be used instead. The manuscript also details how some conditional tests for binomial logit models can be viewed as conditional Poisson log-linear models and hence performed with exactLoglinTest. A diverse set of examples highlights the use, features, and extensions of the software, including potential extensions to evaluating disclosure risk.
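To make the idea of conditioning on sufficient statistics concrete, the following is a minimal sketch in Python, not the package's algorithm or API. It assumes the simplest case, the independence model for a two-way table, where the sufficient statistics are the row and column totals. In that case tables can be drawn directly from the conditional distribution by permuting column labels, so neither the rounded normal importance sampler nor Metropolis-Hastings is needed. The function names `pearson_stat` and `mc_exact_independence` are hypothetical and chosen only for illustration.

```python
import random


def pearson_stat(table):
    """Pearson chi-square statistic for independence in a two-way table."""
    row_tot = [sum(r) for r in table]
    col_tot = [sum(c) for c in zip(*table)]
    n = sum(row_tot)
    stat = 0.0
    for i, row in enumerate(table):
        for j, obs in enumerate(row):
            exp = row_tot[i] * col_tot[j] / n
            if exp > 0:
                stat += (obs - exp) ** 2 / exp
    return stat


def mc_exact_independence(table, n_sim=2000, seed=0):
    """Monte Carlo estimate of the exact conditional P value.

    Hypothetical helper: permuting the column labels of the individual
    observations keeps both margins fixed and samples tables from the
    conditional (multiple hypergeometric) distribution under independence.
    """
    rng = random.Random(seed)
    n_rows, n_cols = len(table), len(table[0])

    # Expand the table into one (row label, column label) pair per observation.
    rows, cols = [], []
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            rows.extend([i] * count)
            cols.extend([j] * count)

    observed = pearson_stat(table)
    hits = 0
    for _ in range(n_sim):
        rng.shuffle(cols)  # new table with the same margins
        sim = [[0] * n_cols for _ in range(n_rows)]
        for i, j in zip(rows, cols):
            sim[i][j] += 1
        # Small tolerance guards against floating-point ties.
        if pearson_stat(sim) >= observed - 1e-12:
            hits += 1

    # Count the observed table itself so the estimate is never zero.
    return (hits + 1) / (n_sim + 1)


if __name__ == "__main__":
    print(mc_exact_independence([[10, 0], [0, 10]]))
    print(mc_exact_independence([[5, 5], [5, 5]]))
```

For models whose reference set cannot be sampled directly in this way, an approximate proposal with importance weights or a Markov chain over the reference set, as described in the abstract, takes the place of the permutation step.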
