Implementing a Class of Permutation Tests: The coin Package

The R package coin implements a unified approach to permutation tests providing a huge class of independence tests for nominal, ordered, numeric, and censored data as well as multivariate data at mixed scales. Based on a rich and flexible conceptual framework that embeds different permutation test procedures into a common theory, a computational framework is established in coin that likewise embeds the corresponding R functionality in a common S4 class structure with associated generic functions. As a consequence, the computational tools in coin inherit the flexibility of the underlying theory and conditional inference functions for important special cases can be set up easily. Conditional versions of classical tests---such as tests for location and scale problems in two or more samples, independence in two- or three-way contingency tables, or association problems for censored, ordered categorical or multivariate data---can easily be implemented as special cases using this computational toolbox by choosing appropriate transformations of the observations. The paper gives a detailed exposition of both the internal structure of the package and the provided user interfaces along with examples on how to extend the implemented functionality.

[1]  D. Bauer Constructing Confidence Sets Using Rank Statistics , 1972 .

[2]  Kurt Hornik,et al.  The Strucplot Framework: Visualizing Multi-way Contingency Tables with vcd , 2006 .

[3]  J. I The Design of Experiments , 1936, Nature.

[4]  R. Bergmann,et al.  Different Outcomes of the Wilcoxon—Mann—Whitney Test from Different Statistics Packages , 2000 .

[5]  W. R. Buckland,et al.  Distributions in Statistics: Continuous Multivariate Distributions , 1973 .

[6]  K. Hornik,et al.  A Lego System for Conditional Inference , 2006 .

[7]  Helmut Strasser,et al.  On the Asymptotic Theory of Permutation Statistics , 1999 .

[8]  P. Sen,et al.  Theory of rank tests , 1969 .

[9]  Tx Station Stata Statistical Software: Release 7. , 2001 .

[10]  Mark A. van de Wiel,et al.  The split-up algorithm: a fast symbolic method for computing p-values of distribution-free statistics , 2001, Comput. Stat..

[11]  Robert A. Oster,et al.  An Examination of Statistical Software Packages for Categorical Data Analysis Using Exact Methods , 2002 .

[12]  W. R. Buckland,et al.  Distributions in Statistics: Continuous Multivariate Distributions , 1974 .

[13]  Kurt Hornik,et al.  Exact Nonparametric Inference in R , 2002, COMPSTAT.

[14]  Robert A Oster,et al.  An Examination of Statistical Software Packages for Categorical Data Analysis Using Exact Methods—Part II , 2003 .

[15]  A. Genz Numerical Computation of Multivariate Normal Probabilities , 1992 .

[16]  Jaroslav Hájek,et al.  Theory of rank tests , 1969 .

[17]  M. D. Ernst Permutation Methods: A Basis for Exact Inference , 2004 .

[18]  J. Booth,et al.  Resampling-Based Multiple Testing. , 1994 .