On the empirical multilinear copula process for count data

Continuation refers to the operation by which the cumulative distribution function of a discontinuous random vector is made continuous through multilinear interpolation. The copula that results from the application of this technique to the classical empirical copula is either called the multilinear or the checkerboard copula. As shown by Genest and Ne\v{s}lehov\'{a} (Astin Bull. 37 (2007) 475-515) and Ne\v{s}lehov\'{a} (J. Multivariate Anal. 98 (2007) 544-567), this copula plays a central role in characterizing dependence concepts in discrete random vectors. In this paper, the authors establish the asymptotic behavior of the empirical process associated with the multilinear copula based on $d$-variate count data. This empirical process does not generally converge in law on the space $\mathcal {C}([0,1]^d)$ of continuous functions on $[0,1]^d$, equipped with the uniform norm. However, the authors show that the process converges in $\mathcal{C}(K)$ for any compact $K\subset\mathcal{O}$, where $\mathcal{O}$ is a dense open subset of $[0,1]^d$, whose complement is the Cartesian product of the ranges of the marginal distribution functions. This result is sufficient to deduce the weak limit of many functionals of the process, including classical statistics for monotone trend. It also leads to a powerful and consistent test of independence which is applicable even to sparse contingency tables whose dimension is sample size dependent.

[1]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[2]  M. Wegkamp,et al.  Weak Convergence of Empirical Copula Processes , 2004 .

[3]  C. Genest,et al.  A Primer on Copulas for Count Data , 2007, ASTIN Bulletin.

[4]  Paul Deheuvels,et al.  Non parametric tests of independence , 1980 .

[5]  Johanna Nešlehová,et al.  On rank correlation measures for non-continuous random variables , 2007 .

[6]  Richard A. Davis,et al.  Time Series: Theory and Methods (2nd ed.). , 1992 .

[7]  B. Rémillard,et al.  Test of independence and randomness based on the empirical copula process , 2004 .

[8]  Jean-François Quessy,et al.  Tests of Multivariate Independence for Ordinal Data , 2009 .

[9]  J. Tebbs,et al.  An Introduction to Categorical Data Analysis , 2008 .

[10]  Friedrich Schmid,et al.  Multivariate Extensions of Spearman's Rho and Related Statistics , 2007 .

[11]  Bruno Rémillard,et al.  On the estimation of Spearman's rho and related tests of independence for possibly discontinuous multivariate data , 2013, J. Multivar. Anal..

[12]  Jean-Paul Chilès,et al.  Wiley Series in Probability and Statistics , 2012 .

[13]  J. Segers Asymptotics of empirical copula processes under non-restrictive smoothness assumptions , 2010, 1012.2133.

[14]  Dudley,et al.  Real Analysis and Probability: Measurability: Borel Isomorphism and Analytic Sets , 2002 .

[15]  Christian Genest,et al.  Discussion: Statistical models and methods for dependence in insurance data , 2011 .

[16]  Ludger Rüschendorf,et al.  Asymptotic Distributions of Multivariate Rank Order Statistics , 1976 .

[17]  A. Agresti An introduction to categorical data analysis , 1997 .

[18]  R. Nelsen An Introduction to Copulas , 1998 .

[19]  Alan J. Lee,et al.  U-Statistics: Theory and Practice , 1990 .