Testing the significance of the RV coefficient

The relationship between two sets of variables defined for the same individuals can be evaluated by the RV coefficient. However, it is impossible to assess by the RV value alone whether or not the two sets of variables are significantly correlated, which is why a test is required. Asymptotic tests do exist but fail in many situations, hence the interest in permutation tests. However, the main drawbacks of the permutation tests are that they are time consuming. It is therefore interesting to approximate the permutation distribution with continuous distributions (without doing any permutation). The current approximations (normal approximation, a log-transformation and Pearson type III approximation) are discussed and a new one is described: an Edgeworth expansion. Finally, these different approximations are compared for both simulations and for a sensory example.

[1]  Peter Hall,et al.  On the Removal of Skewness by Transformation , 1992 .

[2]  G. Ducharme,et al.  Vector correlation for elliptical distributions , 1989 .

[3]  Robert Cléroux,et al.  Vector correlation based on ranks and a nonparametric test of no association between vectors 1 , 1995 .

[4]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[5]  A. Antoni,et al.  Information des tableaux individus x variables , 1995 .

[6]  K. Ruben Gabriel,et al.  A permutation test of association between configurations by means of the rv coefficient , 1998 .

[7]  Naguib Lallmahomed,et al.  Statistique Théorique et Appliquée , 2007 .

[8]  D. E. Barton,et al.  The conditions under which Gram-Charlier and Edgeworth curves are positive definite and unimodal , 1952 .

[9]  Kiyoshi Asai,et al.  The em Algorithm for Kernel Matrix Completion with Auxiliary Data , 2003, J. Mach. Learn. Res..

[10]  R. Sabatier,et al.  Refined approximations to permutation tests for multivariate inference , 1995 .

[11]  Y. Escoufier LE TRAITEMENT DES VARIABLES VECTORIELLES , 1973 .

[12]  James G. Booth,et al.  On the Validity of Edgeworth and Saddlepoint Approximations , 1994 .

[13]  Pierre-André Cornillon Prise en compte de proximites en analyse factorielle et comparative , 1998 .

[14]  P. Good,et al.  Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses , 1995 .

[15]  Alfredo Rizzi,et al.  COMPSTAT : proceedings in computational statistics, 17th symposium held in Rome, Italy, 2006 , 2006 .

[16]  G. der Megreditchian Meteorological networks optimization from a statistical point of view , 1989 .

[17]  Jérôme Pagès,et al.  Collection and analysis of perceived product inter-distances using multiple factor analysis: Application to the study of 10 white wines from the Loire Valley , 2005 .

[18]  P. Hall The Bootstrap and Edgeworth Expansion , 1992 .

[19]  C. Chatfield Continuous Univariate Distributions, Vol. 1 , 1995 .

[20]  Yoshihiro Yamanishi,et al.  Protein network inference from multiple genomic data: a supervised approach , 2004, ISMB/ECCB.

[21]  N. L. Johnson,et al.  Continuous Univariate Distributions. , 1995 .

[22]  Robert Cléroux,et al.  Some results on vector correlation , 1985 .