A Simple Bootstrap for Chatterjee's Rank Correlation

We prove that an $m$ out of $n$ bootstrap procedure for Chatterjee's rank correlation is consistent whenever asymptotic normality of Chatterjee's rank correlation can be established. In particular, we prove that $m$ out of $n$ bootstrap works for continuous as well as for discrete data with independent coordinates; furthermore, simulations indicate that it also performs well for discrete data with dependent coordinates, and that it outperforms alternative estimation methods. Consistency of the bootstrap is proved in the Kolmogorov as well as in the Wasserstein distance.

[1]  Zhexiao Lin,et al.  On the failure of the bootstrap for Chatterjee's rank correlation , 2023, Biometrika.

[2]  Zhexiao Lin,et al.  Limit theorems of Chatterjee's rank correlation , 2022, ArXiv.

[3]  M. Drton,et al.  On Azadkia–Chatterjee’s conditional dependence coefficient , 2021, Bernoulli.

[4]  Fang Han,et al.  On boosting the power of Chatterjee’s rank correlation , 2021, Biometrika.

[5]  Daniel Ting Simple, Optimal Algorithms for Random Sampling Without Replacement , 2021, arXiv.org.

[6]  Gershon Wolansky,et al.  Optimal Transport , 2021 .

[7]  B. Sen,et al.  Measuring Association on Topological Spaces Using Kernels and Geometric Graphs , 2020, 2010.01768.

[8]  M. Drton,et al.  On the power of Chatterjee’s rank correlation , 2020, Biometrika.

[9]  P. Bickel,et al.  Correlations with tailored extremal properties , 2020, 2008.10177.

[10]  T. Klein,et al.  Global sensitivity analysis: A novel generation of mighty estimators based on rank statistics , 2020, Bernoulli.

[11]  S. Chatterjee,et al.  A simple measure of conditional dependence , 2019, The Annals of Statistics.

[12]  S. Chatterjee A New Coefficient of Correlation , 2019, Journal of the American Statistical Association.

[13]  H. Dette,et al.  A Copula‐Based Non‐parametric Measure of Regression Dependence , 2013 .

[14]  Joseph T. Chang,et al.  Conditioning as disintegration , 1997 .

[15]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[16]  Joseph P. Romano,et al.  Large Sample Confidence Regions Based on Subsamples under Minimal Assumptions , 1994 .

[17]  F. Cole To the Best of Our Knowledge , 1979 .

[18]  W. Hoeffding A Class of Statistics with Asymptotically Normal Distribution , 1948 .

[19]  S. Holmes,et al.  Measuring multivariate association and beyond. , 2016, Statistics surveys.

[20]  F. Götze,et al.  RESAMPLING FEWER THAN n OBSERVATIONS: GAINS, LOSSES, AND REMEDIES FOR LOSSES , 2012 .

[21]  Robert W. Keener,et al.  Probability and Measure , 2009 .

[22]  P. Bickel,et al.  ON THE CHOICE OF m IN THE m OUT OF n BOOTSTRAP AND CONFIDENCE BOUNDS FOR EXTREMA , 2008 .