Self-Discrepancy Conditional Independence Test

Tests of conditional independence (CI) of random variables play an important role in machine learning and causal inference. Of particular interest are kernel-based CI tests, which allow us to test for independence among random variables with complex distribution functions. The efficacy of a CI test is measured in terms of its power and its calibratedness. We show that the Kernel CI Permutation Test (KCIPT) suffers from a loss of calibratedness as its power is increased by increasing the number of bootstraps. To address this limitation, we propose a novel CI test, called the Self-Discrepancy Conditional Independence Test (SDCIT). SDCIT uses a test statistic that is a modified unbiased estimate of maximum mean discrepancy (MMD), the largest difference in the means of features of the given sample and its permuted counterpart in the kernel-induced Hilbert space. We present results of experiments that demonstrate that SDCIT is, relative to other methods: (i) competitive in terms of its power and calibratedness, outperforming other methods when the number of conditioning variables is large; (ii) more robust with respect to the choice of the kernel function; and (iii) competitive in run time.
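For concreteness, the quantity the test statistic builds on is the standard unbiased estimate of squared MMD between two samples, as in the kernel two-sample test. The Python/NumPy sketch below computes that estimate from precomputed kernel matrices and applies it to a toy sample and a naively permuted counterpart. The RBF kernel, the toy data, and the uniform permutation are illustrative assumptions: SDCIT itself permutes values only among points with similar conditioning values and modifies the estimator, and neither detail is reproduced here.

import numpy as np

def mmd2_unbiased(K_xx, K_yy, K_xy):
    # Unbiased estimate of squared MMD from precomputed kernel matrices
    # (kernel two-sample test); diagonal terms are excluded from the
    # within-sample sums.
    m, n = K_xx.shape[0], K_yy.shape[0]
    term_xx = (K_xx.sum() - np.trace(K_xx)) / (m * (m - 1))
    term_yy = (K_yy.sum() - np.trace(K_yy)) / (n * (n - 1))
    return term_xx + term_yy - 2.0 * K_xy.mean()

def rbf_kernel(A, B, sigma=1.0):
    # Gaussian RBF kernel matrix between the rows of A and the rows of B.
    sq = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-sq / (2.0 * sigma ** 2))

# Toy illustration: permute the y-coordinate uniformly at random to form the
# "permuted counterpart". (SDCIT permutes y only among samples whose
# conditioning values z are similar; that scheme is not shown here.)
rng = np.random.default_rng(0)
z = rng.normal(size=(200, 1))
x = z + 0.1 * rng.normal(size=(200, 1))
y = x + 0.1 * rng.normal(size=(200, 1))
D = np.hstack([x, y, z])                      # original sample (x, y, z)
D_perm = D.copy()
D_perm[:, 1] = D_perm[rng.permutation(len(D_perm)), 1]

stat = mmd2_unbiased(rbf_kernel(D, D),
                     rbf_kernel(D_perm, D_perm),
                     rbf_kernel(D, D_perm))
print(f"unbiased MMD^2 between sample and permuted counterpart: {stat:.4f}")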
