An 𝓁p-based Kernel Conditional Independence Test

We propose a new computationally efficient test for conditional independence based on the L distance between two kernel-based representatives of well suited distributions. By evaluating the difference of these two representatives at a finite set of locations, we derive a finite dimensional approximation of the L metric, obtain its asymptotic distribution under the null hypothesis of conditional independence and design a simple statistical test from it. The test obtained is consistent and computationally efficient. We conduct a series of experiments showing that the performance of our new tests outperforms state-of-the-art methods both in term of statistical power and type-I error even in the high dimensional setting.

[1]  Zoltán Szabó,et al.  Characteristic and Universal Tensor Product Kernels , 2017, J. Mach. Learn. Res..

[2]  Bing Li,et al.  Sufficient Dimension Reduction: Methods and Applications with R , 2018 .

[3]  Don Coppersmith,et al.  Matrix multiplication via arithmetic progressions , 1987, STOC.

[4]  C. Li,et al.  On nonparametric conditional independence tests for continuous variables , 2019, WIREs Computational Statistics.

[5]  Bernhard Schölkopf,et al.  Kernel Measures of Conditional Dependence , 2007, NIPS.

[6]  Eric V. Strobl,et al.  Approximate Kernel-Based Conditional Independence Tests for Fast Non-Parametric Causal Discovery , 2017, Journal of Causal Inference.

[7]  M. West,et al.  Sparse graphical models for exploring gene expression data , 2004 .

[8]  Michael I. Jordan Graphical Models , 2003 .

[9]  S Richardson,et al.  A Bayesian approach to measurement error problems in epidemiology using conditional independence models. , 1993, American journal of epidemiology.

[10]  Arthur Gretton,et al.  An Adaptive Test of Independence with Analytic Kernel Embeddings , 2016, ICML.

[11]  Arthur Gretton,et al.  Fast Two-Sample Testing with Analytic Representations of Probability Measures , 2015, NIPS.

[12]  Mihaela van der Schaar,et al.  Conditional Independence Testing using Generative Adversarial Networks , 2019, NeurIPS.

[13]  Pietro Perona,et al.  Fast Conditional Independence Test for Vector Variables with Large Sample Sizes , 2018, ArXiv.

[14]  Rainer Spang,et al.  Inferring cellular networks – a review , 2007, BMC Bioinformatics.

[15]  Bernhard Schölkopf,et al.  A Permutation-Based Kernel Conditional Independence Test , 2014, UAI.

[16]  A. Caponnetto,et al.  Optimal Rates for the Regularized Least-Squares Algorithm , 2007, Found. Comput. Math..

[17]  Wicher P. Bergsma,et al.  Double Generative Adversarial Networks for Conditional Independence Testing , 2020, J. Mach. Learn. Res..

[18]  Ingo Steinwart,et al.  Sobolev Norm Learning Rates for Regularized Least-Squares Algorithms , 2017, J. Mach. Learn. Res..

[19]  J. Daudin Partial association measures and an application to qualitative regression , 1980 .

[20]  Bernhard Schölkopf,et al.  Kernel-based Conditional Independence Test and Application in Causal Discovery , 2011, UAI.

[21]  Bernhard Schölkopf,et al.  A Kernel Two-Sample Test , 2012, J. Mach. Learn. Res..

[22]  Lucas Janson,et al.  Panning for gold: ‘model‐X’ knockoffs for high dimensional controlled variable selection , 2016, 1610.02351.

[23]  P. Spirtes,et al.  Review of Causal Discovery Methods Based on Graphical Models , 2019, Front. Genet..

[24]  Rajen Dinesh Shah,et al.  The hardness of conditional independence testing and the generalised covariance measure , 2018, The Annals of Statistics.

[25]  Wicher P. Bergsma,et al.  Testing conditional independence for continuous random variables , 2004 .

[26]  Lorenzo Rosasco,et al.  Generalization Properties of Learning with Random Features , 2016, NIPS.

[27]  G. Varoquaux,et al.  Comparing distributions: 𝓁1 geometry improves kernel two-sample testing , 2019, NeurIPS 2019.

[28]  J. Pearl Causal inference in statistics: An overview , 2009 .

[29]  Sreeram Kannan,et al.  Mimic and Classify : A meta-algorithm for Conditional Independence Testing , 2018, ArXiv.

[30]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .