Using permutations to quantify and correct for confounding in machine learning predictions

Clinical machine learning applications are often plagued with confounders that are clinically irrelevant, but can still artificially boost the predictive performance of the algorithms. Confounding is especially problematic in mobile health studies run "in the wild", where it is challenging to balance the demographic characteristics of participants that self select to enter the study. Here, we develop novel permutation approaches to quantify and adjust for the influence of observed confounders in machine learning predictions. Using restricted permutations we develop statistical tests to detect response learning in the presence of confounding, as well as, confounding learning per se. In particular, we prove that restricted permutations provide an alternative method to compute partial correlations. This result motivates a novel approach to adjust for confounders, where we are able to "subtract" the contribution of the confounders from the observed predictive performance of a machine learning algorithm using a mapping between restricted and standard permutation null distributions. We evaluate the statistical properties of our approach in simulation studies, and illustrate its application to synthetic data sets.

[1]  E. L. Lehmann,et al.  Consistency and Unbiasedness of Certain Nonparametric Tests , 1951 .

[2]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[3]  Maria L. Rizzo,et al.  Partial Distance Correlation with Methods for Dissimilarities , 2013, 1310.2926.

[4]  Carlos Rodarte,et al.  Pharmaceutical Perspective: How Digital Biomarkers and Contextual Data Will Enable Therapeutic Environments , 2017, Digital Biomarkers.

[5]  G. Wahba,et al.  Multivariate Bernoulli distribution , 2012, 1206.1874.

[6]  Karsten M. Borgwardt,et al.  ccSVM: correcting Support Vector Machines for confounding factors in biological data classification , 2011, Bioinform..

[7]  Larsson Omberg,et al.  Detecting confounding due to subject identification in clinical machine learning diagnostic applications: a permutation test approach , 2017 .

[8]  Janaina Mourão Miranda,et al.  Predictive modelling using neuroimaging data in the presence of confounds , 2017, NeuroImage.

[9]  Marti J. Anderson,et al.  Permutation tests for multi-factorial analysis of variance , 2003 .

[10]  C. Davatzikos,et al.  Addressing Confounding in Predictive Models with an Application to Neuroimaging , 2016, The international journal of biostatistics.

[11]  Thomas J. Santner,et al.  The Design and Analysis of Computer Experiments , 2003, Springer Series in Statistics.

[12]  P. Dagum Digital biomarkers of cognitive function , 2018, npj Digital Medicine.

[13]  P. Good,et al.  Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses , 1995 .

[14]  M. E. Johnson,et al.  Minimax and maximin distance designs , 1990 .

[15]  W. Hoeffding A Class of Statistics with Asymptotically Normal Distribution , 1948 .

[16]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[17]  D. Bamber The area above the ordinal dominance graph and the area below the receiver operating characteristic graph , 1975 .

[18]  N. Graham,et al.  Areas beneath the relative operating characteristics (ROC) and relative operating levels (ROL) curves: Statistical significance and interpretation , 2002 .

[19]  R. Serfling Approximation Theorems of Mathematical Statistics , 1980 .

[20]  Maria L. Rizzo,et al.  Measuring and testing dependence by correlation of distances , 2007, 0803.4101.