A Universal and Efficient Method to Compute Maps from Image-Based Prediction Models

Discriminative supervised learning algorithms, such as Support Vector Machines, are becoming increasingly popular in biomedical image computing. One of their main uses is to construct image-based prediction models, e.g., for computer aided diagnosis or "mind reading." A major challenge in these applications is the biological interpretation of the machine learning models, which can be arbitrarily complex functions of the input features (e.g., as induced by kernel-based methods). Recent work has proposed several strategies for deriving maps that highlight regions relevant for accurate prediction. Yet most of these methods o n strong assumptions about t he prediction model (e.g., linearity, sparsity) and/or data (e.g., Gaussianity), or fail to exploit the covariance structure in the data. In this work, we propose a computationally efficient and universal framework for quantifying associations captured by black box machine learning models. Furthermore, our theoretical perspective reveals that examining associations with predictions, in the absence of ground truth labels, can be very informative. We apply the proposed method to machine learning models trained to predict cognitive impairment from structural neuroimaging data. We demonstrate that our approach yields biologically meaningful maps of association.

[1]  Polina Golland,et al.  Discriminative Direction for Kernel Classifiers , 2001, NIPS.

[2]  Mert R. Sabuncu,et al.  On Feature Relevance in Image-Based Prediction Models: An Empirical Study , 2013, MLMI.

[3]  Shu-rong Zheng,et al.  Generalized Measures of Correlation for Asymmetry, Nonlinearity, and Beyond , 2012 .

[4]  Achim Zeileis,et al.  Conditional variable importance for random forests , 2008, BMC Bioinformatics.

[5]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[6]  Christian Böhm,et al.  Automated detection of brain atrophy patterns based on MRI for the prediction of Alzheimer's disease , 2010, NeuroImage.

[7]  Ferath Kherif,et al.  Multivariate voxel-based morphometry successfully differentiates schizophrenia patients from healthy controls , 2007, NeuroImage.

[8]  Gunnar Rätsch,et al.  The Feature Importance Ranking Measure , 2009, ECML/PKDD.

[9]  Karl J. Friston,et al.  Voxel-Based Morphometry—The Methods , 2000, NeuroImage.

[10]  X. Wu,et al.  Individual patient diagnosis of AD and FTD via high-dimensional pattern classification of MRI , 2008, NeuroImage.

[11]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[12]  J. Morris,et al.  The Cortical Signature of Alzheimer's Disease: Regionally Specific Cortical Thinning Relates to Symptom Severity in Very Mild to Mild AD Dementia and is Detectable in Asymptomatic Amyloid-Positive Individuals , 2008, Cerebral cortex.

[13]  Jane S. Paulsen,et al.  Automatic detection of preclinical neurodegeneration , 2009, Neurology.

[14]  Gunnar Rätsch,et al.  POIMs: positional oligomer importance matrices—understanding support vector machine-based signal detectors , 2008, ISMB.

[15]  Mert R. Sabuncu,et al.  The Relevance Voxel Machine (RVoxM): A Self-Tuning Bayesian Model for Informative Image-Based Prediction , 2012, IEEE Transactions on Medical Imaging.

[16]  Daoqiang Zhang,et al.  Multimodal classification of Alzheimer's disease and mild cognitive impairment , 2011, NeuroImage.

[17]  Tso-Jung Yen,et al.  Discussion on "Stability Selection" by Meinshausen and Buhlmann , 2010 .

[18]  Jonathan D. Power,et al.  Prediction of Individual Brain Maturity Using fMRI , 2010, Science.

[19]  Emil Pitkin,et al.  Peeking Inside the Black Box: Visualizing Statistical Learning With Plots of Individual Conditional Expectation , 2013, 1309.6392.

[20]  Bilwaj Gaonkar,et al.  Analytic estimation of statistical significance maps for support vector machine based multi-variate image analysis and classification , 2013, NeuroImage.

[21]  Stefan Haufe,et al.  On the interpretation of weight vectors of linear models in multivariate neuroimaging , 2014, NeuroImage.