Leveraging both Lesion Features and Procedural Bias in Neuroimaging: An Dual-Task Split dynamics of inverse scale space

The prediction and selection of lesion features are two important tasks in voxel-based neuroimage analysis. Existing multivariate learning models take two tasks equivalently and optimize simultaneously. However, in addition to lesion features, we observe that there is another type of feature, which is commonly introduced during the procedure of preprocessing steps, which can improve the prediction result. We call such a type of feature as procedural bias. Therefore, in this paper, we propose that the features/voxels in neuroimage data are consist of three orthogonal parts: lesion features, procedural bias, and null features. To stably select lesion features and leverage procedural bias into prediction, we propose an iterative algorithm (termed GSplit LBI) as a discretization of differential inclusion of inverse scale space, which is the combination of Variable Splitting scheme and Linearized Bregman Iteration (LBI). Specifically, with a variable the splitting term, two estimators are introduced and split apart, i.e. one is for feature selection (the sparse estimator) and the other is for prediction (the dense estimator). Implemented with Linearized Bregman Iteration (LBI), the solution path of both estimators can be returned with different sparsity levels on the sparse estimator for the selection of lesion features. Besides, the dense the estimator can additionally leverage procedural bias to further improve prediction results. To test the efficacy of our method, we conduct experiments on the simulated study and Alzheimer's Disease Neuroimaging Initiative (ADNI) database. The validity and the benefit of our model can be shown by the improvement of prediction results and the interpretability of visualized procedural bias and lesion features.

[1]  Jonathan E. Taylor,et al.  Interpretable whole-brain prediction analysis with GraphNet , 2013, NeuroImage.

[2]  Y-K Tu,et al.  Problems of correlations between explanatory variables in multiple regression analyses in the dental literature , 2005, British Dental Journal.

[3]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[4]  Wotao Yin,et al.  An Iterative Regularization Method for Total Variation-Based Image Restoration , 2005, Multiscale Model. Simul..

[5]  Yuan Yao,et al.  A Unified Dynamic Approach to Sparse Model Selection , 2018, AISTATS.

[6]  John Ashburner,et al.  A fast diffeomorphic image registration algorithm , 2007, NeuroImage.

[7]  Gaël Varoquaux,et al.  Total Variation Regularization for fMRI-Based Prediction of Behavior , 2011, IEEE Transactions on Medical Imaging.

[8]  R. Tibshirani,et al.  The solution path of the generalized lasso , 2010, 1005.1971.

[9]  Yuan Yao,et al.  FDR-HS: An Empirical Bayesian Identification of Heterogenous Features in Neuroimage Analysis , 2018, MICCAI.

[10]  Gaël Varoquaux,et al.  Identifying Predictive Regions from fMRI with TV-L1 Prior , 2013, 2013 International Workshop on Pattern Recognition in Neuroimaging.

[11]  Mark E. Schmidt,et al.  The Alzheimer's Disease Neuroimaging Initiative: Progress report and future plans , 2010, Alzheimer's & Dementia.

[12]  Wen Gao,et al.  Stable Feature Selection from Brain sMRI , 2015, AAAI.

[13]  Bradley Efron,et al.  Local False Discovery Rates , 2005 .

[14]  A. Janke,et al.  4D deformation modeling of cortical disease progression in Alzheimer's dementia , 2001, Magnetic resonance in medicine.

[15]  J B Poline,et al.  Direct voxel-based comparison between grey matter hypometabolism and atrophy in Alzheimer's disease. , 2007, Brain : a journal of neurology.

[16]  Bertrand Thirion,et al.  Multiscale Mining of fMRI Data with Hierarchical Structured Sparsity , 2012, SIAM J. Imaging Sci..

[17]  Maria Luisa Gorno-Tempini,et al.  Patterns of brain atrophy that differentiate corticobasal degeneration syndrome from progressive supranuclear palsy. , 2006, Archives of neurology.

[18]  Arthur F. Kramer,et al.  Age-related differences in regional brain volumes: A comparison of optimized voxel-based morphometry to manual volumetry , 2009, Neurobiology of Aging.

[19]  L. Rudin,et al.  Nonlinear total variation based noise removal algorithms , 1992 .

[20]  G. Box Robustness in the Strategy of Scientific Model Building. , 1979 .

[21]  Daoqiang Zhang,et al.  Ensemble sparse classification of Alzheimer's disease , 2012, NeuroImage.

[22]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[23]  P. Scheltens,et al.  Research criteria for the diagnosis of Alzheimer's disease: revising the NINCDS–ADRDA criteria , 2007, The Lancet Neurology.

[24]  Marc Teboulle,et al.  Mirror descent and nonlinear projected subgradient methods for convex optimization , 2003, Oper. Res. Lett..

[25]  Alexandre M. Bayen,et al.  Accelerated Mirror Descent in Continuous and Discrete Time , 2015, NIPS.

[26]  Weizhong Tian,et al.  Changes of brain gray matter structure in Parkinson's disease patients with dementia , 2013, Neural regeneration research.

[27]  G. Wenk,et al.  Neuropathologic changes in Alzheimer's disease. , 2003, The Journal of clinical psychiatry.

[28]  Karl J. Friston,et al.  Why Voxel-Based Morphometry Should Be Used , 2001, NeuroImage.

[29]  Stephen P. Boyd,et al.  An ADMM Algorithm for a Class of Total Variation Regularized Estimation Problems , 2012, 1203.1828.

[30]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[31]  Wotao Yin,et al.  A New Regularization Path for Logistic Regression via Linearized Bregman , 2012 .

[32]  Kiralee M. Hayashi,et al.  Three-dimensional gray matter atrophy mapping in mild cognitive impairment and mild Alzheimer disease. , 2007, Archives of neurology.

[33]  Yuan Yao,et al.  GSplit LBI: Taming the Procedural Bias in Neuroimaging for Disease Prediction , 2017, MICCAI.

[34]  Zheng Wang,et al.  Elastic Net based sparse feature learning and classification for Alzheimer's disease identification , 2017, 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[35]  S. Osher,et al.  Sparse Recovery via Differential Inclusions , 2014, 1406.7728.

[36]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[37]  Wen Gao,et al.  Efficient Generalized Fused Lasso and its Application to the Diagnosis of Alzheimer's Disease , 2014, AAAI.

[38]  Stefan Pollmann,et al.  PyMVPA: a Python Toolbox for Multivariate Pattern Analysis of fMRI Data , 2009, Neuroinformatics.

[39]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[40]  Yuan Yao,et al.  Split LBI: An Iterative Regularization Path with Structural Sparsity , 2016, NIPS.

[41]  P. Scheltens,et al.  Medial temporal lobe atrophy predicts Alzheimer's disease in patients with minor cognitive impairment , 2002, Journal of neurology, neurosurgery, and psychiatry.

[42]  John Darzentas,et al.  Problem Complexity and Method Efficiency in Optimization , 1983 .

[43]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[44]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[45]  C R Jack,et al.  Gray and white matter water diffusion in the syndromic variants of frontotemporal dementia , 2010, Neurology.

[46]  Yong He,et al.  Discriminative analysis of early Alzheimer's disease using multi-modal imaging and multi-level characterization with multi-classifier (M3) , 2012, NeuroImage.

[47]  Yuan Yao,et al.  A Tutorial on Libra: R package for the Linearized Bregman Algorithm in High Dimensional Statistics , 2016, 1604.05910.

[48]  Yuan Yao,et al.  Boosting with structural sparsity: A differential inclusion approach , 2017, Applied and Computational Harmonic Analysis.

[49]  Clifford R. Jack,et al.  Diagnostic neuroimaging across diseases , 2011, NeuroImage.

[50]  S. Resnick,et al.  Detection of prodromal Alzheimer's disease via pattern classification of magnetic resonance imaging , 2008, Neurobiology of Aging.

[51]  Rainer Goebel,et al.  Combining multivariate voxel selection and support vector machines for mapping and classification of fMRI spatial patterns , 2008, NeuroImage.

[52]  Jean-Philippe Vert,et al.  Group lasso with overlap and graph lasso , 2009, ICML '09.

[53]  Xiaohui Xie,et al.  Split Bregman method for large scale fused Lasso , 2010, Comput. Stat. Data Anal..

[54]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[55]  Robert D. Nowak,et al.  Sparse Overlapping Sets Lasso for Multitask Learning and its Application to fMRI Analysis , 2013, NIPS.

[56]  Dinggang Shen,et al.  Morphological classification of brains via high-dimensional shape transformations and machine learning methods , 2004, NeuroImage.

[57]  H. Braak,et al.  Neuropathological stageing of Alzheimer-related changes , 2004, Acta Neuropathologica.

[58]  Yaakov Stern,et al.  Multivariate and univariate neuroimaging biomarkers of Alzheimer's disease , 2008, NeuroImage.