Sparse shared structure based multi-task learning for MRI based cognitive performance prediction of Alzheimer's disease

A mixed sparse shared-structure multi-task learning formulation is proposed. The formulation can be applied to regression, classification, or clustering. An efficient optimization algorithm is derived to solve the non-smooth formulation. Experimental results demonstrate significant performance improvements over existing methods. The method is able to discover biomarkers relevant to cognitive performance and to fuse multi-modality data.

Alzheimer's disease (AD), the most common form of dementia, not only causes progressive impairment of memory and other cognitive functions, but also places a substantial financial burden on the health care system. There is thus an urgent need to (1) accurately predict the cognitive performance of patients with the disease, and (2) identify the MRI-related biomarkers most predictive of cognitive outcomes. In this paper, we develop a novel multi-task learning formulation that exploits the correlation between Magnetic Resonance Imaging (MRI) measures and cognitive measures. A mixed norm combines hierarchical group sparsity with a shared-subspace-uncovering regularization, so that a shared structure is learned from multiple related tasks while simultaneously considering an implicit shared subspace and explicit subsets of features and Regions of Interest (ROIs). An efficient alternating optimization algorithm is derived to solve the proposed non-convex and non-smooth objective. We comprehensively evaluate the proposed algorithm for cognitive outcome prediction on all subjects from the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. The experimental results show that the proposed method not only outperforms multiple state-of-the-art comparable approaches, but also identifies cognition-relevant MRI biomarkers that are consistent with prior knowledge.
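The abstract does not spell out the objective or the update rules, so the following is only an illustrative sketch of the kind of building block a hierarchical group-sparse penalty typically relies on, not the authors' algorithm. It assumes a weight matrix `W` with one row per MRI feature and one column per cognitive task, an l2,1 penalty over rows (feature level), and a Frobenius-norm penalty over disjoint blocks of rows (ROI level). The function names, the `groups` argument, and the threshold `tau` are all hypothetical.

```python
import numpy as np

def prox_l21(W, tau):
    """Row-wise shrinkage: proximal operator of tau * sum_i ||W[i, :]||_2.

    Rows whose l2 norm is at most tau are set to zero, which removes the
    corresponding feature from every task at once.
    """
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    scale = np.maximum(0.0, 1.0 - tau / np.maximum(norms, 1e-12))
    return W * scale

def prox_group(W, groups, tau):
    """Block shrinkage: proximal operator of tau * sum_g ||W[g, :]||_F.

    `groups` is an assumed list of disjoint row-index lists, one per ROI.
    """
    out = W.copy()
    for idx in groups:
        block = W[idx, :]
        norm = np.linalg.norm(block)  # Frobenius norm of the block
        out[idx, :] = block * max(0.0, 1.0 - tau / max(norm, 1e-12))
    return out

# Toy example: 3 features, 2 tasks, two hypothetical ROIs.
W = np.array([[3.0, 4.0],
              [0.3, 0.4],
              [0.0, 0.0]])
W_rows = prox_l21(W, tau=1.0)                      # feature-level sparsity
W_hier = prox_group(W_rows, [[0, 1], [2]], tau=1.0)  # then ROI-level sparsity
```

In this toy case the second feature (row norm 0.5) falls below the threshold and is zeroed for both tasks, while the first is only shrunk. Applying the feature-level step before the ROI-level step is one common way to handle nested groups; whether the paper uses this ordering is not stated in the abstract.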
