Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis

For the last decade, it has been shown that neuroimaging can be a potential tool for the diagnosis of Alzheimer's Disease (AD) and its prodromal stage, Mild Cognitive Impairment (MCI), and also fusion of different modalities can further provide the complementary information to enhance diagnostic accuracy. Here, we focus on the problems of both feature representation and fusion of multimodal information from Magnetic Resonance Imaging (MRI) and Positron Emission Tomography (PET). To our best knowledge, the previous methods in the literature mostly used hand-crafted features such as cortical thickness, gray matter densities from MRI, or voxel intensities from PET, and then combined these multimodal features by simply concatenating into a long vector or transforming into a higher-dimensional kernel space. In this paper, we propose a novel method for a high-level latent and shared feature representation from neuroimaging modalities via deep learning. Specifically, we use Deep Boltzmann Machine (DBM)(2), a deep network with a restricted Boltzmann machine as a building block, to find a latent hierarchical feature representation from a 3D patch, and then devise a systematic method for a joint feature representation from the paired patches of MRI and PET with a multimodal DBM. To validate the effectiveness of the proposed method, we performed experiments on ADNI dataset and compared with the state-of-the-art methods. In three binary classification problems of AD vs. healthy Normal Control (NC), MCI vs. NC, and MCI converter vs. MCI non-converter, we obtained the maximal accuracies of 95.35%, 85.67%, and 74.58%, respectively, outperforming the competing methods. By visual inspection of the trained model, we observed that the proposed method could hierarchically discover the complex latent patterns inherent in both MRI and PET.

[1]  Daoqiang Zhang,et al.  Hierarchical fusion of features and classifier decisions for Alzheimer's disease diagnosis , 2014, Human brain mapping.

[2]  Dinggang Shen,et al.  ABSORB: Atlas building by Self-Organized Registration and Bundling , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Dinggang Shen,et al.  HAMMER: hierarchical attribute matching mechanism for elastic registration , 2002, IEEE Transactions on Medical Imaging.

[4]  James T Becker,et al.  Mild cognitive impairment and alzheimer disease: patterns of altered cerebral blood flow at MR imaging. , 2009, Radiology.

[5]  Nassir Navab,et al.  Medical Image Computing and Computer-Assisted Intervention – MICCAI 2013 , 2013, Lecture Notes in Computer Science.

[6]  Honglak Lee,et al.  Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.

[7]  Stephen M. Smith,et al.  Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm , 2001, IEEE Transactions on Medical Imaging.

[8]  P. Scheltens,et al.  Medial temporal lobe atrophy predicts Alzheimer's disease in patients with minor cognitive impairment , 2002, Journal of neurology, neurosurgery, and psychiatry.

[9]  Geoffrey E. Hinton,et al.  An Efficient Learning Procedure for Deep Boltzmann Machines , 2012, Neural Computation.

[10]  Shu Liao,et al.  Representation Learning: A Unified Deep Learning Framework for Automatic Prostate MR Segmentation , 2013, MICCAI.

[11]  Klaus-Robert Müller,et al.  Deep Boltzmann Machines as Feed-Forward Hierarchies , 2012, AISTATS.

[12]  Ciprian Catana,et al.  PET/MRI for Neurologic Applications , 2012, The Journal of Nuclear Medicine.

[13]  Christos Davatzikos,et al.  Voxel-Based Morphometry Using the RAVENS Maps: Methods and Validation Using Simulated Longitudinal Atrophy , 2001, NeuroImage.

[14]  Luca Maria Gambardella,et al.  Mitosis Detection in Breast Cancer Histology Images with Deep Neural Networks , 2013, MICCAI.

[15]  Heinz-Peter Schlemmer,et al.  PET/MRI: Paving the Way for the Next Generation of Clinical Multimodality Imaging Applications , 2010, Journal of Nuclear Medicine.

[16]  Vince D. Calhoun,et al.  Restricted Boltzmann machines for neuroimaging: An application in identifying intrinsic networks , 2014, NeuroImage.

[17]  Jesse S. Jin,et al.  Identification of Conversion from Mild Cognitive Impairment to Alzheimer's Disease Using Multivariate Predictors , 2011, PloS one.

[18]  Daoqiang Zhang,et al.  Identification of MCI individuals using structural and functional connectivity networks , 2012, NeuroImage.

[19]  H. Möller,et al.  Functional connectivity of the fusiform gyrus during a face-matching task in subjects with mild cognitive impairment. , 2006, Brain : a journal of neurology.

[20]  Andy C. H. Lee,et al.  Differentiating the Roles of the Hippocampus and Perirhinal Cortex in Processes beyond Long-Term Declarative Memory: A Double Dissociation in Dementia , 2006, The Journal of Neuroscience.

[21]  Yoshua Bengio,et al.  Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[22]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[23]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[24]  Toshiyuki Tanaka,et al.  A Theory of Mean Field Approximation , 1998, NIPS.

[25]  J T O'Brien,et al.  Medial temporal lobe atrophy on MRI differentiates Alzheimer's disease from dementia with Lewy bodies and vascular cognitive impairment: a prospective study with pathological verification of diagnosis. , 2009, Brain : a journal of neurology.

[26]  Daoqiang Zhang,et al.  Multi-modal multi-task learning for joint prediction of multiple regression and classification variables in Alzheimer's disease , 2012, NeuroImage.

[27]  J. Trojanowski,et al.  Prediction of MCI to AD conversion, via MRI, CSF biomarkers, and pattern classification , 2011, Neurobiology of Aging.

[28]  A. McKinney,et al.  Automated MRI measures identify individuals with mild cognitive impairment and Alzheimer's disease , 2010 .

[29]  Dinggang Shen,et al.  Hierarchical Anatomical Brain Networks for MCI Prediction: Revisiting Volumetric Measures , 2011, PloS one.

[30]  Daoqiang Zhang,et al.  Multimodal classification of Alzheimer's disease and mild cognitive impairment , 2011, NeuroImage.

[31]  Wenbin Li,et al.  Enriched white matter connectivity networks for accurate identification of MCI patients , 2011, NeuroImage.

[32]  C. Jack,et al.  Boosting power for clinical trials using classifiers based on multiple biomarkers , 2010, Neurobiology of Aging.

[33]  K. Ishii,et al.  Voxel-based morphometric comparison between early- and late-onset mild Alzheimer's disease and assessment of diagnostic performance of z score images. , 2005, AJNR. American journal of neuroradiology.

[34]  Simon J. Doran,et al.  Stacked Autoencoders for Unsupervised Feature Learning and Multiple Organ Detection in a Pilot Study Using 4D Patient Data , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[36]  Dinggang Shen,et al.  Statistical representation of high-dimensional deformation fields with application to statistically constrained 3D warping , 2006, Medical Image Anal..

[37]  Karl J. Friston Functional and effective connectivity in neuroimaging: A synthesis , 1994 .

[38]  Dinggang Shen,et al.  COMPARE: Classification of Morphological Patterns Using Adaptive Regional Elements , 2007, IEEE Transactions on Medical Imaging.

[39]  A. Fagan,et al.  Multimodal techniques for diagnosis and prognosis of Alzheimer's disease , 2009, Nature.

[40]  Dinggang Shen,et al.  Discriminative Group Sparse Representation for Mild Cognitive Impairment Classification , 2013, MLMI.

[41]  A. Simmons,et al.  Combining MRI and CSF measures for classification of Alzheimer's disease and prediction of mild cognitive impairment conversion , 2011, Alzheimer's & Dementia.

[42]  M. Greicius,et al.  Default-mode network activity distinguishes Alzheimer's disease from healthy aging: Evidence from functional MRI , 2004, Proc. Natl. Acad. Sci. USA.

[43]  H. Arrighi,et al.  Mild cognitive impairment: Disparity of incidence and prevalence estimates , 2012, Alzheimer's & Dementia.

[44]  Nitish Srivastava,et al.  Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..

[45]  Geoffrey E. Hinton,et al.  Implicit Mixtures of Restricted Boltzmann Machines , 2008, NIPS.

[46]  Geoffrey E. Hinton,et al.  Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[47]  Bill Wilson,et al.  Intensity of dementia through latent variable modelling (I-DeLV) in the AIBL cohort , 2012, Alzheimer's & Dementia.

[48]  Daoqiang Zhang,et al.  Predicting Future Clinical Changes of MCI Patients Using Longitudinal and Multimodal Biomarkers , 2012, PloS one.

[49]  L. Mosconi,et al.  Brain glucose metabolism in the early and specific diagnosis of Alzheimer’s disease , 2005, European Journal of Nuclear Medicine and Molecular Imaging.

[50]  Ranjan Duara,et al.  An investigation of PreMCI: Subtypes and longitudinal outcomes , 2012, Alzheimer's & Dementia.

[51]  I. Veer,et al.  Strongly reduced volumes of putamen and thalamus in Alzheimer's disease: an MRI study , 2008, Brain : a journal of neurology.

[52]  Yoshua Bengio,et al.  Classification using discriminative restricted Boltzmann machines , 2008, ICML '08.

[53]  Dinggang Shen,et al.  RABBIT: Rapid alignment of brains by building intermediate templates , 2009, NeuroImage.

[54]  B. Långström,et al.  The use of PET in Alzheimer disease , 2010, Nature Reviews Neurology.

[55]  R. Mayeux,et al.  Hippocampal and entorhinal atrophy in mild cognitive impairment , 2007, Neurology.

[56]  Vikas Singh,et al.  Predictive markers for AD in a multi-modality framework: An analysis of MCI progression in the ADNI population , 2011, NeuroImage.

[57]  M. Sitskoorn,et al.  Are subjective cognitive complaints relevant in preclinical Alzheimer’s Dementia? A review and guidelines for healthcare professional , 2013 .

[58]  A. Dale,et al.  Combining MR Imaging, Positron-Emission Tomography, and CSF Biomarkers in the Diagnosis and Prognosis of Alzheimer Disease , 2010, American Journal of Neuroradiology.

[59]  J. Baron,et al.  In Vivo Mapping of Gray Matter Loss with Voxel-Based Morphometry in Mild Alzheimer's Disease , 2001, NeuroImage.

[60]  D. Shen,et al.  Discriminant analysis of longitudinal cortical thickness changes in Alzheimer's disease using dynamic and network features , 2012, Neurobiology of Aging.

[61]  William M. Wells,et al.  Medical Image Computing and Computer-Assisted Intervention — MICCAI’98 , 1998, Lecture Notes in Computer Science.

[62]  H. Braak,et al.  Neuropathological stageing of Alzheimer-related changes , 2004, Acta Neuropathologica.

[63]  W. Thies,et al.  2008 Alzheimer’s disease facts and figures , 2008, Alzheimer's & Dementia.

[64]  Alan C. Evans,et al.  3D Anatomical Atlas of the Human Brain , 1998, NeuroImage.

[65]  Marie Chupin,et al.  Automatic classi fi cation of patients with Alzheimer ' s disease from structural MRI : A comparison of ten methods using the ADNI database , 2010 .

[66]  Dinggang Shen,et al.  Multivariate examination of brain abnormality using both structural and functional MRI , 2007, NeuroImage.

[67]  Dinggang Shen,et al.  Knowledge-Guided Robust MRI Brain Extraction for Diverse Large-Scale Neuroimaging Studies on Humans and Non-Human Primates , 2014, PloS one.

[68]  Dinggang Shen,et al.  Diffusion Tensor Image Registration Using Tensor Geometry and Orientation Features , 2008, MICCAI.

[69]  Alan C. Evans,et al.  A nonparametric method for automatic correction of intensity nonuniformity in MRI data , 1998, IEEE Transactions on Medical Imaging.

[70]  Dinggang Shen,et al.  Deep Learning-Based Feature Representation for AD/MCI Classification , 2013, MICCAI.

[71]  Geoffrey E. Hinton,et al.  Deep Boltzmann Machines , 2009, AISTATS.

[72]  Daoqiang Zhang,et al.  Ensemble sparse classification of Alzheimer's disease , 2012, NeuroImage.

[73]  Andreas Krause,et al.  Advances in Neural Information Processing Systems (NIPS) , 2014 .

[74]  Paul M. Thompson,et al.  Multi-source feature learning for joint analysis of incomplete multiple heterogeneous neuroimaging data , 2012, NeuroImage.

[75]  C. Jack,et al.  Prediction of conversion from mild cognitive impairment to Alzheimer's disease dementia based upon biomarkers and neuropsychological test performance , 2012, Neurobiology of Aging.

[76]  John-Dylan Haynes,et al.  Multi-scale classification of disease using structural MRI and wavelet transform , 2012, NeuroImage.

[77]  C. Jack,et al.  Alzheimer's Disease Neuroimaging Initiative , 2008 .

[78]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[79]  N. Schuff,et al.  Pattern of cerebral hypoperfusion in Alzheimer disease and mild cognitive impairment measured with arterial spin-labeling MR imaging: initial experience. , 2005, Radiology.

[80]  S. Gauthier,et al.  Training-related brain plasticity in subjects at risk of developing Alzheimer's disease. , 2011, Brain : a journal of neurology.

[81]  Arthur W. Toga,et al.  A wavelet-based statistical analysis of fMRI data , 2007, Neuroinformatics.

[82]  Alan C. Evans,et al.  Spatial patterns of cortical thinning in mild cognitive impairment and Alzheimer's disease. , 2006, Brain : a journal of neurology.

[83]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..