Multi-Hypergraph Learning for Incomplete Multimodality Data

Multi-modality data convey complementary information that can be used to improve the accuracy of prediction models in disease diagnosis. However, effectively integrating multi-modality data remains a challenging problem, especially when the data are incomplete. For instance, more than half of the subjects in the Alzheimer's disease neuroimaging initiative (ADNI) database have no fluorodeoxyglucose positron emission tomography and cerebrospinal fluid data. Currently, there are two commonly used strategies to handle the problem of incomplete data: 1) discard samples having missing features; and 2) impute those missing values via specific techniques. In the first case, a significant amount of useful information is lost and, in the second case, additional noise and artifacts might be introduced into the data. Also, previous studies generally focus on the pairwise relationships among subjects, without considering their underlying complex (e.g., high-order) relationships. To address these issues, in this paper, we propose a multi-hypergraph learning method for dealing with incomplete multimodality data. Specifically, we first construct multiple hypergraphs to represent the high-order relationships among subjects by dividing them into several groups according to the availability of their data modalities. A hypergraph regularized transductive learning method is then applied to these groups for automatic diagnosis of brain diseases. Extensive evaluation of the proposed method using all subjects in the baseline ADNI database indicates that our method achieves promising results in AD/MCI classification, compared with the state-of-the-art methods.

[1]  R. Fletcher,et al.  Clinical Epidemiology: The Essentials , 1982 .

[2]  Stefan Klein,et al.  Feature Selection Based on the SVM Weight Vector for Classification of Dementia , 2015, IEEE Journal of Biomedical and Health Informatics.

[3]  et al.,et al.  Discrimination between Alzheimer Dementia and Controls by Automated Analysis of Multicenter FDG PET , 2002, NeuroImage.

[4]  J. Baron,et al.  Mild cognitive impairment , 2003, Neurology.

[5]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[6]  Alan C. Evans,et al.  A nonparametric method for automatic correction of intensity nonuniformity in MRI data , 1998, IEEE Transactions on Medical Imaging.

[7]  刘明霞 View-centralized multi-atlas classification for Alzheimer's disease diagnosis , 2015 .

[8]  C. DeCarli,et al.  FDG-PET improves accuracy in distinguishing frontotemporal dementia and Alzheimer's disease. , 2007, Brain : a journal of neurology.

[9]  Martine D. F. Schlag,et al.  Multi-level spectral hypergraph partitioning with arbitrary vertex sizes , 1996, Proceedings of International Conference on Computer Aided Design.

[10]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[11]  Paul M. Thompson,et al.  Multi-source feature learning for joint analysis of incomplete multiple heterogeneous neuroimaging data , 2012, NeuroImage.

[12]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[13]  Stephen M. Smith,et al.  Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm , 2001, IEEE Transactions on Medical Imaging.

[14]  Luke Bloy,et al.  Using Multiparametric Data with Missing Features for Learning Patterns of Pathology , 2012, MICCAI.

[15]  Bernhard Schölkopf,et al.  Learning with Hypergraphs: Clustering, Classification, and Embedding , 2006, NIPS.

[16]  Dinggang Shen,et al.  A Robust Deep Model for Improved Classification of AD/MCI Patients , 2015, IEEE Journal of Biomedical and Health Informatics.

[17]  Jianping Yin,et al.  Multiple Kernel Learning in the Primal for Multimodal Alzheimer’s Disease Classification , 2013, IEEE Journal of Biomedical and Health Informatics.

[18]  Yaozong Gao,et al.  Detecting Anatomical Landmarks for Fast Alzheimer’s Disease Diagnosis , 2016, IEEE Transactions on Medical Imaging.

[19]  B. Scholkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[20]  Trevor Hastie,et al.  Imputing Missing Data for Gene Expression Arrays , 2001 .

[21]  Thierry Denoeux,et al.  Selecting radiomic features from FDG-PET images for cancer treatment outcome prediction , 2016, Medical Image Anal..

[22]  Yue Gao,et al.  3-D Object Retrieval and Recognition With Hypergraph Analysis , 2012, IEEE Transactions on Image Processing.

[23]  Yaozong Gao,et al.  Dual‐core steered non‐rigid registration for multi‐modal images via bi‐directional image synthesis , 2017, Medical Image Anal..

[24]  Marie Chupin,et al.  Automatic classi fi cation of patients with Alzheimer ' s disease from structural MRI : A comparison of ten methods using the ADNI database , 2010 .

[25]  K. Blennow,et al.  Association between CSF biomarkers and incipient Alzheimer's disease in patients with mild cognitive impairment: a follow-up study , 2006, The Lancet Neurology.

[26]  Kathryn Ziegler-Graham,et al.  Forecasting the global burden of Alzheimer’s disease , 2007, Alzheimer's & Dementia.

[27]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[28]  Claude Berge,et al.  Graphs and Hypergraphs , 2021, Clustering.

[29]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2009, Found. Comput. Math..

[30]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[31]  D. Rueckert,et al.  Multi-Method Analysis of MRI Images in Early Diagnostics of Alzheimer's Disease , 2011, PloS one.

[32]  J. Rodríguez On the Laplacian Spectrum and Walk-regular Hypergraphs , 2003 .

[33]  Marianna Bolla,et al.  Spectra, Euclidean representations and clusterings of hypergraphs , 1993, Discret. Math..

[34]  T. Schneider Analysis of Incomplete Climate Data: Estimation of Mean Values and Covariance Matrices and Imputation of Missing Values. , 2001 .

[35]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[36]  Dinggang Shen,et al.  View‐aligned hypergraph learning for Alzheimer's disease diagnosis with incomplete multi‐modality data , 2017, Medical Image Anal..

[37]  Jun Zhang,et al.  Detecting Anatomical Landmarks From Limited Medical Imaging Data Using Two-Stage Task-Oriented Deep Neural Networks , 2017, IEEE Transactions on Image Processing.

[38]  Gene H. Golub,et al.  Singular value decomposition and least squares solutions , 1970, Milestones in Matrix Computation.

[39]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[40]  Dinggang Shen,et al.  HAMMER: hierarchical attribute matching mechanism for elastic registration , 2002, IEEE Transactions on Medical Imaging.

[41]  Dinggang Shen,et al.  Neurodegenerative disease diagnosis using incomplete multi-modality data via matrix shrinkage and completion , 2014, NeuroImage.

[42]  Paul M. Thompson,et al.  Bi-level multi-source learning for heterogeneous block-wise missing data , 2014, NeuroImage.

[43]  Robert P. Goldman,et al.  Imputation of Missing Data Using Machine Learning Techniques , 1996, KDD.

[44]  Yaozong Gao,et al.  Landmark-Based Alzheimer's Disease Diagnosis Using Longitudinal Structural MR Images , 2016, MCV/BAMBI@MICCAI.

[45]  P. Tariot,et al.  Alzheimer's prevention initiative: a proposal to evaluate presymptomatic treatments as quickly as possible. , 2010, Biomarkers in medicine.

[46]  Nick C Fox,et al.  The Alzheimer's disease neuroimaging initiative (ADNI): MRI methods , 2008, Journal of magnetic resonance imaging : JMRI.

[47]  Serge J. Belongie,et al.  Higher order learning with graphs , 2006, ICML.

[48]  Fei Wang,et al.  Label Propagation through Linear Neighborhoods , 2008, IEEE Trans. Knowl. Data Eng..

[49]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[50]  Jure Leskovec,et al.  Higher-order organization of complex networks , 2016, Science.

[51]  Mikio Shoji,et al.  Age-Dependent Changes in Brain, CSF, and Plasma Amyloid β Protein in the Tg2576 Transgenic Mouse Model of Alzheimer's Disease , 2001, The Journal of Neuroscience.