Longitudinal clinical score prediction in Alzheimer's disease with soft-split sparse regression based random forest

Alzheimer's disease (AD) is an irreversible neurodegenerative disease and affects a large population in the world. Cognitive scores at multiple time points can be reliably used to evaluate the progression of the disease clinically. In recent studies, machine learning techniques have shown promising results on the prediction of AD clinical scores. However, there are multiple limitations in the current models such as linearity assumption and missing data exclusion. Here, we present a nonlinear supervised sparse regression-based random forest (RF) framework to predict a variety of longitudinal AD clinical scores. Furthermore, we propose a soft-split technique to assign probabilistic paths to a test sample in RF for more accurate predictions. In order to benefit from the longitudinal scores in the study, unlike the previous studies that often removed the subjects with missing scores, we first estimate those missing scores with our proposed soft-split sparse regression-based RF and then utilize those estimated longitudinal scores at all the previous time points to predict the scores at the next time point. The experiment results demonstrate that our proposed method is superior to the traditional RF and outperforms other state-of-art regression models. Our method can also be extended to be a general regression framework to predict other disease scores.

[1]  J. Shotton,et al.  Decision Forests for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning , 2011 .

[2]  Dinggang Shen,et al.  HAMMER: hierarchical attribute matching mechanism for elastic registration , 2002, IEEE Transactions on Medical Imaging.

[3]  Dinggang Shen,et al.  Identification of Alzheimer's Disease Using Incomplete Multimodal Dataset via Matrix Shrinkage and Completion , 2013, MLMI.

[4]  Yaozong Gao,et al.  Automatic labeling of MR brain images by hierarchical learning of atlas forests. , 2016, Medical physics.

[5]  Chris Frost,et al.  Differential regional atrophy of the cingulate gyrus in Alzheimer disease: a volumetric MRI study. , 2005, Cerebral cortex.

[6]  Michael W. Weiner,et al.  Crowdsourced estimation of cognitive decline and resilience in Alzheimer's disease , 2016, Alzheimer's & Dementia.

[7]  P. Scheltens,et al.  Medial temporal lobe atrophy predicts Alzheimer's disease in patients with minor cognitive impairment , 2002, Journal of neurology, neurosurgery, and psychiatry.

[8]  Dinggang Shen,et al.  Neurodegenerative disease diagnosis using incomplete multi-modality data via matrix shrinkage and completion , 2014, NeuroImage.

[9]  David L. Dowe,et al.  Decision Forests with Oblique Decision Trees , 2006, MICAI.

[10]  Clifford R. Jack,et al.  Predicting Clinical Scores from Magnetic Resonance Scans in Alzheimer's Disease , 2010, NeuroImage.

[11]  Paul M. Thompson,et al.  Multiple Stages Classification of Alzheimer’s Disease Based on Structural Brain Networks Using Generalized Low Rank Approximations (GLRAM) , 2014, MICCAI 2014.

[12]  I. Veer,et al.  Strongly reduced volumes of putamen and thalamus in Alzheimer's disease: an MRI study , 2008, Brain : a journal of neurology.

[13]  Xiaofeng Zhu,et al.  A novel matrix-similarity based loss function for joint regression and classification in AD diagnosis , 2014, NeuroImage.

[14]  Daoqiang Zhang,et al.  Predicting Future Clinical Changes of MCI Patients Using Longitudinal and Multimodal Biomarkers , 2012, PloS one.

[15]  M. Abramowitz,et al.  Handbook of Mathematical Functions With Formulas, Graphs and Mathematical Tables (National Bureau of Standards Applied Mathematics Series No. 55) , 1965 .

[16]  A. Simmons,et al.  AddNeuroMed—The European Collaboration for the Discovery of Novel Biomarkers for Alzheimer's Disease , 2009, Annals of the New York Academy of Sciences.

[17]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[18]  Ullrich Köthe,et al.  On Oblique Random Forests , 2011, ECML/PKDD.

[19]  Vladimir Fonov,et al.  Standardized evaluation of algorithms for computer-aided diagnosis of dementia based on structural MRI: The CADDementia challenge , 2015, NeuroImage.

[20]  Johan H. C. Reiber,et al.  MMSE scores correlate with local ventricular enlargement in the spectrum from cognitively normal to Alzheimer disease , 2008, NeuroImage.

[21]  Yaozong Gao,et al.  Estimating CT Image From MRI Data Using Structured Random Forest and Auto-Context Model , 2016, IEEE Transactions on Medical Imaging.

[22]  Dinggang Shen,et al.  Affine-invariant image retrieval by correspondence matching of shapes , 1999, Image Vis. Comput..

[23]  Tianzi Jiang,et al.  Regional coherence changes in the early stages of Alzheimer’s disease: A combined structural and resting-state functional MRI study , 2007, NeuroImage.

[24]  Yaozong Gao,et al.  LINKS: Learning-based multi-source IntegratioN frameworK for Segmentation of infant brain images , 2015, NeuroImage.

[25]  Dinggang Shen,et al.  Simulating deformations of MR brain images for validation of atlas-based segmentation and registration algorithms , 2006, NeuroImage.

[26]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[27]  Cassandra D. Leonardo,et al.  Comparison of nine tractography algorithms for detecting abnormal structural brain networks in Alzheimer’s disease , 2015, Front. Aging Neurosci..

[28]  Nick C Fox,et al.  Mapping the evolution of regional atrophy in Alzheimer's disease: Unbiased analysis of fluid-registered serial MRI , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[29]  D. Louis Collins,et al.  Relating one-year cognitive change in mild cognitive impairment to baseline MRI features , 2009, NeuroImage.

[30]  D. Selkoe Alzheimer's disease. , 2011, Cold Spring Harbor perspectives in biology.

[31]  K. Krishnan,et al.  The Alzheimer's disease assessment scale , 1997, Neurology.

[32]  Yaozong Gao,et al.  Learning of Atlas Forest Hierarchy for Automatic Labeling of MR Brain Images , 2014, MLMI.

[33]  Dinggang Shen,et al.  Structured Sparse Kernel Learning for Imaging Genetics Based Alzheimer's Disease Diagnosis , 2016, MICCAI.

[34]  Kim-Han Thung,et al.  Joint Diagnosis and Conversion Time Prediction of Progressive Mild Cognitive Impairment (pMCI) Using Low-Rank Subspace Clustering and Matrix Completion , 2015, MICCAI.

[35]  Paul M. Thompson,et al.  Voxelwise Spectral Diffusional Connectivity and Its Applications to Alzheimer's Disease and Intelligence Prediction , 2013, MICCAI.

[36]  Martin Klein,et al.  Precuneus atrophy in early-onset Alzheimer’s disease: a morphometric structural MRI study , 2007, Neuroradiology.

[37]  Ana Solodkin,et al.  Pathology of the Insular Cortex in Alzheimer Disease Depends on Cortical Architecture , 2005, Journal of neuropathology and experimental neurology.

[38]  S. Folstein,et al.  "Mini-mental state". A practical method for grading the cognitive state of patients for the clinician. , 1975, Journal of psychiatric research.

[39]  Paul M. Thompson,et al.  Automated multi-atlas labeling of the fornix and its integrity in alzheimer's disease , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[40]  Ying Wang,et al.  High-dimensional Pattern Regression Using Machine Learning: from Medical Images to Continuous Clinical Variables However, Support Vector Regression Has Some Disadvantages That Become Especially , 2022 .

[41]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[42]  Dinggang Shen,et al.  Multilevel Deficiency of White Matter Connectivity Networks in Alzheimer's Disease: A Diffusion MRI Study with DTI and HARDI Models , 2016, Neural plasticity.

[43]  Amity E. Green,et al.  Automated 3D mapping of hippocampal atrophy and its clinical correlates in 400 subjects with Alzheimer's disease, mild cognitive impairment, and elderly controls , 2009, Human brain mapping.

[44]  Dinggang Shen,et al.  Multi-view Classification for Identification of Alzheimer's Disease , 2015, MLMI.

[45]  Kiralee M. Hayashi,et al.  3D Mapping of Mini-mental State Examination Performance in Clinical and Preclinical Alzheimer Disease , 2006, Alzheimer disease and associated disorders.

[46]  Daoqiang Zhang,et al.  Multi-modal multi-task learning for joint prediction of multiple regression and classification variables in Alzheimer's disease , 2012, NeuroImage.

[47]  Jiayu Zhou,et al.  Modeling disease progression via multi-task learning , 2013, NeuroImage.

[48]  Dinggang Shen,et al.  Abnormal Changes of Brain Cortical Anatomy and the Association with Plasma MicroRNA107 Level in Amnestic Mild Cognitive Impairment , 2016, Front. Aging Neurosci..

[49]  Sabina Sonia Tangaro,et al.  Integrating longitudinal information in hippocampal volume measurements for the early detection of Alzheimer's disease , 2016, NeuroImage.

[50]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[51]  J. Morris The Clinical Dementia Rating (CDR) , 1993, Neurology.

[52]  D. Shen,et al.  Identification of progressive mild cognitive impairment patients using incomplete longitudinal MRI scans , 2015, Brain Structure and Function.

[53]  K. Davis,et al.  A new rating scale for Alzheimer's disease. , 1984, The American journal of psychiatry.

[54]  Dinggang Shen,et al.  Joint estimation of multiple clinical variables of neurological diseases from imaging patterns , 2010, 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[55]  Zhuowen Tu,et al.  Probabilistic boosting-tree: learning discriminative models for classification, recognition, and clustering , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[56]  Sid E O'Bryant,et al.  Staging dementia using Clinical Dementia Rating Scale Sum of Boxes scores: a Texas Alzheimer's research consortium study. , 2008, Archives of neurology.

[57]  Chris H. Q. Ding,et al.  K-means clustering via principal component analysis , 2004, ICML.