Machine learning for modeling the progression of Alzheimer disease dementia using clinical data: a systematic literature review

Abstract

Objective: Alzheimer disease (AD) is the most common cause of dementia, a syndrome characterized by cognitive impairment severe enough to interfere with activities of daily life. We aimed to conduct a systematic literature review (SLR) of studies that applied machine learning (ML) methods to clinical data derived from electronic health records in order to model risk for progression of AD dementia.

Materials and Methods: We searched for articles published between January 1, 2010, and May 31, 2020, in PubMed, Scopus, ScienceDirect, IEEE Xplore Digital Library, Association for Computing Machinery Digital Library, and arXiv. We used predefined criteria to select relevant articles and summarized them according to key components of ML analysis such as data characteristics, computational algorithms, and research focus.

Results: There has been a considerable rise over the past 5 years in the number of research papers using ML-based analysis for AD dementia modeling. We reviewed 64 relevant articles in our SLR. The results suggest that the majority of existing research has focused on predicting progression of AD dementia using publicly available datasets containing both neuroimaging and clinical data (neurobehavioral status exam scores, patient demographics, neuroimaging data, and laboratory test values).

Discussion: Identifying individuals at risk for progression of AD dementia could help to personalize disease management and plan future care. Clinical data consisting of both structured data tables and clinical notes can be effectively used in ML-based approaches to model risk for AD dementia progression. Data sharing and reproducibility of results can enhance the impact, adaptation, and generalizability of this research.
