A metabolite-based machine learning approach to diagnose Alzheimer-type dementia in blood: Results from the European Medical Information Framework for Alzheimer disease biomarker discovery cohort

Machine learning (ML) may harbor the potential to capture the metabolic complexity in Alzheimer’s Disease (AD). Here we set out to test the performance of metabolites in blood to categorise AD when compared to CSF biomarkers.This study analysed samples from 242 cognitively normal (CN) people and 115 with AD-type dementia utilizing plasma metabolites (n=883). Deep Learning (DL), Extreme Gradient Boosting (XGBoost) and Random Forest (RF) were used to differentiate AD from CN. These models were internally validated using Nested Cross Validation (NCV).On the test data, DL produced the AUC of 0.85 (0.80-0.89), XGBoost produced 0.88 (0.86-0.89) and RF produced 0.85 (0.83-0.87). By comparison, CSF measures of amyloid, p-tau and t-tau (together with age and gender) produced with XGBoost the AUC values of 0.78, 0.83 and 0.87, respectively.This study showed that plasma metabolites have the potential to match the AUC of well-established AD CSF biomarkers in a relatively small cohort. Further studies in independent cohorts are needed to validate whether this specific panel of blood metabolites can separate AD from controls, and how specific it is for AD as compared with other neurodegenerative disorders

[1]  Daniel Stahl,et al.  Identifying psychosis spectrum disorder from experience sampling data using machine learning approaches , 2019, Schizophrenia Research.

[2]  K. Blennow,et al.  Primary fatty amides in plasma associated with brain amyloid burden, hippocampal volume, and memory in the European Medical Information Framework for Alzheimer’s Disease biomarker discovery cohort , 2019, Alzheimer's & Dementia.

[3]  Stephen Todd,et al.  A practical computerized decision support system for predicting the severity of Alzheimer’s disease of an individual , 2019, bioRxiv.

[4]  Fionn Murtagh,et al.  A Machine Learning Framework for Predicting Dementia and Mild Cognitive Impairment , 2018, 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA).

[5]  R. Dobson,et al.  The EMIF-AD Multimodal Biomarker Discovery study: design, methods and cohort characteristics , 2018, Alzheimer's Research & Therapy.

[6]  E. Green,et al.  The Influence of Nicotinamide on Health and Disease in the Central Nervous System , 2018, International journal of tryptophan research : IJTR.

[7]  David S. Wishart,et al.  MetaboAnalyst 4.0: towards more transparent and integrative metabolomics analysis , 2018, Nucleic Acids Res..

[8]  Xianlin Han,et al.  Altered bile acid profile associates with cognitive impairment in Alzheimer's disease—An emerging role for gut microbiome , 2018, Alzheimer's & Dementia.

[9]  A. Saykin,et al.  Brain and blood metabolite signatures of pathology and progression in Alzheimer disease: A targeted metabolomics study , 2018, PLoS medicine.

[10]  L. Ellegård,et al.  Metabolic profiles from two different breakfast meals characterized by 1H NMR-based metabolomics. , 2017, Food chemistry.

[11]  Michael W. Weiner,et al.  Metabolic network failures in Alzheimer's disease: A biochemical road map , 2017, Alzheimer's & Dementia.

[12]  Michelle K. Lupton,et al.  Association of blood lipids with Alzheimer's disease: A comprehensive lipidomics analysis , 2017, Alzheimer's & Dementia.

[13]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[14]  G. Reiser,et al.  Brain Lipotoxicity of Phytanic Acid and Very Long-chain Fatty Acids. Harmful Cellular/Mitochondrial Activities in Refsum Disease and X-Linked Adrenoleukodystrophy , 2016, Aging and disease.

[15]  P. Sham,et al.  Plasma lipidomics analysis finds long chain cholesteryl esters to be associated with Alzheimer's disease , 2015, Translational Psychiatry.

[16]  Max Kuhn,et al.  Applied Predictive Modeling , 2013 .

[17]  C. Legido-Quigley,et al.  Current strategies in the discovery of small-molecule biomarkers for Alzheimer's disease. , 2011, Bioanalysis.

[18]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[19]  Xianlin Han,et al.  Plasmalogen deficiency in early Alzheimer's disease subjects and in animal models: molecular characterization using electrospray ionization mass spectrometry , 2001, Journal of neurochemistry.

[20]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[21]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[22]  O. Fiehn Metabolomics – the link between genotypes and phenotypes , 2004, Plant Molecular Biology.

[23]  Trevor Hastie,et al.  The elements of statistical learning. 2001 , 2001 .