Benchmarking machine learning models for late-onset alzheimer’s disease prediction from genomic data

BackgroundLate-Onset Alzheimer’s Disease (LOAD) is a leading form of dementia. There is no effective cure for LOAD, leaving the treatment efforts to depend on preventive cognitive therapies, which stand to benefit from the timely estimation of the risk of developing the disease. Fortunately, a growing number of Machine Learning methods that are well positioned to address this challenge are becoming available.ResultsWe conducted systematic comparisons of representative Machine Learning models for predicting LOAD from genetic variation data provided by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort. Our experimental results demonstrate that the classification performance of the best models tested yielded ∼72% of area under the ROC curve.ConclusionsMachine learning models are promising alternatives for estimating the genetic risk of LOAD. Systematic machine learning model selection also provides the opportunity to identify new genetic markers potentially associated with the disease.

[1]  Melissa Wake,et al.  Machine Learning SNP Based Prediction for Precision Medicine , 2019, Front. Genet..

[2]  R. Green,et al.  Genetic studies of quantitative MCI and AD phenotypes in ADNI: Progress, opportunities, and plans , 2015, Alzheimer's & Dementia.

[3]  Sterling C. Johnson,et al.  Predicting Alzheimer’s disease progression using multi-modal deep learning approach , 2019, Scientific Reports.

[4]  Beatriz López,et al.  Single Nucleotide Polymorphism relevance learning with Random Forests for Type 2 diabetes risk prediction , 2017, Artif. Intell. Medicine.

[5]  D. Selkoe Alzheimer's disease. , 2011, Cold Spring Harbor perspectives in biology.

[6]  M. Prince,et al.  Epidemiology of dementias and Alzheimer's disease. , 2012, Archives of medical research.

[7]  José G. Tamez-Peña,et al.  Improved multimodal biomarkers for Alzheimer's disease and mild cognitive impairment diagnosis: data from ADNI , 2013, Medical Imaging.

[8]  Rongling Li,et al.  Quality Control Procedures for Genome‐Wide Association Studies , 2011, Current protocols in human genetics.

[9]  Anne Corbett,et al.  Alzheimer's disease , 2011, The Lancet.

[10]  Xiaomin Wang,et al.  Genetics of Alzheimer’s disease: From pathogenesis to clinical usage , 2017, Journal of Clinical Neuroscience.

[11]  Victor Treviño,et al.  Magnetization-prepared rapid acquisition with gradient echo magnetic resonance imaging signal and texture features for the prediction of mild cognitive impairment to Alzheimer’s disease progression , 2014, Journal of medical imaging.

[12]  H. Hakonarson,et al.  Large sample size, wide variant spectrum, and advanced machine-learning technique boost risk prediction for inflammatory bowel disease. , 2013, American journal of human genetics.

[13]  Li Shen,et al.  Genome-wide association and interaction studies of CSF T-tau/Aβ42 ratio in ADNI cohort , 2017, Neurobiology of Aging.

[14]  Jason H. Moore,et al.  Alzheimer's Disease Neuroimaging Initiative biomarkers as quantitative phenotypes: Genetics core aims, progress, and plans , 2010, Alzheimer's & Dementia.

[15]  Nick C Fox,et al.  Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease , 2013, Nature Genetics.

[16]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[17]  M. Nöthen,et al.  Exploring Genetic Associations of Alzheimer’s Disease Loci With Mild Cognitive Impairment Neurocognitive Endophenotypes , 2018, Front. Aging Neurosci..

[18]  M. Ergun,et al.  Analysis of genetics and risk factors of Alzheimer’s Disease , 2016, Neuroscience.

[19]  M. Glymour,et al.  Prediction to prevention in Alzheimer's disease and dementia , 2018, The Lancet Neurology.

[20]  W. M. van der Flier,et al.  Genome-wide significant risk factors for Alzheimer's disease: role in progression to dementia due to Alzheimer's disease among subjects with mild cognitive impairment , 2016, Molecular Psychiatry.

[21]  N. Fitz,et al.  The Role of APOE and TREM2 in Alzheimer′s Disease—Current Understanding and Perspectives , 2018, International journal of molecular sciences.

[22]  G. Tosto,et al.  Genetics of Alzheimer’s Disease: the Importance of Polygenic and Epistatic Components , 2017, Current Neurology and Neuroscience Reports.

[23]  A. Dale,et al.  Combining MR Imaging, Positron-Emission Tomography, and CSF Biomarkers in the Diagnosis and Prognosis of Alzheimer Disease , 2010, American Journal of Neuroradiology.

[24]  H. Stefánsson,et al.  Meta-analysis of Alzheimer’s disease on 9,751 samples from Norway and IGAP study identifies four risk loci , 2018, Scientific Reports.

[25]  K. Rhodes,et al.  The antibody aducanumab reduces Aβ plaques in Alzheimer’s disease , 2016, Nature.

[26]  Yang I Li,et al.  An Expanded View of Complex Traits: From Polygenic to Omnigenic , 2017, Cell.

[27]  Dhiya Al-Jumeily,et al.  Deep Learning Classification of Polygenic Obesity using Genome Wide Association Study SNPs , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[28]  A. Alexiou,et al.  A Bayesian Model for the Prediction and Early Diagnosis of Alzheimer's Disease , 2017, Front. Aging Neurosci..

[29]  for the Alzheimer’s Disease Neuroimaging Initiative Predicting Alzheimer’s disease progression using multi-modal deep learning approach , 2019 .

[30]  Olivier Rouaud,et al.  APP, PSEN1, and PSEN2 mutations in early-onset Alzheimer disease: A genetic screening study of familial and sporadic cases , 2017, PLoS medicine.

[31]  Sudha Seshadri,et al.  Genome-wide analysis of genetic loci associated with Alzheimer disease. , 2010, JAMA.