A Machine Learning Method for Identifying Critical Interactions Between Gene Pairs in Alzheimer's Disease Prediction

Background: Alzheimer's disease (AD) is the most common type of dementia. Scientists have discovered that the causes of AD may include a combination of genetic, lifestyle, and environmental factors, but the exact cause has not yet been elucidated. Effective strategies to prevent and treat AD therefore remain elusive. The identified genetic causes of AD mainly focus on individual genes, but growing evidence has shown that complex diseases are usually affected by the interaction of genes in a network. Few studies have focused on the interactions and correlations between genes and how they are gradually destroyed or disappear during AD progression. A differential network analysis has been recognized as an essential tool for identifying the underlying pathogenic mechanisms and significant genes for prediction analysis. We therefore aim to conduct a differential network analysis to reveal potential networks involved in the neuropathogenesis of AD and identify genes for AD prediction. Methods: In this paper, we selected 365 samples from the Religious Orders Study and the Rush Memory and Aging Project, including 193 clinically and neuropathologically confirmed AD subjects and 172 no cognitive impairment (NCI) controls. Then, we selected 158 genes belonging to the AD pathway (hsa05010) of the Kyoto Encyclopedia of Genes and Genomes. We employed a machine learning method, namely, joint density-based non-parametric differential interaction network analysis and classification (JDINAC), in the analysis of gene expression data (RNA-seq data). We searched for the differential networks in the RNA-seq data with a pathological diagnosis of AD. Finally, an optimal prediction model was built through cross-validation, which showed good discrimination and calibration for AD prediction. Results: We used JDINAC to derive a gene co-expression network and to explore the relationship between the interaction of gene pairs and AD, and the top 10 differential gene pairs were identified. We then compared the prediction performance between JDINAC and individual genes based on prediction methods. JDINAC provides better accuracy of classification than the latest methods, such as random forest and penalized logistic regression. Conclusions: The interaction between gene pairs is related to AD and can provide more insight than the individual genes in AD prediction.

[1]  Lei Zhang,et al.  Analyzing the genes related to Alzheimer’s disease via a network and pathway-based approach , 2017, Alzheimer's Research & Therapy.

[2]  N. Friedman,et al.  Comprehensive comparative analysis of strand-specific RNA sequencing methods , 2010, Nature Methods.

[3]  Margarida Silveira,et al.  Predicting conversion from MCI to AD with FDG-PET brain images at different prodromal stages , 2015, Comput. Biol. Medicine.

[4]  M. Murray,et al.  Mitochondrial ATP synthase activity is impaired by suppressed O-GlcNAcylation in Alzheimer's disease. , 2015, Human molecular genetics.

[5]  Hui Yu,et al.  Bioinformatics Applications Note Gene Expression Dcgl: an R Package for Identifying Differentially Coexpressed Genes and Links from Gene Expression Microarray Data , 2022 .

[6]  Jing Xu,et al.  A powerful weighted statistic for detecting group differences of directed biological networks , 2016, Scientific Reports.

[7]  Xiaoshuai Zhang,et al.  A powerful score-based statistical test for group difference in weighted biological networks , 2016, BMC Bioinformatics.

[8]  Steve Iliffe,et al.  Alzheimer’s disease , 2009, BMJ : British Medical Journal.

[9]  Margaret A. Pericak-Vance,et al.  Novel late-onset Alzheimer disease loci variants associate with brain gene expression , 2012, Neurology.

[10]  D. G. Clark,et al.  Effects of multiple genetic loci on age at onset in late-onset Alzheimer disease: a genome-wide association study. , 2014, JAMA neurology.

[11]  Aviv Regev,et al.  Comprehensive comparative analysis of RNA sequencing methods for degraded or low input samples , 2013, Nature Methods.

[12]  Pharmacological antagonism of interleukin-8 receptor CXCR2 inhibits inflammatory reactivity and is neuroprotective in an animal model of Alzheimer’s disease , 2015, Journal of Neuroinflammation.

[13]  H. Braak,et al.  Neuropathological stageing of Alzheimer-related changes , 2004, Acta Neuropathologica.

[14]  Ronald C Petersen,et al.  Essentials of the proper diagnoses of mild cognitive impairment, dementia, and major subtypes of dementia. , 2003, Mayo Clinic proceedings.

[15]  J. Kauwe,et al.  Mitochondria and Alzheimer’s Disease: the Role of Mitochondrial Genetic Variation , 2018, Current Genetic Medicine Reports.

[16]  Laura Bleiler,et al.  2014 Alzheimer's disease facts and figures , 2014, Alzheimer's & Dementia.

[17]  Igor O. Korolev Alzheimer's Disease: A Clinical and Basic Science Review , 2014 .

[18]  Matthew N McCall,et al.  Estimation of Gene Regulatory Networks. , 2013, Postdoc journal : a journal of postdoctoral research and postdoctoral affairs.

[19]  Edward L. Huttlin,et al.  TIMMDC1/C3orf1 Functions as a Membrane-Embedded Mitochondrial Complex I Assembly Factor through Association with the MCIA Complex , 2013, Molecular and Cellular Biology.

[20]  Kenneth R Norman,et al.  Presenilin mutations deregulate mitochondrial Ca2+ homeostasis and metabolic activity causing neurodegeneration in Caenorhabditis elegans , 2018, eLife.

[21]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[22]  J. Schneider,et al.  Overview and findings from the religious orders study. , 2012, Current Alzheimer research.

[23]  H. Brodaty,et al.  ALZHEIMER'S DISEASE INTERNATIONAL , 1997, International journal of geriatric psychiatry.

[24]  Kristel Sleegers,et al.  Genetic variations underlying Alzheimer's disease: evidence from genome-wide association studies and beyond , 2016, The Lancet Neurology.

[25]  Yang Feng,et al.  JDINAC: joint density-based non-parametric differential interaction network analysis and classification using high-dimensional sparse omics data , 2017, bioRxiv.

[26]  K. Lunetta,et al.  Age-at-Onset in Late Onset Alzheimer Disease is Modified by Multiple Genetic Loci , 2014 .

[27]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[28]  Somnath Datta,et al.  Integrating gene regulatory pathways into differential network analysis of gene expression data , 2019, Scientific Reports.

[29]  Sourav Bandyopadhyay,et al.  Rewiring of Genetic Networks in Response to DNA Damage , 2010, Science.

[30]  Diego di Bernardo,et al.  Differential network analysis for the identification of condition-specific pathway activity and regulation , 2013, Bioinform..

[31]  Anne Corbett,et al.  Alzheimer's disease , 2011, The Lancet.

[32]  Tao Zhang,et al.  A novel chi‐square statistic for detecting group differences between pathways in systems epidemiology , 2016, Statistics in medicine.

[33]  J. Schneider,et al.  Overview and findings from the rush Memory and Aging Project. , 2012, Current Alzheimer research.

[34]  Jing Xu,et al.  Detection for pathway effect contributing to disease in systems epidemiology with a case–control design , 2015, BMJ Open.

[35]  Quantitative profiling brain proteomes revealed mitochondrial dysfunction in Alzheimer’s disease , 2019, Molecular Brain.

[36]  M. Beal,et al.  Amyloid beta, mitochondrial dysfunction and synaptic damage: implications for cognitive decline in aging and Alzheimer's disease. , 2008, Trends in molecular medicine.

[37]  E. Strehler Emanuel Strehler's work on calcium pumps and calcium signaling. , 2011, World journal of biological chemistry.

[38]  M. Beal,et al.  Mitochondrial dysfunction and oxidative stress in neurodegenerative diseases , 2006, Nature.

[39]  Lei Xie,et al.  A new insight into underlying disease mechanism through semi-parametric latent differential network model , 2018, bioRxiv.

[40]  Sara Fontanella,et al.  Machine learning to identify pairwise interactions between specific IgE antibodies and their association with asthma: A cross-sectional analysis within a population-based birth cohort , 2018, PLoS medicine.

[41]  T. Cai,et al.  Direct estimation of differential networks. , 2014, Biometrika.

[42]  Giovanni Montana,et al.  Differential analysis of biological networks , 2015, BMC Bioinformatics.

[43]  Charles C. White,et al.  A molecular network of the aging human brain provides insights into the pathology and cognitive decline of Alzheimer’s disease , 2018, Nature Neuroscience.