Identifying Alzheimer’s disease-related proteins by LRRGD

Alzheimer’s disease (AD) imposes a heavy burden on society and every family. Therefore, diagnosing AD in advance and discovering new drug targets are crucial, while these could be achieved by identifying AD-related proteins. The time-consuming and money-costing biological experiment makes researchers turn to develop more advanced algorithms to identify AD-related proteins. Firstly, we proposed a hypothesis “similar diseases share similar related proteins”. Therefore, five similarity calculation methods are introduced to find out others diseases which are similar to AD. Then, these diseases’ related proteins could be obtained by public data set. Finally, these proteins are features of each disease and could be used to map their similarity to AD. We developed a novel method ‘LRRGD’ which combines Logistic Regression (LR) and Gradient Descent (GD) and borrows the idea of Random Forest (RF). LR is introduced to regress features to similarities. Borrowing the idea of RF, hundreds of LR models have been built by randomly selecting 40 features (proteins) each time. Here, GD is introduced to find out the optimal result. To avoid the drawback of local optimal solution, a good initial value is selected by some known AD-related proteins. Finally, 376 proteins are found to be related to AD. Three hundred eight of three hundred seventy-six proteins are the novel proteins. Three case studies are done to prove our method’s effectiveness. These 308 proteins could give researchers a basis to do biological experiments to help treatment and diagnostic AD.

[1]  J. Henley,et al.  Profiles of SUMO and ubiquitin conjugation in an Alzheimer's disease model , 2011, Neuroscience Letters.

[2]  Gang Feng,et al.  Disease Ontology: a backbone for disease semantic integration , 2011, Nucleic Acids Res..

[3]  Su Deng,et al.  Bayesian network model for identification of pathways by integrating protein interaction with genetic interaction data , 2017, BMC Systems Biology.

[4]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[5]  Guanghua Xiao,et al.  Validation of a serum screen for Alzheimer's disease across assay platforms, species, and tissues. , 2014, Journal of Alzheimer's disease : JAD.

[6]  Yadong Wang,et al.  Towards integrative gene functional similarity measurement , 2014, BMC Bioinformatics.

[7]  Liang Cheng,et al.  Identification of Alzheimer's Disease-Related Genes Based on Data Integration Method , 2019, Front. Genet..

[8]  Michael W. Weiner,et al.  The future of blood-based biomarkers for Alzheimer's disease , 2014, Alzheimer's & Dementia.

[9]  L. D’Adamio,et al.  Amyloid beta protein precursor (AbetaPP), but not AbetaPP-like protein 2, is bridged to the kinesin light chain by the scaffold protein JNK-interacting protein 1. , 2003, The Journal of biological chemistry.

[10]  Xiaoyu Wang,et al.  Combining Gene Ontology with Deep Neural Networks to Enhance the Clustering of Single Cell RNA-Seq Data , 2018 .

[11]  I. Ferrer,et al.  Familial Alzheimer's disease-associated presenilin-1 alters cerebellar activity and calcium homeostasis. , 2014, The Journal of clinical investigation.

[12]  Guangmin Liang,et al.  k-Skip-n-Gram-RF: A Random Forest Based Method for Alzheimer's Disease Protein Identification , 2019, Front. Genet..

[13]  M. Staufenbiel,et al.  Neurofilament Light Chain in Blood and CSF as Marker of Disease Progression in Mouse Models and in Neurodegenerative Diseases , 2016, Neuron.

[14]  Matt Kaeberlein,et al.  A SYSTEMS-BIOLOGY APPROACH TO IDENTIFY CANDIDATE GENES FOR ALZHEIMER'S DISEASE BY INTEGRATING PROTEIN-PROTEIN INTERACTION NETWORK AND SUBSEQUENT IN VIVO VALIDATION OF CANDIDATE GENES USING A C. ELEGANS MODEL OF AB TOXICITY , 2014, Alzheimer's & Dementia.

[15]  Jiajie Peng,et al.  Predicting Parkinson's Disease Genes Based on Node2vec and Autoencoder , 2019, Front. Genet..

[16]  H. Stanley,et al.  Discovering disease-associated genes in weighted protein–protein interaction networks , 2018 .

[17]  Jiajie Peng,et al.  SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association , 2014, PloS one.

[18]  Deendayal Dinakarpandian,et al.  Finding disease similarity based on implicit semantic similarity , 2012, J. Biomed. Informatics.

[19]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[20]  Jie Sun,et al.  DincRNA: a comprehensive web-based bioinformatics toolkit for exploring disease associations and ncRNA function , 2018, Bioinform..

[21]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[22]  Henrik Zetterberg,et al.  Plasma tau levels in Alzheimer's disease , 2013, Alzheimer's Research & Therapy.

[23]  Chaoyang Zhang,et al.  SeqAssist: a novel toolkit for preliminary analysis of next-generation sequencing data , 2014, BMC Bioinformatics.

[24]  Jianye Hao,et al.  A learning-based framework for miRNA-disease association identification using neural networks , 2018, bioRxiv.

[25]  Qinghua Guo,et al.  LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse , 2018, Nucleic Acids Res..

[26]  K. Blennow,et al.  CSF and blood biomarkers for the diagnosis of Alzheimer's disease: a systematic review and meta-analysis , 2016, The Lancet Neurology.

[27]  K. Jellinger General aspects of neurodegeneration. , 2003, Journal of neural transmission. Supplementum.

[28]  J. Cummings,et al.  Repurposed agents in the Alzheimer’s disease drug development pipeline , 2020, Alzheimer's Research & Therapy.

[29]  J. Cummings,et al.  Alzheimer's disease drug development pipeline: 2017 , 2017, Alzheimer's & dementia.

[30]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[31]  B. Dubois,et al.  Paths to Alzheimer’s disease prevention: From modifiable risk factors to biomarker enrichment strategies , 2015, The journal of nutrition, health & aging.

[32]  L. Goldstein,et al.  Axonal Transport of Amyloid Precursor Protein Is Mediated by Direct Binding to the Kinesin Light Chain Subunit of Kinesin-I , 2000, Neuron.

[33]  L. D’Adamio,et al.  Amyloid β Protein Precursor (AβPP), but Not AβPP-like Protein 2, Is Bridged to the Kinesin Light Chain by the Scaffold Protein JNK-interacting Protein 1* , 2003, Journal of Biological Chemistry.

[34]  Borivoj Vojtesek,et al.  Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets , 2015, Bioinform..

[35]  J. J. Espinosa-Aguirre,et al.  Cytochrome P450 in the central nervous system as a therapeutic target in neurodegenerative diseases , 2018, Drug metabolism reviews.

[36]  P. Prather Preface to DMR special edition ‘Cannabinoid receptors and ligands: therapeutic drug development and abuse potential’ , 2018, Drug metabolism reviews.

[37]  Daniel Trejo-Baños,et al.  Integrating transcriptional activity in genome-scale models of metabolism , 2017, BMC Systems Biology.

[38]  M. Bobinski,et al.  Prediction of cognitive decline in normal elderly subjects with 2-[18F]fluoro-2-deoxy-d-glucose/positron-emission tomography (FDG/PET) , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[39]  Philip S. Yu,et al.  A new method to measure the semantic similarity of GO terms , 2007, Bioinform..