Predicting Parkinson's Disease Genes Based on Node2vec and Autoencoder

Identifying genes associated with Parkinson's disease is important for its diagnosis and treatment. In recent years, many methods based on the guilt-by-association hypothesis have been proposed to predict disease-related genes, but few of them were designed for, or applied to, Parkinson's disease. In this paper, we propose a novel method for predicting Parkinson's disease genes, named N2A-SVM. N2A-SVM has three parts: extracting gene features from a network, reducing the feature dimension with a deep neural network, and predicting Parkinson's disease genes with a machine learning method. Our evaluation shows that N2A-SVM performs better than existing methods. Furthermore, we evaluate the significance of each step in the N2A-SVM algorithm and the influence of the hyper-parameters on the result. In addition, we train N2A-SVM on a recent dataset and use it to predict Parkinson's disease genes. The top-ranked predicted genes can be verified through a literature survey.
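As a rough illustration of the first step (network-based feature extraction), the sketch below samples node2vec-style biased random walks on a toy network. It is a minimal sketch under stated assumptions, not the authors' implementation: the graph, the node names (`geneA` to `geneE`), the walk length, and the values of the return parameter `p` and in-out parameter `q` are all hypothetical. In a node2vec pipeline, walks like these would then be passed to a skip-gram model to obtain gene embeddings; the abstract's later steps (autoencoder-based dimension reduction and SVM classification) are not shown here.

```python
import random

def node2vec_walk(adj, start, length, p=1.0, q=1.0, rng=None):
    """Sample one second-order biased walk containing up to `length` nodes."""
    rng = rng or random.Random()
    walk = [start]
    while len(walk) < length:
        cur = walk[-1]
        nbrs = sorted(adj[cur])  # sorted for reproducible sampling
        if not nbrs:
            break
        if len(walk) == 1:
            # First step has no previous node, so choose uniformly.
            walk.append(rng.choice(nbrs))
            continue
        prev = walk[-2]
        weights = []
        for nxt in nbrs:
            if nxt == prev:
                weights.append(1.0 / p)   # step back to the previous node
            elif nxt in adj[prev]:
                weights.append(1.0)       # stay within distance 1 of prev
            else:
                weights.append(1.0 / q)   # move outward, distance 2 from prev
        walk.append(rng.choices(nbrs, weights=weights, k=1)[0])
    return walk

# Toy undirected network; node names and edges are hypothetical placeholders.
edges = [("geneA", "geneB"), ("geneB", "geneC"), ("geneC", "geneA"),
         ("geneC", "geneD"), ("geneD", "geneE")]
adj = {}
for u, v in edges:
    adj.setdefault(u, set()).add(v)
    adj.setdefault(v, set()).add(u)

# Three walks of six nodes from every node; p and q are assumed values.
rng = random.Random(0)
walks = [node2vec_walk(adj, node, 6, p=1.0, q=0.5, rng=rng)
         for node in sorted(adj) for _ in range(3)]
```

Setting `q` below 1 biases walks toward exploring outward from the previous node, while a small `p` makes immediate backtracking more likely; how these settings affect prediction quality is the kind of hyper-parameter influence the abstract says is evaluated.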
