Inferring Disease Associated Phosphorylation Sites via Random Walk on Multi-Layer Heterogeneous Network

As protein phosphorylation plays an important role in numerous cellular processes, many studies have analyzed phosphorylation-related activities for drug design and disease treatment. However, although progress has been made in elucidating the relationship between phosphorylation and diseases, no existing method focuses on predicting disease-associated phosphorylation sites. In this work, we propose a multi-layer heterogeneous network model that uses kinase information to infer disease-phosphorylation site relationships, and we implement a random walk on this heterogeneous network. Experimental results show that the multi-layer heterogeneous network model with a kinase layer is superior in inferring disease-phosphorylation site relationships compared with an existing random walk model and commonly used classification methods.
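To make the idea of a random walk over disease, kinase, and phosphorylation-site layers concrete, the following is a minimal sketch of a generic random walk with restart on a single merged graph. It is not the transition scheme used in this work: the node names, the edges, the restart probability of 0.7, and the helper names `column_normalize` and `random_walk_with_restart` are all assumptions made for illustration.

```python
def column_normalize(adj):
    """Scale each column to sum to 1 so it acts as a transition distribution."""
    n = len(adj)
    col_sums = [sum(adj[i][j] for i in range(n)) for j in range(n)]
    return [[adj[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
            for i in range(n)]


def random_walk_with_restart(adj, seeds, restart=0.7, tol=1e-10, max_iter=1000):
    """Iterate p <- (1 - restart) * W p + restart * p0 until p stops changing."""
    n = len(adj)
    w = column_normalize(adj)
    p0 = [0.0] * n
    for s in seeds:
        p0[s] = 1.0 / len(seeds)  # walker restarts uniformly at the seed nodes
    p = p0[:]
    for _ in range(max_iter):
        nxt = [(1 - restart) * sum(w[i][j] * p[j] for j in range(n)) + restart * p0[i]
               for i in range(n)]
        if sum(abs(a - b) for a, b in zip(nxt, p)) < tol:
            return nxt
        p = nxt
    return p


# Hypothetical toy network; every node and edge here is invented for illustration.
nodes = ["disease_A", "disease_B", "kinase_X", "kinase_Y",
         "site_1", "site_2", "site_3"]
edges = [
    ("disease_A", "disease_B"),  # disease-disease similarity
    ("disease_A", "site_1"),     # known disease-site association
    ("disease_B", "site_3"),     # known disease-site association
    ("kinase_X", "site_1"),      # kinase-substrate relation
    ("kinase_X", "site_2"),      # kinase-substrate relation
    ("kinase_Y", "site_3"),      # kinase-substrate relation
]

index = {name: i for i, name in enumerate(nodes)}
adj = [[0.0] * len(nodes) for _ in nodes]
for a, b in edges:
    adj[index[a]][index[b]] = adj[index[b]][index[a]] = 1.0  # undirected edge

# Seed the walk at the query disease and rank sites by steady-state score.
scores = random_walk_with_restart(adj, seeds=[index["disease_A"]])
candidates = sorted((name for name in nodes if name.startswith("site_")),
                    key=lambda name: -scores[index[name]])
print(candidates)
```

In this sketch the kinase nodes only provide extra paths between sites that share a kinase, which is one simple way to read "making use of kinase information"; a faithful multi-layer model would weight within-layer and between-layer transitions separately.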
