EMDIP: An Entropy Measure to Discover Important Proteins in PPI networks

Discovering important proteins in Protein-Protein Interaction (PPI) networks has attracted a lot of attention in recent years. Most of the previous work applies different network centrality measures such as Closeness, Betweenness, PageRank and many others to discover the most influential proteins in PPI networks. Although entropy is a well-known graph-based method in computer science, according to our knowledge, it is not used in the biology domain for this purpose. In this paper, first, we annotate the human PPI network with available annotation data. Second, we introduce a new concept called annotation-context that describes each protein according to annotation data of its neighbors. Third, we apply an entropy measure to discover proteins with varied annotation-context. Empirical results indicate that our proposed method succeeded in (1) differentiating essential and non-essential proteins in PPI networks with annotation data; (2) outperforming centrality measures in the task of discovering essential nodes; (3) predicting new annotated proteins based on existing annotation data.

[1]  Yoshihiro Yamanishi,et al.  Predicting drug side-effect profiles: a chemical fragment-based approach , 2011, BMC Bioinformatics.

[2]  Yi Pan,et al.  United Neighborhood Closeness Centrality and Orthology for Predicting Essential Proteins , 2020, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[3]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[4]  G. Giaever,et al.  DNA-damaging agents in cancer chemotherapy: serendipity and chemical biology. , 2013, Chemistry & biology.

[5]  B. Snel,et al.  Predicting disease genes using protein–protein interactions , 2006, Journal of Medical Genetics.

[6]  E. Guney,et al.  Exploiting Protein-Protein Interaction Networks for Genome-Wide Disease-Gene Prioritization , 2012, PloS one.

[7]  J. Stephens,et al.  STAT5-Interacting Proteins: A Synopsis of Proteins that Regulate STAT5 Activity , 2017, Biology.

[8]  R. Sharan,et al.  Network-based prediction of protein function , 2007, Molecular systems biology.

[9]  H. Samuels,et al.  Nuclear receptor coactivator/coregulator NCoA6(NRC) is a pleiotropic coregulator involved in transcription, cell survival, growth and development , 2008, Nuclear receptor signaling.

[10]  Alan Ashworth,et al.  Targeting the DNA repair defect in BRCA mutant cells as a therapeutic strategy , 2005, Nature.

[11]  Xia Li,et al.  Network-based survival-associated module biomarker and its crosstalk with cell death genes in ovarian cancer , 2015, Scientific Reports.

[12]  Albert-László Barabási,et al.  Network-based prediction of protein interactions , 2018, Nature Communications.

[13]  Li Xiaojun,et al.  The Analysis of Logistics Influence of the Important Node Cities of Beijing-Tianjin-Hebei , 2017 .

[14]  Xinghuo Yu,et al.  A New Metric to Find the Most Vulnerable Node in Complex Networks , 2018, 2018 IEEE International Symposium on Circuits and Systems (ISCAS).

[15]  Barry Honig,et al.  Structure-based prediction of ligand–protein interactions on a genome-wide scale , 2017, Proceedings of the National Academy of Sciences.

[16]  Xing-Ming Zhao,et al.  OGEE v2: an update of the online gene essentiality database with special focus on differentially essential genes in human cancer cell lines , 2016, Nucleic Acids Res..

[17]  Ruth Nussinov,et al.  Structure and dynamics of molecular networks: A novel paradigm of drug discovery. A comprehensive review , 2012, Pharmacology & therapeutics.

[18]  Michael Knop,et al.  Evolution of Mutational Robustness in the Yeast Genome: A Link to Essential Genes and Meiotic Recombination Hotspots , 2009, PLoS genetics.

[19]  H. B. Mann,et al.  On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other , 1947 .

[20]  M. Milik,et al.  Mapping adverse drug reactions in chemical space. , 2009, Journal of medicinal chemistry.

[21]  Brian David Dynlacht,et al.  CP110 cooperates with two calcium-binding proteins to regulate cytokinesis and genome stability. , 2006, Molecular biology of the cell.

[22]  Aideen Long,et al.  Roles for RACK1 in cancer cell migration and invasion. , 2017, Cellular signalling.

[23]  S. Calderwood,et al.  Cdc37 as a co-chaperone to Hsp90. , 2015, Sub-cellular biochemistry.

[24]  Xiaoming Fan,et al.  Suppression of RelA/p65 nuclear translocation independent of IκB-α degradation by cyclooxygenase-2 inhibitor in gastric cancer , 2003, Oncogene.

[25]  Anshuman Dixit,et al.  Identification of oral cancer related candidate genes by integrating protein-protein interactions, gene ontology, pathway analysis and immunohistochemistry , 2017, Scientific Reports.

[26]  Peng Zhang,et al.  DPRank centrality: Finding important vertices based on random walks with a new defined transition matrix , 2017, Future Gener. Comput. Syst..

[27]  Gary D Bader,et al.  Computational Prediction of Protein–Protein Interactions , 2008, Molecular biotechnology.

[28]  Ozlem Keskin,et al.  Topological, functional, and structural analyses of protein-protein interaction networks of breast cancer lung and brain metastases , 2017, 2017 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB).

[29]  Claus Hellerbrand,et al.  ENMD-1198, a novel tubulin-binding agent reduces HIF-1alpha and STAT3 activity in human hepatocellular carcinoma(HCC) cells, and inhibits growth and vascularization in vivo , 2008, BMC Cancer.

[30]  P Manimaran,et al.  The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach. , 2014, Molecular bioSystems.

[31]  Bagher Farhood,et al.  Transforming growth factor‐β signaling: Tumorigenesis and targeting for cancer therapy , 2018, Journal of cellular physiology.

[32]  Olaf Wolkenhauer,et al.  A systematic survey of centrality measures for protein-protein interaction networks , 2017, bioRxiv.

[33]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[34]  Alpan Raval,et al.  Identifying Hubs in Protein Interaction Networks , 2009, PloS one.

[35]  Takashi Tsuruo,et al.  Regulation of Kinase Activity of 3-Phosphoinositide-dependent Protein Kinase-1 by Binding to 14-3-3* , 2002, The Journal of Biological Chemistry.

[36]  H. McMahon,et al.  Mechanisms of endocytosis. , 2009, Annual review of biochemistry.

[37]  C Helma,et al.  Prediction of Adverse Drug Reactions Using Decision Tree Modeling , 2010, Clinical pharmacology and therapeutics.

[38]  P. Radivojac,et al.  An integrated approach to inferring gene–disease associations in humans , 2008, Proteins.

[39]  Federico Bussolino,et al.  Adaptor ShcA Protein Binds Tyrosine Kinase Tie2 Receptor and Regulates Migration and Sprouting but Not Survival of Endothelial Cells* , 2004, Journal of Biological Chemistry.

[40]  Takashi Okamoto,et al.  RING Finger Protein AO7 Supports NF-κB-mediated Transcription by Interacting with the Transactivation Domain of the p65 Subunit* , 2003, Journal of Biological Chemistry.

[41]  Hongkang Mei,et al.  Systematic Prediction of Pharmacodynamic Drug-Drug Interactions through Protein-Protein-Interaction Network , 2013, PLoS Comput. Biol..

[42]  Stanley Letovsky,et al.  Predicting protein function from protein/protein interaction data: a probabilistic approach , 2003, ISMB.

[43]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[44]  Kerstin Kutsche,et al.  Activating Mutations in PAK1, Encoding p21-Activated Kinase 1, Cause a Neurodevelopmental Disorder. , 2018, American journal of human genetics.

[45]  S. Hammes,et al.  Paxillin actions in the nucleus , 2017, Steroids.

[46]  K. Chou,et al.  Identification of Colorectal Cancer Related Genes with mRMR and Shortest Path in Protein-Protein Interaction Network , 2012, PloS one.

[47]  Hossein Rahmani,et al.  Discovering Important Nodes in Social Networks Using Entropy Measure , 2018 .

[48]  Dipanwita Roy Chowdhury,et al.  Human protein reference database as a discovery resource for proteomics , 2004, Nucleic Acids Res..

[49]  Daniel M. Romero,et al.  Influence and passivity in social media , 2010, ECML/PKDD.

[50]  G. Lorimer,et al.  Mammalian mitochondrial chaperonin 60 functions as a single toroidal ring. , 1992, The Journal of biological chemistry.

[51]  Andreas Bender,et al.  Using a Human Drug Network for generating novel hypotheses about drugs , 2016, Intell. Data Anal..

[52]  Matthias Ernst,et al.  Therapeutically exploiting STAT3 activity in cancer — using tissue repair as a road map , 2018, Nature Reviews Cancer.

[53]  N. Tuteja,et al.  Human DNA helicase II: a novel DNA unwinding enzyme identified as the Ku autoantigen. , 1994, The EMBO journal.

[54]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[55]  J. Chen,et al.  Predicting adverse side effects of drugs , 2011, BMC Genomics.

[56]  Athanasios Fevgas,et al.  DIANA-TarBase v7.0: indexing more than half a million experimentally supported miRNA:mRNA interactions , 2014, Nucleic Acids Res..

[57]  Peer Bork,et al.  The SIDER database of drugs and side effects , 2015, Nucleic Acids Res..

[58]  J. Kehrl,et al.  Localization of Giα proteins in the centrosomes and at the midbody: implication for their role in cell division , 2007, The Journal of cell biology.

[59]  Shiwen Zhao,et al.  Network-Based Relating Pharmacological and Genomic Spaces for Drug Target Identification , 2010, PloS one.

[60]  A. Azem,et al.  The effect of nucleotides and mitochondrial chaperonin 10 on the structure and chaperone activity of mitochondrial chaperonin 60. , 2001, European journal of biochemistry.

[61]  Jiří Mayer,et al.  Fostamatinib for the treatment of adult persistent and chronic immune thrombocytopenia: Results of two phase 3, randomized, placebo‐controlled trials , 2018, American journal of hematology.

[62]  Heinrich Sticht,et al.  Casein kinase 2-dependent serine phosphorylation of MuSK regulates acetylcholine receptor aggregation at the neuromuscular junction. , 2006, Genes & development.

[63]  Mark Gerstein,et al.  The Importance of Bottlenecks in Protein Networks: Correlation with Gene Essentiality and Expression Dynamics , 2007, PLoS Comput. Biol..

[64]  Andreas Bender,et al.  Neighbours of cancer-related proteins have key influence on pathogenesis and could increase the drug target space for anticancer therapies , 2017, npj Systems Biology and Applications.

[65]  Andreas Bender,et al.  Predicting genes involved in human cancer using network contextual information. , 2012, Journal of integrative bioinformatics.

[66]  P. Bork,et al.  Network Neighbors of Drug Targets Contribute to Drug Side-Effect Similarity , 2011, PloS one.

[67]  Andreas Bender,et al.  ARWAR: A network approach for predicting Adverse Drug Reactions , 2016, Comput. Biol. Medicine.

[68]  Katsutoshi Yada,et al.  Consumer Behavior Analysis by Graph Mining Technique , 2006, KES.

[69]  Hongming Mo,et al.  Evidential method to identify influential nodes in complex networks , 2015 .

[70]  Jianzhi Zhang,et al.  Why Do Hubs Tend to Be Essential in Protein Networks? , 2006, PLoS genetics.

[71]  Kui Zhang,et al.  Prediction of protein function using protein-protein interaction data , 2002, Proceedings. IEEE Computer Society Bioinformatics Conference.

[72]  A. Khorrami,et al.  The functional significance of 14-3-3 proteins in cancer: focus on lung cancer , 2017, Hormone molecular biology and clinical investigation.

[73]  Enrico Blanzieri,et al.  Identification of Essential Proteins Based on Ranking Edge-Weights in Protein-Protein Interaction Networks , 2014, PloS one.

[74]  Hsien-Da Huang,et al.  miRTarBase update 2014: an information resource for experimentally validated miRNA-target interactions , 2013, Nucleic Acids Res..

[75]  D. Ingber,et al.  High-Betweenness Proteins in the Yeast Protein Interaction Network , 2005, Journal of biomedicine & biotechnology.

[76]  Lada A. Adamic,et al.  Search in Power-Law Networks , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[77]  Andreas Bender,et al.  Collaboration-Based Function Prediction in Protein-Protein Interaction Networks , 2011, International Symposium on Intelligent Data Analysis.

[78]  C. Deane,et al.  Protein Interactions , 2002, Molecular & Cellular Proteomics.

[79]  Michael B Yaffe,et al.  Computational prediction of protein-protein interactions. , 2015, Methods in molecular biology.

[80]  Yi Pan,et al.  Predicting Essential Proteins Based on Weighted Degree Centrality , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[81]  Chan Yeong Kim,et al.  Network-assisted investigation of virulence and antibiotic-resistance systems in Pseudomonas aeruginosa , 2016, Scientific Reports.

[82]  David S. Wishart,et al.  DrugBank 5.0: a major update to the DrugBank database for 2018 , 2017, Nucleic Acids Res..

[83]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[84]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[85]  S. Elledge,et al.  The p21 Cdk-interacting protein Cip1 is a potent inhibitor of G1 cyclin-dependent kinases , 1993, Cell.

[86]  Guo Tian-zhu A new measure method of network node importance based on local characteristics , 2010 .

[87]  Jafar Adibi,et al.  Discovering important nodes through graph entropy the case of Enron email database , 2005, LinkKDD '05.

[88]  Roded Sharan,et al.  An Algorithmic Framework for Predicting Side-Effects of Drugs , 2010, RECOMB.