Using neighborhood cohesiveness to infer interactions between protein domains

MOTIVATION: In recent years, large-scale studies have been undertaken to describe, at least partially, the protein-protein interaction maps, or interactomes, of a number of relevant organisms, including human. However, current interactomes provide a somewhat limited picture of the molecular details of protein interactions, mostly because essential experimental information, especially structural data, is lacking. Indeed, the gap between structural and interactomic information is widening, so for most interactions key experimental information is missing. We build on the observation that many interactions between proteins involve a pair of their constituent domains; knowing how protein domains interact therefore adds significant information to any interactomic analysis.

RESULTS: In this work, we describe a novel use of the neighborhood cohesiveness property to infer interactions between protein domains given a protein interaction network. We have shown that some clustering coefficients can be extended to measure the degree of cohesiveness between two sets of nodes within a network. Specifically, we used the meet/min coefficient to measure the proportion of interacting nodes between two sets of nodes and the fraction of common neighbors. This approach extends previous work in which homologous coefficients were first defined around network nodes and later around edges. The proposed approach substantially increases both the number of predicted domain-domain interactions and their accuracy compared with current methods.
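To make the set-based use of the meet/min coefficient concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes the standard definition meet/min(X, Y) = |X ∩ Y| / min(|X|, |Y|). The two scoring functions (`common_neighbor_score`, `interacting_score`), the helper names, and the toy network are hypothetical illustrations of how "fraction of common neighbors" and "proportion of interacting nodes" between two node sets might be computed; the exact formulas used in the paper are not given in the abstract.

```python
from collections import defaultdict


def meet_min(x, y):
    """Meet/min coefficient |x & y| / min(|x|, |y|); 0.0 if either set is empty."""
    if not x or not y:
        return 0.0
    return len(x & y) / min(len(x), len(y))


def build_adjacency(edges):
    """Build an undirected adjacency map from (protein, protein) pairs."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def neighborhood(adj, nodes):
    """Union of the neighbors of every node in `nodes`."""
    out = set()
    for n in nodes:
        out |= adj.get(n, set())
    return out


def common_neighbor_score(adj, a, b):
    """Assumed score: meet/min of the two sets' neighborhoods."""
    return meet_min(neighborhood(adj, a), neighborhood(adj, b))


def interacting_score(adj, a, b):
    """Assumed score: nodes with a partner in the other set, normalized
    by the size of the smaller set (a meet/min-style normalization)."""
    if not a or not b:
        return 0.0
    a_hit = {n for n in a if adj.get(n, set()) & b}
    b_hit = {n for n in b if adj.get(n, set()) & a}
    return min(len(a_hit), len(b_hit)) / min(len(a), len(b))


# Toy protein interaction network (hypothetical data).
edges = [("p1", "p3"), ("p2", "p3"), ("p1", "p4"),
         ("p1", "p7"), ("p3", "p7"), ("p5", "p6")]
adj = build_adjacency(edges)

# Hypothetical sets of proteins carrying domain A, B and C.
dom_a = {"p1", "p2"}
dom_b = {"p3", "p4"}
dom_c = {"p5"}

print(interacting_score(adj, dom_a, dom_b))
print(common_neighbor_score(adj, dom_a, dom_b))
print(interacting_score(adj, dom_a, dom_c))
```

In this sketch each domain is represented by the set of proteins that contain it, so a high score for a domain pair means the two protein sets are densely connected or share many neighbors in the network. How such scores would be thresholded or combined to call a domain-domain interaction is left open here.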
