A Computational Model for Predicting Protein Interactions Based on Multidomain Collaboration

Recently, several domain-based computational models for predicting protein-protein interactions (PPIs) have been proposed. The conventional methods usually infer domain or domain combination (DC) interactions from already known interacting sets of proteins, and then predict PPIs using the information. However, the majority of these models often have limitations in providing detailed information on which domain pair (single domain interaction) or DC pair (multidomain interaction) will actually interact for the predicted protein interaction. Therefore, a more comprehensive and concrete computational model for the prediction of PPIs is needed. We developed a computational model to predict PPIs using the information of intraprotein domain cohesion and interprotein DC coupling interaction. A method of identifying the primary interacting DC pair was also incorporated into the model in order to infer actual participants in a predicted interaction. Our method made an apparent improvement in the PPI prediction accuracy, and the primary interacting DC pair identification was valid specifically in predicting multidomain protein interactions. In this paper, we demonstrate that 1) the intraprotein domain cohesion is meaningful in improving the accuracy of domain-based PPI prediction, 2) a prediction model incorporating the intradomain cohesion enables us to identify the primary interacting DC pair, and 3) a hybrid approach using the intra/interdomain interaction information can lead to a more accurate prediction.

[1]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[2]  Jer-Ming Chia,et al.  Implications for domain fusion protein-protein interactions based on structural information , 2004, BMC Bioinformatics.

[3]  Carlos Prieto,et al.  APID: Agile Protein Interaction DataAnalyzer , 2006, Nucleic Acids Res..

[4]  K. Guimaraes,et al.  Predicting domain-domain interactions using a parsimony approach , 2006, Genome Biology.

[5]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[6]  Ioannis Xenarios,et al.  DIP: The Database of Interacting Proteins: 2001 update , 2001, Nucleic Acids Res..

[7]  Anton J. Enright,et al.  Protein interaction maps for complete genomes based on gene fusion events , 1999, Nature.

[8]  E. Koonin,et al.  The structure of the protein universe and genome evolution , 2002, Nature.

[9]  Christopher J. Lee,et al.  Inferring protein domain interactions from databases of interacting proteins , 2005, Genome Biology.

[10]  Robert D. Finn,et al.  iPfam: visualization of protein?Cprotein interactions in PDB at domain and amino acid resolutions , 2005, Bioinform..

[11]  S. Teichmann,et al.  Domain combinations in archaeal, eubacterial and eukaryotic proteomes. , 2001, Journal of molecular biology.

[12]  Adam J. Smith,et al.  The Database of Interacting Proteins: 2004 update , 2004, Nucleic Acids Res..

[13]  Hong-Soog Kim,et al.  PreSPI: design and implementation of protein-protein interaction prediction service system. , 2004, Genome informatics. International Conference on Genome Informatics.

[14]  Toshihisa Takagi,et al.  Improving the Performance of an SVM-Based Method for Predicting Protein-Protein Interactions , 2006, Silico Biol..

[15]  Dong-Soo Han,et al.  PreSPI: a domain combination based prediction system for protein-protein interaction. , 2004, Nucleic acids research.

[16]  Ting Chen,et al.  An integrated approach to the prediction of domain-domain interactions , 2006, BMC Bioinformatics.

[17]  Luonan Chen,et al.  Inferring protein interactions from experimental data by association probabilistic method , 2006, Proteins.

[18]  Hung Fei-Hung,et al.  Protein-Protein Interaction Prediction based on Association Rules of Protein Functional Regions , 2007, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007).

[19]  Baris E. Suzek,et al.  The Universal Protein Resource (UniProt) in 2010 , 2009, Nucleic Acids Res..

[20]  K. Aihara,et al.  A discriminative approach for identifying domain–domain interactions from protein–protein interactions , 2010, Proteins.

[21]  Y. Zhang,et al.  IntAct—open source resource for molecular interaction data , 2006, Nucleic Acids Res..

[22]  Dong-Soo Han,et al.  Identification of Conserved Domain Combinations in S.cerevisiae Proteins , 2007, 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering.

[23]  Zohar Itzhaki,et al.  Preferential use of protein domain pairs as interaction mediators: order and transitivity , 2010, Bioinform..

[24]  Teresa M. Przytycka,et al.  DOMINE: a database of protein domain interactions , 2007, Nucleic Acids Res..

[25]  Erik L. L. Sonnhammer,et al.  Comparative analysis and unification of domain-domain interaction networks , 2009, Bioinform..

[26]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[27]  Nianjun Liu,et al.  Inferring protein-protein interactions through high-throughput interaction data from diverse organisms , 2005, Bioinform..

[28]  S. Teichmann,et al.  Multi-domain protein families and domain pairs: comparison with known structures and a random model of domain recombination , 2004, Journal of Structural and Functional Genomics.

[29]  Maria Victoria Schneider,et al.  MINT: a Molecular INTeraction database. , 2002, FEBS letters.

[30]  Teresa M. Przytycka,et al.  Interrogating domain-domain interactions with parsimony based approaches , 2008, BMC Bioinformatics.

[31]  María Martín,et al.  The Universal Protein Resource (UniProt) in 2010 , 2010 .

[32]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[33]  Livia Perfetto,et al.  MINT, the molecular interaction database: 2012 update , 2011, Nucleic Acids Res..

[34]  E. Sprinzak,et al.  Correlated sequence-signatures as markers of protein-protein interaction. , 2001, Journal of molecular biology.

[35]  R. Treisman,et al.  The POZ domain: a conserved protein-protein interaction motif. , 1994, Genes & development.

[36]  Luonan Chen,et al.  Analysis on multi-domain cooperation for predicting protein-protein interactions , 2007, BMC Bioinformatics.

[37]  See-Kiong Ng,et al.  Integrative approach for computationally inferring protein domain interactions , 2003, SAC '03.

[38]  D. Eisenberg,et al.  Detecting protein function and protein-protein interactions from genome sequences. , 1999, Science.

[39]  M. Carlson,et al.  A family of proteins containing a conserved domain that mediates interaction with the yeast SNF1 protein kinase complex. , 1994, The EMBO journal.

[40]  Yu Zong Chen,et al.  prediction of protein-protein interactions , 2004 .

[41]  C. Peterson,et al.  Topological properties of citation and metabolic networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.