Inferring domain-domain interactions using an extended parsimony model

High-throughput technologies have produced a large number of protein-protein interactions (PPIs) for different species. As protein domains are functional and structural units of proteins, many computational efforts have been made to identify domain-domain interactions (DDIs) from PPIs. Parsimony assumption is widely used in computational biology as the evolution of the nature is considered as a continuous optimization process. In the context of identifying DDIs, parsimony methods try to find a minimal set of DDIs that can explain the observed PPIs. This category of methods are promising since they can be formulated and solved easily. Besides, researches have shown that they could detect specific DDIs, which is often hard for many probabilistic methods. In this paper, we revisit the parsimony model by presenting two important extensions. First, ‘complex networks’ as an emerging concept is incorporated as prior knowledge into the parsimony model. With this improvement, the prediction accuracy increases, which to some extent enhances the biological meaning of the common property of complex networks. Second, two randomization tests are designed to show the parsimony nature of the DDIs in mediating PPIs, which corroborates the model validation.

[1]  M. Newman,et al.  Mixing patterns in networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Alessandro Vespignani,et al.  Detecting rich-club ordering in complex networks , 2006, physics/0602134.

[3]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[4]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[5]  Luonan Chen,et al.  Inferring Protein-Protein Interactions by Combinatorial Models , 2007 .

[6]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[7]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[9]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[10]  Ralph Snyderman,et al.  Prospective health care: the second transformation of medicine , 2006, Genome Biology.

[11]  E. Sprinzak,et al.  Correlated sequence-signatures as markers of protein-protein interaction. , 2001, Journal of molecular biology.

[12]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[13]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[14]  Robert D. Finn,et al.  iPfam: visualization of protein?Cprotein interactions in PDB at domain and amino acid resolutions , 2005, Bioinform..

[15]  Christopher J. Lee,et al.  Inferring protein domain interactions from databases of interacting proteins , 2005, Genome Biology.

[16]  K. Guimaraes,et al.  Predicting domain-domain interactions using a parsimony approach , 2006, Genome Biology.

[17]  Robert B. Russell,et al.  3did: interacting protein domains of known three-dimensional structure , 2004, Nucleic Acids Res..

[18]  Teresa M. Przytycka,et al.  Interrogating domain-domain interactions with parsimony based approaches , 2008, BMC Bioinformatics.

[19]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.