Coverage of protein domain families with structural protein-protein interactions: current progress and future trends.

Protein interactions have evolved into highly precise and regulated networks adding an immense layer of complexity to cellular systems. The most accurate atomistic description of protein binding sites can be obtained directly from structures of protein complexes. The availability of structurally characterized protein interfaces significantly improves our understanding of interactomes, and the progress in structural characterization of protein-protein interactions (PPIs) can be measured by calculating the structural coverage of protein domain families. We analyze the coverage of protein domain families (defined according to CDD and Pfam databases) by structures, structural protein-protein complexes and unique protein binding sites. Structural PPI coverage of currently available protein families is about 30% without any signs of saturation in coverage growth dynamics. Given the current growth rates of domain databases and structural PPI deposition, complete domain coverage with PPIs is not expected in the near future. As a result of this study we identify families without any protein-protein interaction evidence (listed on a supporting website http://www.ncbi.nlm.nih.gov/Structure/ibis/coverage/) and propose them as potential targets for structural studies with a focus on protein interactions.

[1]  R. Russell,et al.  The relationship between sequence and interaction divergence in proteins. , 2003, Journal of molecular biology.

[2]  Haruki Nakamura,et al.  Data Deposition and Annotation at the Worldwide Protein Data Bank , 2009, Molecular biotechnology.

[3]  Zoran Obradovic,et al.  ProtBuD: a database of biological unit structures of protein families and superfamilies , 2006, Bioinform..

[4]  Andrej Sali,et al.  Localization of protein‐binding sites within families of proteins , 2005, Protein science : a publication of the Protein Society.

[5]  Christine A Orengo,et al.  Comparative evolutionary analysis of protein complexes in E. coli and yeast , 2010, BMC Genomics.

[6]  Sarel J Fleishman,et al.  Emerging themes in the computational design of novel enzymes and protein–protein interfaces , 2013, FEBS letters.

[7]  N. Srinivasan,et al.  Stability of domain structures in multi-domain proteins , 2011, Scientific reports.

[8]  Benoit H. Dessailly,et al.  Functional site plasticity in domain superfamilies☆ , 2013, Biochimica et biophysica acta.

[9]  Angelo D. Favia,et al.  Protein promiscuity and its implications for biotechnology , 2009, Nature Biotechnology.

[10]  A. Panchenko,et al.  Phosphorylation in protein-protein binding: effect on stability and function. , 2011, Structure.

[11]  Stephen L Mayo,et al.  A de novo designed protein–protein interface , 2007, Protein science : a publication of the Protein Society.

[12]  A. Barabasi,et al.  An empirical framework for binary interactome mapping , 2008, Nature Methods.

[13]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[14]  A. Panchenko,et al.  Mechanisms of protein oligomerization, the critical role of insertions and deletions in maintaining different oligomeric states , 2010, Proceedings of the National Academy of Sciences.

[15]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[16]  Yanli Wang,et al.  MMDB: 3D structures and macromolecular interactions , 2011, Nucleic Acids Res..

[17]  Gaetano T. Montelione,et al.  The Protein Structure Initiative: achievements and visions for the future , 2012, F1000 biology reports.

[18]  Qifang Xu,et al.  The protein common interface database (ProtCID)—a comprehensive database of interactions of homologous proteins in multiple crystal forms , 2010, Nucleic Acids Res..

[19]  R. Nussinov,et al.  Predicting protein-protein interactions on a proteome scale by matching evolutionary and structural similarities at interfaces using PRISM , 2011, Nature Protocols.

[20]  Emil Alexov,et al.  Nucleic Acids Research Advance Access published October 28, 2006 PROTCOM: searchable database of protein complexes enhanced with domain–domain structures , 2006 .

[21]  H. Wolfson,et al.  A new, structurally nonredundant, diverse data set of protein–protein interfaces and its implications , 2004, Protein science : a publication of the Protein Society.

[22]  Benjamin A. Shoemaker,et al.  IBIS (Inferred Biomolecular Interaction Server) reports, predicts and integrates multiple types of conserved interactions for proteins , 2011, Nucleic Acids Res..

[23]  S. Jones,et al.  Protein domain interfaces: characterization and comparison with oligomeric protein interfaces. , 2000, Protein engineering.

[24]  Raquel Norel,et al.  Protein interface conservation across structure space , 2010, Proceedings of the National Academy of Sciences.

[25]  Marco Punta,et al.  An estimated 5% of new protein structures solved today represent a new Pfam family , 2013, Acta crystallographica. Section D, Biological crystallography.

[26]  G. Montelione,et al.  Contributions to the NIH-NIGMS Protein Structure Initiative from the PSI Production Centers. , 2008, Structure.

[27]  Benjamin A. Shoemaker,et al.  Finding biologically relevant protein domain interactions: Conserved binding mode analysis , 2006, Protein science : a publication of the Protein Society.

[28]  Francis Rodier,et al.  Protein–protein interaction at crystal contacts , 1995, Proteins.

[29]  Chenghua Shao,et al.  Trendspotting in the Protein Data Bank , 2013, FEBS letters.

[30]  Narmada Thanki,et al.  CDD: conserved domains and protein three-dimensional structure , 2012, Nucleic Acids Res..

[31]  Burkhard Rost,et al.  Alternative Protein-Protein Interfaces Are Frequent Exceptions , 2012, PLoS Comput. Biol..

[32]  Charlotte M. Deane,et al.  What Evidence Is There for the Homology of Protein-Protein Interactions? , 2012, PLoS Comput. Biol..

[33]  Fred P. Davis,et al.  PIBASE: a comprehensive database of structurally defined protein interfaces , 2005, Bioinform..

[34]  Jordi Mestres,et al.  FCP: functional coverage of the proteome by structures , 2006, Bioinform..

[35]  Benjamin A. Shoemaker,et al.  Evolution of protein binding modes in homooligomers. , 2010, Journal of molecular biology.

[36]  Dietlind L. Gerloff,et al.  BISC: Binary SubComplexes in proteins database , 2010, Nucleic Acids Res..

[37]  W. Bialek,et al.  Information-based clustering. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Benjamin A. Shoemaker,et al.  Inferred Biomolecular Interaction Server—a web server to analyze and predict protein interacting partners and binding sites , 2009, Nucleic Acids Res..

[39]  Gary D Bader,et al.  Domain‐mediated protein interaction prediction: From genome to network , 2012, FEBS letters.

[40]  M. Levitt Nature of the protein universe , 2009, Proceedings of the National Academy of Sciences.