‘Double water exclusion’: a hypothesis refining the O-ring theory for the hot spots at protein interfaces

Motivation: The O-ring theory reveals that the binding hot spot at a protein interface is surrounded by a ring of residues that are energetically less important than the residues in the hot spot. As this ring of residues is served to occlude water molecules from the hot spot, the O-ring theory is also called ‘water exclusion’ hypothesis. We propose a ‘double water exclusion’ hypothesis to refine the O-ring theory by assuming the hot spot itself is water-free. To computationally model a water-free hot spot, we use a biclique pattern that is defined as two maximal groups of residues from two chains in a protein complex holding the property that every residue contacts with all residues in the other group. Methods and Results: Given a chain pair A and B of a protein complex from the Protein Data Bank (PDB), we calculate the interatomic distance of all possible pairs of atoms between A and B. We then represent A and B as a bipartite graph based on these distance information. Maximal biclique subgraphs are subsequently identified from all of the bipartite graphs to locate biclique patterns at the interfaces. We address two properties of biclique patterns: a non-redundant occurrence in PDB, and a correspondence with hot spots when the solvent-accessible surface area (SASA) of a biclique pattern in the complex form is small. A total of 1293 biclique patterns are discovered which have a non-redundant occurrence of at least five, and which each have a minimum two and four residues at the two sides. Through extensive queries to the HotSprint and ASEdb databases, we verified that biclique patterns are rich of true hot residues. Our algorithm and results provide a new way to identify hot spots by examining proteins' structural data. Availability: The biclique mining algorithm is available at http://www.ntu.edu.sg/home/jyli/dwe.html. Contact: jyli@ntu.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  P. Privalov,et al.  What drives proteins into the major or minor grooves of DNA? , 2007, Journal of molecular biology.

[2]  W. Delano,et al.  Convergent solutions to binding at a protein-protein interface. , 2000, Science.

[3]  Ozlem Keskin,et al.  HotSprint: database of computational hot spots in protein interfaces , 2007, Nucleic Acids Res..

[4]  Robert Preissner,et al.  Dictionary of Interfaces in Proteins (DIP). Data Bank of complementary molecular surface patches , 1998, German Conference on Bioinformatics.

[5]  Pedro A Fernandes,et al.  Hot spots—A review of the protein–protein interface determinant amino‐acid residues , 2007, Proteins.

[6]  David Eppstein,et al.  Arboricity and Bipartite Subgraph Listing Algorithms , 1994, Inf. Process. Lett..

[7]  Ruth Nussinov,et al.  Generation and analysis of a protein–protein interface data set with similar chemical and spatial patterns of interactions , 2005, Proteins.

[8]  Jie Liang,et al.  Protein-protein interactions: hot spots and structurally conserved residues often locate in complemented pockets that pre-organized in the unbound states: implications for docking. , 2004, Journal of molecular biology.

[9]  Hongbo Zhu,et al.  NOXclass: prediction of protein-protein interaction types , 2006, BMC Bioinformatics.

[10]  R. Häggkvist,et al.  Bipartite graphs and their applications , 1998 .

[11]  Dongxiao Zhu,et al.  BMC Bioinformatics BioMed Central , 2005 .

[12]  H. Wolfson,et al.  Shape complementarity at protein–protein interfaces , 1994, Biopolymers.

[13]  A. Bogan,et al.  Anatomy of hot spots in protein interfaces. , 1998, Journal of molecular biology.

[14]  H. Wolfson,et al.  Studies of protein‐protein interfaces: A statistical analysis of the hydrophobic effect , 1997, Protein science : a publication of the Protein Society.

[15]  Deok-Soo Kim,et al.  A protein domain interaction interface database: InterPare , 2005, BMC Bioinformatics.

[16]  Fred P. Davis,et al.  PIBASE: a comprehensive database of structurally defined protein interfaces , 2005, Bioinform..

[17]  Kurt S. Thorn,et al.  ASEdb: a database of alanine mutations and their effects on the free energy of binding in protein interactions , 2001, Bioinform..

[18]  Luhua Lai,et al.  Structure-based method for analyzing protein–protein interfaces , 2004, Journal of molecular modeling.

[19]  A J Olson,et al.  Morphology of protein-protein interfaces. , 1998, Structure.

[20]  Jinyan Li,et al.  Maximal Biclique Subgraphs and Closed Pattern Pairs of the Adjacency Matrix: A One-to-One Correspondence and Mining Algorithms , 2007, IEEE Transactions on Knowledge and Data Engineering.

[21]  S. Vajda,et al.  Anchor residues in protein-protein interactions. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Andrej Sali,et al.  Localization of protein‐binding sites within families of proteins , 2005, Protein science : a publication of the Protein Society.

[23]  Jinyan Li,et al.  Interacting Amino Acid Preferences of 3D Pattern Pairs at the Binding Sites of Transient and Obligate Protein Complexes , 2007, APBC.

[24]  Hanah Margalit,et al.  Characterization and prediction of protein–protein interactions within and between complexes , 2006, Proceedings of the National Academy of Sciences.

[25]  Sarah A. Teichmann,et al.  Principles of protein-protein interactions , 2002, ECCB.

[26]  J. Janin,et al.  Dissecting protein–protein recognition sites , 2002, Proteins.

[27]  Desmond J. Higham,et al.  A lock-and-key model for protein-protein interactions , 2006, Bioinform..

[28]  B. Rost,et al.  Analysing six types of protein-protein interfaces. , 2003, Journal of molecular biology.

[29]  Jinyan Li,et al.  Bioinformatics Original Paper Discovering Motif Pairs at Interaction Sites from Protein Sequences on a Proteome-wide Scale , 2022 .

[30]  H. Wolfson,et al.  A dataset of protein-protein interfaces generated with a sequence-order-independent comparison technique. , 1996, Journal of molecular biology.

[31]  Ariel Fernández,et al.  Dehydron: a structurally encoded signal for protein interaction. , 2003, Biophysical journal.

[32]  T. Clackson,et al.  A hot spot of binding energy in a hormone-receptor interface , 1995, Science.

[33]  C. Chothia,et al.  Principles of protein–protein recognition , 1975, Nature.

[34]  H. Wolfson,et al.  A new, structurally nonredundant, diverse data set of protein–protein interfaces and its implications , 2004, Protein science : a publication of the Protein Society.

[35]  H. Wolfson,et al.  Protein-Protein Interactions: Coupling of Structurally Conserved Residues and of Hot Spots across Interfaces. Implications for Docking , 2004 .

[36]  Z. Weng,et al.  Structure, function, and evolution of transient and obligate protein-protein interactions. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[37]  R. Nussinov,et al.  Hot regions in protein--protein interactions: the organization and contribution of structurally conserved hot spot residues. , 2005, Journal of molecular biology.

[38]  M Levitt,et al.  Simulating the minimum core for hydrophobic collapse in globular proteins , 1997, Protein science : a publication of the Protein Society.

[39]  W. Delano Unraveling hot spots in binding interfaces: progress and challenges. , 2002, Current opinion in structural biology.

[40]  Juergen Koepke,et al.  pH modulates the quinone position in the photosynthetic reaction center from Rhodobacter sphaeroides in the neutral and charge separated states. , 2007, Journal of molecular biology.