M-ORBIS: Mapping of mOleculaR Binding sItes and Surfaces

M-ORBIS is a Molecular Cartography approach that performs integrative high-throughput analysis of structural data to localize all types of binding sites and associated partners by homology and to characterize their properties and behaviors in a systemic way. The robustness of our binding site inferences was compared to four curated datasets corresponding to protein heterodimers and homodimers and protein–DNA/RNA assemblies. The Molecular Cartographies of structurally well-detailed proteins shows that 44% of their surfaces interact with non-solvent partners. Residue contact frequencies with water suggest that ∼86% of their surfaces are transiently solvated, whereas only 15% are specifically solvated. Our analysis also reveals the existence of two major binding site families: specific binding sites which can only bind one type of molecule (protein, DNA, RNA, etc.) and polyvalent binding sites that can bind several distinct types of molecule. Specific homodimer binding sites are for instance nearly twice as hydrophobic than previously described and more closely resemble the protein core, while polyvalent binding sites able to form homo and heterodimers more closely resemble the surfaces involved in crystal packing. Similarly, the regions able to bind DNA and to alternatively form homodimers, are more hydrophobic and less polar than previously described DNA binding sites.

[1]  Eduardo Garcia Urdiales,et al.  Accurate Prediction of Peptide Binding Sites on Protein Surfaces , 2009, PLoS Comput. Biol..

[2]  Julie Bernauer,et al.  DiMoVo: a Voronoi tessellation-based method for discriminating crystallographic and biological protein-protein interactions , 2008, Bioinform..

[3]  K Nadassy,et al.  Structural features of protein-nucleic acid recognition sites. , 1999, Biochemistry.

[4]  P. Chambon,et al.  In vivo activation of PPAR target genes by RXR homodimers , 2004, The EMBO journal.

[5]  J. Janin,et al.  Dissecting protein–protein recognition sites , 2002, Proteins.

[6]  Kengo Kinoshita,et al.  PiSite: a database of protein interaction sites using multiple binding states in the PDB , 2008, Nucleic Acids Res..

[7]  良二 上田 J. Appl. Cryst.の発刊に際して , 1970 .

[8]  Thomas Lengauer,et al.  Conformational analysis of alternative protein structures , 2007, Bioinform..

[9]  Sarah A. Teichmann,et al.  Principles of protein-protein interactions , 2002, ECCB.

[10]  J. Janin,et al.  Dissecting subunit interfaces in homodimeric proteins , 2003, Proteins.

[11]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Olivier Poch,et al.  PipeAlign: a new toolkit for protein family analysis , 2003, Nucleic Acids Res..

[13]  Aleksey A. Porollo,et al.  Prediction‐based fingerprints of protein–protein interactions , 2006, Proteins.

[14]  D. Moras,et al.  Defining and characterizing protein surface using alpha shapes , 2009, Proteins.

[15]  Bin Li,et al.  Fast protein tertiary structure retrieval based on global surface shape similarity , 2008, Proteins.

[16]  J. Skolnick,et al.  What is the relationship between the global structures of apo and holo proteins? , 2007, Proteins.

[17]  J. Thornton,et al.  Discriminating between homodimeric and monomeric proteins in the crystalline state , 2000, Proteins.

[18]  R L Stanfield,et al.  Protein-peptide interactions. , 1995, Current opinion in structural biology.

[19]  J. Janin,et al.  Dissecting protein–RNA recognition sites , 2008, Nucleic acids research.

[20]  Irena Roterman,et al.  Localization of ligand binding site in proteins identified in silico , 2007, Journal of molecular modeling.

[21]  P. Chambon,et al.  Crystal structure of a heterodimeric complex of RAR and RXR ligand-binding domains. , 2000, Molecular cell.

[22]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[23]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[24]  P. Bourne,et al.  Exploiting sequence and structure homologs to identify protein–protein binding sites , 2005, Proteins.

[25]  Donald Hamelberg,et al.  The role of conserved water molecules in the catalytic domain of protein kinases , 2009, Proteins.

[26]  Kornelia Polyak,et al.  Mechanism of CDK activation revealed by the structure of a cyclinA-CDK2 complex , 1995, Nature.

[27]  Olivier Poch,et al.  Signature of the oligomeric behaviour of nuclear receptors at the sequence and structural level , 2004, EMBO reports.

[28]  K. Henrick,et al.  Inference of macromolecular assemblies from crystalline state. , 2007, Journal of molecular biology.

[29]  Hongbo Zhu,et al.  NOXclass: prediction of protein-protein interaction types , 2006, BMC Bioinformatics.

[30]  J. Janin,et al.  A dissection of specific and non-specific protein-protein interfaces. , 2004, Journal of molecular biology.

[31]  Janet M Thornton,et al.  Inferring protein function from structure. , 2003, Methods of biochemical analysis.

[32]  Janet M. Thornton,et al.  Automatic inference of protein quaternary structure from crystals , 2003 .

[33]  Joël Janin,et al.  Specific versus non-specific contacts in protein crystals , 1997, Nature Structural Biology.

[34]  Zhilei Chen,et al.  A highly sensitive selection method for directed evolution of homing endonucleases , 2005, Nucleic acids research.

[35]  Z. Weng,et al.  Protein–protein docking benchmark version 3.0 , 2008, Proteins.

[36]  J. Janin,et al.  Protein–protein interaction and quaternary structure , 2008, Quarterly Reviews of Biophysics.

[37]  T. Clackson,et al.  Structural and functional analysis of the 1:1 growth hormone:receptor complex reveals the molecular basis for receptor affinity. , 1998, Journal of molecular biology.

[38]  T. M. Raschke,et al.  Water structure and interactions with protein surfaces. , 2006, Current opinion in structural biology.

[39]  A. Bogan,et al.  Anatomy of hot spots in protein interfaces. , 1998, Journal of molecular biology.

[40]  Sung-Hou Kim,et al.  Crystal structure of cyclin-dependent kinase 2 , 1993, Nature.

[41]  C. Chothia,et al.  The atomic structure of protein-protein recognition sites. , 1999, Journal of molecular biology.

[42]  Peter B. McGarvey,et al.  UniRef: comprehensive and non-redundant UniProt reference clusters , 2007, Bioinform..

[43]  J. Thornton,et al.  PQS: a protein quaternary structure file server. , 1998, Trends in biochemical sciences.

[44]  Pinak Chakrabarti,et al.  Hydration of protein–protein interfaces , 2005, Proteins.

[45]  Zhiping Weng,et al.  Protein–protein docking benchmark version 4.0 , 2010, Proteins.

[46]  J. Thornton,et al.  Searching for functional sites in protein structures. , 2004, Current opinion in chemical biology.