VASP-S: A Volumetric Analysis and Statistical Model for Predicting Steric Influences on Protein-Ligand Binding Specificity

Many fields seek to identify steric influences in protein-ligand binding specificity. In some cases, these influences can be found by visually comparing protein structures, but subtler influences, whose significance may only be apparent from the analysis of many structures, are harder to find. To assist this process, we present VASP-S (Volumetric Analysis of Surface Properties with Statistics), an unsupervised volumetric analysis and statistical model for isolating statistically significant structural variations that may influence specificity. We applied these methods to analyze sequentially nonredundant structural representatives of two well-studied protein families: the canonical serine proteases and the enolase super family. We observed that statistically significant structural variations, as identified by VASP-S, reproduced experimentally established determinants of specificity. These results suggest that unsupervised methods, supported by statistical models, may be able to automatically identify variations that sterically influence specific binding in catalytic sites.

[1]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[2]  D. Shotton,et al.  Three-dimensional Structure of Tosyl-elastase , 1970, Nature.

[3]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[4]  Xiaoyu Zhang,et al.  Application of New Multiresolution Methods for the Comparison of Biomolecular Electrostatic Properties in the Absence of Global Structural Similarity , 2006, Multiscale Model. Simul..

[5]  L. Hedstrom Serine protease mechanism and specificity. , 2002, Chemical reviews.

[6]  Anthony C. Bishop,et al.  Structural basis for selective inhibition of Src family kinases by PP1. , 1999, Chemistry & biology.

[7]  Szymon Rusinkiewicz,et al.  Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors , 2003, Symposium on Geometry Processing.

[8]  Lydia E. Kavraki,et al.  The LabelHash algorithm for substructure matching , 2010, BMC Bioinformatics.

[9]  Ruth Nussinov,et al.  Recognition of Binding Patterns Common to a Set of Protein Structures , 2005, RECOMB.

[10]  K. Sharp,et al.  Travel depth, a new shape descriptor for macromolecules: application to ligand binding. , 2006, Journal of molecular biology.

[11]  N P Willassen,et al.  Purification and characterization of pancreatic elastase from North Atlantic salmon (Salmo salar). , 1998, Molecular marine biology and biotechnology.

[12]  Lydia E. Kavraki,et al.  The MASH Pipeline for Protein Function Prediction and an Algorithm for the Geometric Refinement of 3D Motifs , 2007, J. Comput. Biol..

[13]  Fabiana Bahna,et al.  Type II Cadherin Ectodomain Structures: Implications for Classical Cadherin Specificity , 2006, Cell.

[14]  Jie Liang,et al.  CASTp: Computed Atlas of Surface Topography of proteins , 2003, Nucleic Acids Res..

[15]  Michael K. Gilson,et al.  Evaluating the Substrate-Envelope Hypothesis: Structural Analysis of Novel HIV-1 Protease Inhibitors Designed To Be Robust against Drug Resistance , 2010, Journal of Virology.

[16]  K Morihara,et al.  Comparison of the specificities of various serine proteinases from microorganisms. , 1969, Archives of biochemistry and biophysics.

[17]  Ronald L. Rivest,et al.  Introduction to Algorithms, Second Edition , 2001 .

[18]  Barry Honig,et al.  GRASP2: visualization, surface properties, and electrostatics of macromolecular structures and sequences. , 2003, Methods in enzymology.

[19]  Izhar Wallach,et al.  Prediction of sub-cavity binding preferences using an adaptive physicochemical structure representation , 2009, Bioinform..

[20]  Janet M. Thornton,et al.  An algorithm for constraint-based structural template matching: application to 3D templates with statistical analysis , 2003, Bioinform..

[21]  Patricia C. Babbitt,et al.  Automated discovery of 3D motifs for protein function annotation , 2006, Bioinform..

[22]  W R Taylor,et al.  SSAP: sequential structure alignment program for protein structure comparison. , 1996, Methods in enzymology.

[23]  Barry Honig,et al.  VASP: A Volumetric Analysis of Surface Properties Yields Insights into Protein-Ligand Binding Specificity , 2010, PLoS Comput. Biol..

[24]  Graham J. L. Kemp,et al.  Fast computation, rotation, and comparison of low resolution spherical harmonic molecular surfaces , 1999, J. Comput. Chem..

[25]  B. Luisi,et al.  Crystal structure of the Escherichia coli RNA degradosome component enolase. , 2001, Journal of molecular biology.

[26]  Lei Xie,et al.  Detecting evolutionary relationships across existing fold space, using sequence order-independent profile–profile alignments , 2008, Proceedings of the National Academy of Sciences.

[27]  A. Bogan,et al.  Anatomy of hot spots in protein interfaces. , 1998, Journal of molecular biology.

[28]  Patricia C Babbitt,et al.  Evolution of enzymatic activities in the enolase superfamily: L-rhamnonate dehydratase. , 2008, Biochemistry.

[29]  Olivier Lichtarge,et al.  Cavity-aware motifs reduce false positives in protein function prediction. , 2006, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[30]  J. Thornton,et al.  Shape variation in protein binding pockets and their ligands. , 2007, Journal of molecular biology.

[31]  B. Honig,et al.  On the nature of cavities on protein surfaces: Application to the identification of drug‐binding sites , 2006, Proteins.

[32]  Andrzej Joachimiak,et al.  Protein Functional Surfaces: Global Shape Matching and Local Spatial Alignments of Ligand Binding Sites , 2008, BMC Structural Biology.

[33]  K. Kinoshita,et al.  Identification of the ligand binding sites on the molecular surface of proteins , 2005, Protein science : a publication of the Protein Society.

[34]  H. Wolfson,et al.  Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[35]  M. L. Connolly Solvent-accessible surfaces of proteins and nucleic acids. , 1983, Science.

[36]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[37]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[38]  G L Kenyon,et al.  Mechanism of the reaction catalyzed by mandelate racemase: structure and mechanistic properties of the D270N mutant. , 1995, Biochemistry.

[39]  Lydia E. Kavraki,et al.  Algorithms for Structural Comparison and Statistical Analysis of 3D Protein Motifs , 2004, Pacific Symposium on Biocomputing.

[40]  R. Nussinov,et al.  Molecular shape comparisons in searches for active sites and functional similarity. , 1998, Protein engineering.

[41]  K Morihara,et al.  Comparison of the specificities of various neutral proteinases from microorganisms. , 1968, Archives of biochemistry and biophysics.

[42]  P C Babbitt,et al.  Evolution of an enzyme active site: the structure of a new crystal form of muconate lactonizing enzyme compared with mandelate racemase and enolase. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[43]  D M Shotton,et al.  The three-dimensional structure of crystalline porcine pancreatic elastase. , 1970, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[44]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[45]  P. Babbitt,et al.  Divergent evolution of enzymatic function: mechanistically diverse superfamilies and functionally distinct suprafamilies. , 2001, Annual review of biochemistry.

[46]  L Szilágyi,et al.  Electrostatic complementarity within the substrate-binding pocket of trypsin. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[47]  R. Russell,et al.  Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution. , 1998, Journal of molecular biology.

[48]  Ruth Nussinov,et al.  FlexProt: Alignment of Flexible Protein Structures Without a Predefinition of Hinge Regions , 2004, J. Comput. Biol..

[49]  G. H. Reed,et al.  The enolase superfamily: a general strategy for enzyme-catalyzed abstraction of the alpha-protons of carboxylic acids. , 1996, Biochemistry.

[50]  C Sander,et al.  Mapping the Protein Universe , 1996, Science.

[51]  Robert B Russell,et al.  A model for statistical significance of local similarities in structure. , 2003, Journal of molecular biology.

[52]  J. Thornton,et al.  A method for localizing ligand binding pockets in protein structures , 2005, Proteins.

[53]  M. G. Stone,et al.  Face Traverses and a Volume Algorithm for Polyhedra , 1991, New Results and New Trends in Computer Science.

[54]  R. Laskowski SURFNET: a program for visualizing molecular surfaces, cavities, and intermolecular interactions. , 1995, Journal of molecular graphics.

[55]  B Honig,et al.  An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance. , 2000, Journal of molecular biology.

[56]  D. Goodin,et al.  Artificial protein cavities as specific ligand-binding templates: characterization of an engineered heterocyclic cation-binding site that preserves the evolved specificity of the parent protein. , 2002, Journal of molecular biology.

[57]  Jie Liang,et al.  Inferring functional relationships of proteins from local sequence and spatial surface patterns. , 2003, Journal of molecular biology.

[58]  G. Klebe,et al.  A new method to detect related function among proteins independent of sequence and fold homology. , 2002, Journal of molecular biology.