A statistical model of overlapping volume in ligand binding cavities

Understanding and predicting protein-ligand binding preferences is an essential aspect of research in many fields, especially drug design. To assist in this effort, this paper presents VASP-I (Volumetric Analysis of Surface Properties for Intersections), a statistical model for estimating the probability that a set of cavities exhibit the same conserved region, and may thus have the same binding preferences. We applied this method to analyze ligand binding cavities of sequentially nonredundant structural representatives of the serine protease and enolase superfamilies. On these datasets VASP-I correctly distinguished sets of cavities with identical binding preferences from other sets with varying binding preferences. These results indicate that it can be possible to predict binding cavities that exhibit different binding preferences, even when the biochemical mechanism is unknown.

[1]  Nick V. Grishin,et al.  Structural drift: a possible path to protein fold change , 2005, Bioinform..

[2]  B. Luisi,et al.  Crystal structure of the Escherichia coli RNA degradosome component enolase. , 2001, Journal of molecular biology.

[3]  Adam Godzik,et al.  Multiple flexible structure alignment using partial order graphs , 2005, Bioinform..

[4]  Xiaoyu Zhang,et al.  Application of New Multiresolution Methods for the Comparison of Biomolecular Electrostatic Properties in the Absence of Global Structural Similarity , 2006, Multiscale Model. Simul..

[5]  L. Hedstrom Serine protease mechanism and specificity. , 2002, Chemical reviews.

[6]  A. Wlodawer,et al.  Structural characterization of B and non-B subtypes of HIV-protease: insights into the natural susceptibility to drug resistance development. , 2007, Journal of molecular biology.

[7]  David W. Ritchie,et al.  Fast computation, rotation, and comparison of low resolution spherical harmonic molecular surfaces , 1999, Journal of Computational Chemistry.

[8]  K. Sharp,et al.  Travel depth, a new shape descriptor for macromolecules: application to ligand binding. , 2006, Journal of molecular biology.

[9]  A. Bogan,et al.  Anatomy of hot spots in protein interfaces. , 1998, Journal of molecular biology.

[10]  H. Wolfson,et al.  Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[11]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[12]  R. Kolodny,et al.  Protein structure comparison: implications for the nature of 'fold space', and structure and function prediction. , 2006, Current opinion in structural biology.

[13]  Michael E. Mortenson,et al.  Geometric Modeling , 2008, Encyclopedia of GIS.

[14]  B. Honig,et al.  High-throughput computational structure-based characterization of protein families: START domains and implications for structural genomics , 2010, Journal of Structural and Functional Genomics.

[15]  M. L. Connolly Solvent-accessible surfaces of proteins and nucleic acids. , 1983, Science.

[16]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[17]  Jie Liang,et al.  CASTp: Computed Atlas of Surface Topography of proteins , 2003, Nucleic Acids Res..

[18]  Janet M. Thornton,et al.  An algorithm for constraint-based structural template matching: application to 3D templates with statistical analysis , 2003, Bioinform..

[19]  Barry Honig,et al.  GRASP2: visualization, surface properties, and electrostatics of macromolecular structures and sequences. , 2003, Methods in enzymology.

[20]  R. Nussinov,et al.  Molecular shape comparisons in searches for active sites and functional similarity. , 1998, Protein engineering.

[21]  Szymon Rusinkiewicz,et al.  Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors , 2003, Symposium on Geometry Processing.

[22]  Lei Xie,et al.  Using multiple structure alignments, fast model building, and energetic analysis in fold recognition and homology modeling , 2003, Proteins.

[23]  M. G. Stone,et al.  Face Traverses and a Volume Algorithm for Polyhedra , 1991, New Results and New Trends in Computer Science.

[24]  R. Laskowski SURFNET: a program for visualizing molecular surfaces, cavities, and intermolecular interactions. , 1995, Journal of molecular graphics.

[25]  N P Willassen,et al.  Purification and characterization of pancreatic elastase from North Atlantic salmon (Salmo salar). , 1998, Molecular marine biology and biotechnology.

[26]  Lawrence Shapiro,et al.  Specificity of cell-cell adhesion by classical cadherins: Critical role for low-affinity dimerization through beta-strand swapping. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Lydia E. Kavraki,et al.  The MASH Pipeline for Protein Function Prediction and an Algorithm for the Geometric Refinement of 3D Motifs , 2007, J. Comput. Biol..

[28]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[29]  Aristides A. G. Requicha,et al.  Geometric Modeling of Mechanical Parts and Processes , 1977, Computer.

[30]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[31]  Barry Honig,et al.  Is protein classification necessary? Toward alternative approaches to function annotation. , 2009, Current opinion in structural biology.

[32]  K Morihara,et al.  Comparison of the specificities of various serine proteinases from microorganisms. , 1969, Archives of biochemistry and biophysics.

[33]  Brian Yuan Chen,et al.  VASP-S: A Volumetric Analysis and Statistical Model for Predicting Steric Influences on Protein-Ligand Binding Specificity , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine.

[34]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[35]  R. Russell,et al.  Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution. , 1998, Journal of molecular biology.

[36]  Ruth Nussinov,et al.  FlexProt: Alignment of Flexible Protein Structures Without a Predefinition of Hinge Regions , 2004, J. Comput. Biol..

[37]  G L Kenyon,et al.  Mechanism of the reaction catalyzed by mandelate racemase: structure and mechanistic properties of the D270N mutant. , 1995, Biochemistry.

[38]  Peter G. Schultz,et al.  A chemical switch for inhibitor-sensitive alleles of any protein kinase , 2000, Nature.

[39]  Patricia C. Babbitt,et al.  Automated discovery of 3D motifs for protein function annotation , 2006, Bioinform..

[40]  W R Taylor,et al.  SSAP: sequential structure alignment program for protein structure comparison. , 1996, Methods in enzymology.

[41]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[42]  Anthony C. Bishop,et al.  Structural basis for selective inhibition of Src family kinases by PP1. , 1999, Chemistry & biology.

[43]  Patricia C Babbitt,et al.  Evolution of enzymatic activities in the enolase superfamily: L-rhamnonate dehydratase. , 2008, Biochemistry.

[44]  Olivier Lichtarge,et al.  Cavity-aware motifs reduce false positives in protein function prediction. , 2006, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[45]  Fabiana Bahna,et al.  Type II Cadherin Ectodomain Structures: Implications for Classical Cadherin Specificity , 2006, Cell.

[46]  Lydia E. Kavraki,et al.  Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction , 2010, BMC Bioinformatics.

[47]  Barry Honig,et al.  VASP: A Volumetric Analysis of Surface Properties Yields Insights into Protein-Ligand Binding Specificity , 2010, PLoS Comput. Biol..

[48]  L Szilágyi,et al.  Electrostatic complementarity within the substrate-binding pocket of trypsin. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[49]  K. Kinoshita,et al.  Identification of the ligand binding sites on the molecular surface of proteins , 2005, Protein science : a publication of the Protein Society.

[50]  Lydia E. Kavraki,et al.  Cavity Scaling: Automated Refinement of Cavity-Aware motifs in protein Function Prediction , 2007, J. Bioinform. Comput. Biol..

[51]  G. H. Reed,et al.  The enolase superfamily: a general strategy for enzyme-catalyzed abstraction of the alpha-protons of carboxylic acids. , 1996, Biochemistry.

[52]  C Sander,et al.  Mapping the Protein Universe , 1996, Science.

[53]  Robert B Russell,et al.  A model for statistical significance of local similarities in structure. , 2003, Journal of molecular biology.

[54]  Jie Liang,et al.  Structural signatures of enzyme binding pockets from order-independent surface alignment: a study of metalloendopeptidase and NAD binding proteins. , 2011, Journal of molecular biology.

[55]  J. Thornton,et al.  A method for localizing ligand binding pockets in protein structures , 2005, Proteins.

[56]  B Honig,et al.  An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance. , 2000, Journal of molecular biology.

[57]  Jie Liang,et al.  Inferring functional relationships of proteins from local sequence and spatial surface patterns. , 2003, Journal of molecular biology.

[58]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[59]  Andrzej Joachimiak,et al.  Protein Functional Surfaces: Global Shape Matching and Local Spatial Alignments of Ligand Binding Sites , 2008, BMC Structural Biology.

[60]  Olivier Lichtarge,et al.  Composite motifs integrating multiple protein structures increase sensitivity for function prediction. , 2007, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[61]  Adam Godzik,et al.  Flexible structure alignment by chaining aligned fragment pairs allowing twists , 2003, ECCB.

[62]  Lydia E. Kavraki,et al.  Geometric Sieving: Automated Distributed Optimization of 3D Motifs for Protein Function Prediction , 2006, RECOMB.

[63]  K Morihara,et al.  Comparison of the specificities of various neutral proteinases from microorganisms. , 1968, Archives of biochemistry and biophysics.

[64]  S. Umeyama,et al.  Least-Squares Estimation of Transformation Parameters Between Two Point Patterns , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[65]  Lei Xie,et al.  Detecting evolutionary relationships across existing fold space, using sequence order-independent profile–profile alignments , 2008, Proceedings of the National Academy of Sciences.

[66]  J. Thornton,et al.  Shape variation in protein binding pockets and their ligands. , 2007, Journal of molecular biology.

[67]  B. Honig,et al.  On the nature of cavities on protein surfaces: Application to the identification of drug‐binding sites , 2006, Proteins.