On the Characterization and Selection of Diverse Conformational Ensembles with Applications to Flexible Docking

To address challenging flexible docking problems, a number of docking algorithms pregenerate large collections of candidate conformers. To remove the redundancy from such ensembles, a central problem in this context is to report a selection of conformers maximizing some geometric diversity criterion. We make three contributions to this problem. First, we resort to geometric optimization so as to report selections maximizing the molecular volume or molecular surface area (MSA) of the selection. Greedy strategies are developed, together with approximation bounds. Second, to assess the efficacy of our algorithms, we investigate two conformer ensembles corresponding to a flexible loop of four protein complexes. By focusing on the MSA of the selection, we show that our strategy matches the MSA of standard selection methods, but resorting to a number of conformers between one and two orders of magnitude smaller. This observation is qualitatively explained using the Betti numbers of the union of balls of the selection. Finally, we replace the conformer selection problem in the context of multiple-copy flexible docking. On the aforementioned systems, we show that using the loops selected by our strategy can improve the result of the docking process.

[1]  Herbert Edelsbrunner,et al.  An incremental algorithm for Betti numbers of simplicial complexes on the 3-sphere , 1995, Comput. Aided Geom. Des..

[2]  Thomas Lengauer,et al.  FlexE: efficient molecular docking considering protein structure variations. , 2001, Journal of molecular biology.

[3]  Peter C. Fishburn,et al.  Pick-and-choose heuristics for partial set covering , 1989, Discret. Appl. Math..

[4]  C. Chothia,et al.  Principles of protein–protein recognition , 1975, Nature.

[5]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[6]  Zhiping Weng,et al.  A protein–protein docking benchmark , 2003, Proteins.

[7]  Adrian A Canutescu,et al.  Access the most recent version at doi: 10.1110/ps.03154503 References , 2003 .

[8]  Gilbert Saporta,et al.  L'analyse des données , 1981 .

[9]  G. Nemhauser,et al.  Exceptional Paper—Location of Bank Accounts to Optimize Float: An Analytic Study of Exact and Approximate Algorithms , 1977 .

[10]  Cinque S. Soto,et al.  Evaluating conformational free energies: The colony energy and its application to the problem of loop prediction , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Monique Teillaud,et al.  Design of the CGAL 3D Spherical Kernel and application to arrangements of circles on a sphere , 2009, Comput. Geom..

[12]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[13]  Berk Hess,et al.  GROMACS 3.0: a package for molecular simulation and trajectory analysis , 2001 .

[14]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[15]  Jean-Claude Latombe,et al.  Efficient Algorithms to Explore Conformation Spaces of Flexible Protein Loops , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  L. Kavraki,et al.  Modeling protein conformational ensembles: From missing loops to equilibrium fluctuations , 2006, Proteins.

[17]  Joël Janin,et al.  The third CAPRI assessment meeting Toronto, Canada, April 20-21, 2007. , 2007, Structure.

[18]  M. Zacharias,et al.  Accounting for loop flexibility during protein–protein docking , 2005, Proteins.

[19]  Martin Zacharias,et al.  Protein–protein docking with a reduced protein model accounting for side‐chain flexibility , 2003, Protein science : a publication of the Protein Society.

[20]  J. Changeux,et al.  ON THE NATURE OF ALLOSTERIC TRANSITIONS: A PLAUSIBLE MODEL. , 1965, Journal of molecular biology.

[21]  J. William Ahwood,et al.  CLASSIFICATION , 1931, Foundations of Familiar Language.

[22]  Rebecca C. Wade,et al.  Protein‐Protein Docking , 2001 .

[23]  W. Marsden I and J , 2012 .

[24]  George L. Nemhauser,et al.  Note--On "Location of Bank Accounts to Optimize Float: An Analytic Study of Exact and Approximate Algorithms" , 1979 .

[25]  Michael Henle,et al.  A combinatorial introduction to topology , 1978 .

[26]  Beat Kleiner,et al.  Graphical Methods for Data Analysis , 1983 .

[27]  Jean-Claude Latombe,et al.  Efficient Algorithms to Explore Conformation Spaces of Flexible Protein Loops , 2008, TCBB.

[28]  P. Koehl,et al.  Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy. , 1994, Journal of molecular biology.

[29]  J. Janin,et al.  Revisiting the Voronoi description of protein–protein interfaces , 2006, Protein science : a publication of the Protein Society.

[30]  Herbert Edelsbrunner,et al.  Triangulating the Surface of a Molecule , 1996, Discret. Appl. Math..

[31]  P. Koehl,et al.  A self consistent mean field approach to simultaneous gap closure and side-chain positioning in homology modelling , 1995, Nature Structural Biology.

[32]  Frédéric Cazals,et al.  Computing the arrangement of circles on a sphere, with applications in structural biology , 2009, Comput. Geom..

[33]  A. Sali,et al.  Modeling of loops in protein structures , 2000, Protein science : a publication of the Protein Society.

[34]  A. D. McLachlan,et al.  Solvation energy in protein folding and binding , 1986, Nature.

[35]  Martin Zacharias,et al.  ATTRACT: Protein–protein docking in CAPRI using a reduced protein model , 2005, Proteins.

[36]  Fernanda L. Sirota,et al.  Mechanism and energy landscape of domain swapping in the B1 domain of protein G. , 2008, Journal of molecular biology.

[37]  Jean-Claude Latombe,et al.  On the inverse kinematics of a fragment of protein backbone , 2006, ARK.

[38]  Richard Lavery,et al.  Docking macromolecules with flexible segments , 2003, J. Comput. Chem..

[39]  S. Wodak,et al.  Assessment of CAPRI predictions in rounds 3–5 shows progress in docking procedures , 2005, Proteins.

[40]  Nir London,et al.  Funnel hunting in a rough terrain: learning and discriminating native energy funnels. , 2008, Structure.

[41]  Michael Levitt,et al.  Combining efficient conformational sampling with a deformable elastic network model facilitates structure refinement at low resolution. , 2007, Structure.

[42]  Thierry Siméon,et al.  A path planning approach for computing large-amplitude motions of flexible molecules , 2005, ISMB.

[43]  Dan S. Tawfik,et al.  Antibody Multispecificity Mediated by Conformational Diversity , 2003, Science.

[44]  Pierre Alliez,et al.  Computational geometry algorithms library , 2008, SIGGRAPH '08.

[45]  S. Wodak,et al.  Docking and scoring protein complexes: CAPRI 3rd Edition , 2007, Proteins.

[46]  G. Hummer,et al.  HYDROPHOBIC FORCE FIELD AS A MOLECULAR ALTERNATIVE TO SURFACE-AREA MODELS , 1999 .

[47]  G. Vriend,et al.  Prediction of protein conformational freedom from distance constraints , 1997, Proteins.

[48]  Chao Chen,et al.  Quantifying Homology Classes II: Localization and Stability , 2007, ArXiv.

[49]  M. Nilges,et al.  Complementarity of structure ensembles in protein-protein binding. , 2004, Structure.

[50]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[51]  Frédéric Cazals,et al.  Computing the exact arrangement of circles on a sphere, with applications in structural biology: video , 2007, SCG '07.

[52]  Jack Snoeyink,et al.  Probik: Protein Backbone Motion by Inverse Kinematics , 2005, WAFR.