Computing Protein Structures from Electron Density Maps: The Missing Fragment Problem

Rapid protein structure determination relies greatly on the availability of software that can automatically generate a protein model from an experimental electron density map. Tremendous advances in this area have been achieved recently. In favorable cases, available software can build over 90% of the final model. However, in less favorable circumstances, particularly at medium-low resolution, only about 2/3 completeness is attained. Manual completion of these partial models is usually feasible but time-consuming. The electron density in areas of missing fragments is often of poorer quality, especially for flexible loops, making manual interpretation particularly difficult. Except for the beginning and end of the protein chain, the end points of each missing fragment are known from the partial model. Thanks to the kinematic chain structure of the protein backbone, loop completion can be approached as an inverse kinematics problem. A fast, two-stage inverse kinematics algorithm is presented that fits a protein chain of known sequence to the electron density map between two anchor points. Our approach first samples a large set of candidates that meet the closure constraint and then refines the most promising candidates to improve the fit. The algorithm has been tested and used to aid protein model completion in areas of poor density, closing loops of up to 12 residues to within 0.25A RMSD of the final refined structure. It has also been used to close missing loops of the same length in partial models built at medium-low resolution

[1]  Oussama Khatib,et al.  Whole-Body Dynamic Behavior and Control of Human-like Robots , 2004, Int. J. Humanoid Robotics.

[2]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[3]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[4]  Thierry Siméon,et al.  A random loop generator for planning the motions of closed kinematic chains using PRM methods , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[5]  Ming Zhang,et al.  A New Method for Fast and Accurate Derivation of Molecular Conformations , 2002, J. Chem. Inf. Comput. Sci..

[6]  T. A. Jones,et al.  Using known substructures in protein model building and crystallography. , 1986, The EMBO journal.

[7]  R. Diamond A real-space refinement procedure for proteins , 1971 .

[8]  J. Badger,et al.  An evaluation of automated model-building procedures for protein crystallography. , 2003, Acta crystallographica. Section D, Biological crystallography.

[9]  Chih-Cheng Chen,et al.  A combined optimization method for solving the inverse kinematics problems of mechanical manipulators , 1991, IEEE Trans. Robotics Autom..

[10]  M. DePristo,et al.  Ab initio construction of polypeptide fragments: Efficient generation of accurate, representative ensembles , 2003, Proteins.

[11]  Chaok Seok,et al.  A kinematic view of loop closure , 2004, J. Comput. Chem..

[12]  T A Jones,et al.  Electron-density map interpretation. , 1997, Methods in enzymology.

[13]  Bernard Roth,et al.  Kinematic analysis of the 6R manipulator of general geometry , 1991 .

[14]  Dinesh Manocha,et al.  Kinematic Manipulation of Molecular Chains Subject to Rigid Constraint , 1994, ISMB.

[15]  A. Kirfel,et al.  New analytical scattering‐factor functions for free atoms and ions , 1995 .

[16]  Oussama Khatib,et al.  Operational space dynamics: efficient algorithms for modeling and control of branching mechanisms , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[17]  K. Fidelis,et al.  Comparison of systematic search and database methods for constructing segments of protein structure. , 1994, Protein engineering.

[18]  Thierry Siméon,et al.  Geometric algorithms for the conformational analysis of long protein loops , 2004, J. Comput. Chem..

[19]  Adrian A Canutescu,et al.  Cyclic coordinate descent: A robotics algorithm for protein loop closure , 2003, Protein science : a publication of the Protein Society.

[20]  C. Levinthal,et al.  Predicting antibody hypervariable loop conformations II: Minimization and molecular dynamics studies of MCPC603 from many randomly generated loop conformations , 1986, Proteins.

[21]  Lydia E. Kavraki,et al.  Simulated knot tying , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[22]  H. Scheraga,et al.  Monte Carlo-minimization approach to the multiple-minima problem in protein folding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[23]  G. Murshudov,et al.  Refinement of macromolecular structures by the maximum-likelihood method. , 1997, Acta crystallographica. Section D, Biological crystallography.

[24]  H. Scheraga,et al.  Exact analytical loop closure in proteins using polynomial equations , 1999 .

[25]  R. Abagyan,et al.  Biased probability Monte Carlo conformational searches and electrostatic calculations for peptides and proteins. , 1994, Journal of molecular biology.

[26]  Oussama Khatib,et al.  A unified approach for motion and force control of robot manipulators: The operational space formulation , 1987, IEEE J. Robotics Autom..

[27]  Lydia E. Kavraki,et al.  Probabilistic roadmaps for path planning in high-dimensional configuration spaces , 1996, IEEE Trans. Robotics Autom..

[28]  Ronald M Levy,et al.  Have we seen all structures corresponding to short protein fragments in the Protein Data Bank? An update. , 2003, Protein engineering.

[29]  M. Karplus,et al.  PDB-based protein loop prediction: parameters for selection and methods for optimization. , 1997, Journal of molecular biology.

[30]  Leonidas J. Guibas,et al.  Inverse Kinematics in Biology: The Protein Loop Closure Problem , 2005, Int. J. Robotics Res..

[31]  Thomas C. Terwilliger,et al.  Electronic Reprint Biological Crystallography Automated Main-chain Model Building by Template Matching and Iterative Fragment Extension , 2022 .

[32]  Adam Godzik,et al.  Crystal structure of a zinc‐containing glycerol dehydrogenase (TM0423) from Thermotoga maritima at 1.5 Å resolution , 2002, Proteins.

[33]  T J Oldfield,et al.  A number of real-space torsion-angle refinement techniques for proteins, nucleic acids, ligands and solvent. , 2001, Acta crystallographica. Section D, Biological crystallography.

[34]  C. Deane,et al.  A novel exhaustive search algorithm for predicting the conformation of polypeptide segments in proteins , 2000, Proteins.

[35]  Qiang Zheng,et al.  Loop closure via bond scaling and relaxation , 1993, J. Comput. Chem..

[36]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[37]  Ian W. Davis,et al.  Structure validation by Cα geometry: ϕ,ψ and Cβ deviation , 2003, Proteins.

[38]  Michael S. Chapman,et al.  Restrained real-space macromolecular atomic refinement using a new resolution-dependent electron-density function , 1995 .

[39]  A. Sali,et al.  Modeling of loops in protein structures , 2000, Protein science : a publication of the Protein Society.

[40]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[41]  W. Braun,et al.  Rapid calculation of first and second derivatives of conformational energy with respect to dihedral angles for proteins general recurrent equations , 1984, Comput. Chem..

[42]  Jeff M. Phillips,et al.  Spacecraft Rendezvous and Docking with Real-Time, Randomized Optimization , 2003 .

[43]  V S Lamzin,et al.  wARP: improvement and extension of crystallographic phases by weighted averaging of multiple-refined dummy atomic models. , 1997, Acta crystallographica. Section D, Biological crystallography.

[44]  H. Stanley,et al.  Statistical physics of macromolecules , 1995 .

[45]  J. Burdick On the inverse kinematics of redundant manipulators: characterization of the self-motion manifolds , 1989 .

[46]  Dinesh Manocha,et al.  Conformational analysis of molecular chains using nano-kinematics , 1995, Comput. Appl. Biosci..

[47]  D. Levitt,et al.  A new software routine that automates the fitting of protein X-ray crystallographic electron-density maps. , 2001, Acta crystallographica. Section D, Biological crystallography.

[48]  Peter Adams,et al.  The EMMIX software for the fitting of mixtures of normal and t-components , 1999 .

[49]  J. Garnier,et al.  Modeling of protein loops by simulated annealing , 1993, Protein science : a publication of the Protein Society.

[50]  Nancy M. Amato,et al.  A kinematics-based probabilistic roadmap method for high DOF closed chain systems , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[51]  Thomas R Ioerger,et al.  TEXTAL system: artificial intelligence techniques for automated protein model building. , 2003, Methods in enzymology.

[52]  Jeffrey C. Trinkle,et al.  Complete Path Planning for Closed Kinematic Chains with Spherical Joints , 2002, Int. J. Robotics Res..

[53]  Adam Godzik,et al.  Structural genomics of the Thermotoga maritima proteome implemented in a high-throughput structure determination pipeline , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[54]  J. Moult,et al.  An algorithm for determining the conformation of polypeptide segments in proteins by systematic search , 1986, Proteins.

[55]  N. Go,et al.  Ring Closure and Local Conformational Deformations of Chain Molecules , 1970 .

[56]  Lydia E. Kavraki,et al.  Finding Solutions of the Inverse Kinematics Problems in Computer-aided Drug Design , 2002 .

[57]  Gregory S. Chirikjian,et al.  General methods for computing hyper-redundant manipulator inverse kinematics , 1993, Proceedings of 1993 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '93).

[58]  Dinesh Manocha,et al.  Efficient inverse kinematics for general 6R manipulators , 1994, IEEE Trans. Robotics Autom..

[59]  Ian D. Walker,et al.  A consistent null-space based approach to inverse kinematics of redundant robots , 1993, [1993] Proceedings IEEE International Conference on Robotics and Automation.

[60]  C. Levinthal,et al.  Predicting antibody hypervariable loop conformation. I. Ensembles of random conformations for ringlike structures , 1987, Biopolymers.

[61]  R. Huber,et al.  Accurate Bond and Angle Parameters for X-ray Protein Structure Refinement , 1991 .

[62]  Lydia E. Kavraki,et al.  Randomized path planning for linkages with closed kinematic chains , 2001, IEEE Trans. Robotics Autom..

[63]  Kamal K. Gupta,et al.  The kinematic roadmap: a motion planning based global approach for inverse kinematics of redundant robots , 1999, IEEE Trans. Robotics Autom..

[64]  M. Karplus,et al.  Prediction of the folding of short polypeptide segments by uniform conformational sampling , 1987, Biopolymers.