Creating protein models from electron-density maps using particle-filtering methods

MOTIVATION One bottleneck in high-throughput protein crystallography is interpreting an electron-density map, that is, fitting a molecular model to the 3D picture crystallography produces. Previously, we developed ACMI (Automatic Crystallographic Map Interpreter), an algorithm that uses a probabilistic model to infer an accurate protein backbone layout. Here, we use a sampling method known as particle filtering to produce a set of all-atom protein models. We use the output of ACMI to guide the particle filter's sampling, producing an accurate, physically feasible set of structures. RESULTS We test our algorithm on 10 poor-quality experimental density maps. We show that particle filtering produces accurate all-atom models, resulting in fewer chains, lower sidechain RMS error and reduced R factor, compared to simply placing the best-matching sidechains on ACMI's trace. We show that our approach produces a more accurate model than three leading methods--Textal, Resolve and ARP/WARP--in terms of main chain completeness, sidechain identification and crystallographic R factor. AVAILABILITY Source code and experimental density maps available at http://ftp.cs.wisc.edu/machine-learning/shavlik-group/programs/acmi/

[1]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[2]  Jude W. Shavlik,et al.  Improved Methods for Template-Matching in Electron-Density Maps Using Spherical Harmonics , 2007, 2007 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2007).

[3]  Simon J. Godsill,et al.  On sequential Monte Carlo sampling methods for Bayesian filtering , 2000, Stat. Comput..

[4]  Thomas R Ioerger,et al.  Automatic modeling of protein backbones in electron-density maps via prediction of Calpha coordinates. , 2002, Acta crystallographica. Section D, Biological crystallography.

[5]  Kevin Cowtan,et al.  The Buccaneer software for automated model building , 2006 .

[6]  Richard J Morris,et al.  ARP/wARP and automatic interpretation of protein electron density maps. , 2003, Methods in enzymology.

[7]  Collaborative Computational,et al.  The CCP4 suite: programs for protein crystallography. , 1994, Acta crystallographica. Section D, Biological crystallography.

[8]  Tomio Ogasawara,et al.  A cell-free protein synthesis system for high-throughput proteomics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Thomas R Ioerger,et al.  TEXTAL system: artificial intelligence techniques for automated protein model building. , 2003, Methods in enzymology.

[10]  George N Phillips,et al.  Ensemble refinement of protein crystal structures: validation and application. , 2007, Structure.

[11]  M. DePristo,et al.  Is one solution good enough? , 2006, Nature Structural &Molecular Biology.

[12]  Steven E Brenner,et al.  The Impact of Structural Genomics: Expectations and Outcomes , 2005, Science.

[13]  Axel T. Brunger,et al.  Thermal Motion and Conformational Disorder in Protein Crystal Structures: Comparison of Multi‐Conformer and Time‐Averaging Models , 1994 .

[14]  M. DePristo,et al.  Heterogeneity and inaccuracy in protein structures solved by X-ray crystallography. , 2004, Structure.

[15]  Kevin Cowtan,et al.  The Buccaneer software for automated model building. 1. Tracing protein chains. , 2006, Acta crystallographica. Section D, Biological crystallography.

[16]  Jude W. Shavlik,et al.  A probabilistic approach to protein backbone tracing in electron density maps , 2006, ISMB.

[17]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Helen M Berman,et al.  The Impact of Structural Genomics on the Protein Data Bank , 2004, American journal of pharmacogenomics : genomics-related research in drug development and clinical practice.

[19]  A. Brünger Free R value: a novel statistical quantity for assessing the accuracy of crystal structures , 1992, Nature.

[20]  Jun S. Liu,et al.  Sequential Imputations and Bayesian Missing Data Problems , 1994 .

[21]  G. Murshudov,et al.  Refinement of macromolecular structures by the maximum-likelihood method. , 1997, Acta crystallographica. Section D, Biological crystallography.

[22]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[23]  Thomas C. Terwilliger,et al.  Electronic Reprint Biological Crystallography Automated Main-chain Model Building by Template Matching and Iterative Fragment Extension , 2022 .

[24]  Gyorgy Snell,et al.  Automated sample mounting and alignment system for biological crystallography at a synchrotron source. , 2004, Structure.

[25]  Shuangquan Zang,et al.  N‐(4‐Nitro­benz­yl)quinolinium bis­(2‐thioxo‐1,3‐dithiole‐4,5‐dithiol­ato)palladium(III) acetone solvate , 2006 .