FINDMOL: automated identification of macromolecules in electron-density maps.

Automating the determination of novel macromolecular structures via X-ray crystallographic methods involves building a model into an electron-density map. Unfortunately, the conventional crystallographic asymmetric unit volumes are usually not well matched to the biological molecular units. In most cases, the facets of the asymmetric unit cut the molecules into a number of disconnected fragments, rendering interpretation by the crystallographer significantly more difficult. The FINDMOL algorithm is designed to quickly parse the arrangement of trace points (pseudo-atoms) derived from a skeletonized electron-density map without requiring higher level prior information such as sequence information or number of molecules in the asymmetric unit. The algorithm was tested with a variety of density-modified maps computed with medium- to low-resolution data. Typically, the resulting volume resembles the biological unit. In the remaining cases the number of disconnected fragments is very small. In all examples, secondary-structural elements such as alpha-helices or beta-sheets are easily identifiable in the defragmented arrangement. FINDMOL can greatly assist a crystallographer during manual model building or in cases where automatic model building can only build partial models owing to limitations of the data such as low resolution and/or poor phases.

[1]  Randy J Read,et al.  Recent developments in the PHENIX software for automated crystallographic structure determination. , 2004, Journal of synchrotron radiation.

[2]  J. Tanner,et al.  Flavin reductase P: structure of a dimeric enzyme that reduces flavin. , 1996, Biochemistry.

[3]  Reinhard Jahn,et al.  Crystal structure of a SNARE complex involved in synaptic exocytosis at 2.4 Å resolution , 1998, Nature.

[4]  J. Thornton,et al.  Protein–protein interfaces: Analysis of amino acid conservation in homodimers , 2001, Proteins.

[5]  M. A. Wilson,et al.  The 1.0 A crystal structure of Ca(2+)-bound calmodulin: an analysis of disorder and implications for functionally relevant plasticity. , 2000, Journal of molecular biology.

[6]  良二 上田 J. Appl. Cryst.の発刊に際して , 1970 .

[7]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[8]  T. Oldfield Applications for macromolecular map interpretation: X-AUTOFIT, X-POWERFIT, X-BUILD, X-LIGAND, and X-SOLVATE. , 2003, Methods in enzymology.

[9]  S H Bryant,et al.  Extent and nature of contacts between protein molecules in crystal lattices and between subunits of protein oligomers , 1997, Proteins.

[10]  Nicholas K. Sauter,et al.  The Computational Crystallography Toolbox: crystallographic algorithms in a reusable software framework , 2002 .

[11]  Patrick Argos,et al.  NADP‐Dependent enzymes. II: Evolution of the mono‐ and dinucleotide binding domains , 1997, Proteins.

[12]  G. Langlet,et al.  International Tables for Crystallography , 2002 .

[13]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .

[14]  Thomas Terwilliger,et al.  SOLVE and RESOLVE: automated structure solution, density modification and model building. , 2004, Journal of synchrotron radiation.

[15]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[16]  J Greer Three-dimensional pattern recognition: an approach to automated interpretation of electron density maps of proteins. , 1974, Journal of molecular biology.

[17]  J. Thornton,et al.  PQS: a protein quaternary structure file server. , 1998, Trends in biochemical sciences.

[18]  William I. Weis,et al.  Three-Dimensional Structure of the Armadillo Repeat Region of β-Catenin , 1997, Cell.

[19]  Thomas R Ioerger,et al.  TEXTAL system: artificial intelligence techniques for automated protein model building. , 2003, Methods in enzymology.

[20]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[21]  K D Cowtan,et al.  Density modification for macromolecular phase improvement. , 1999, Progress in biophysics and molecular biology.