γ-TEMPy: Simultaneous Fitting of Components in 3D-EM Maps of Their Assembly Using a Genetic Algorithm

Summary We have developed a genetic algorithm for building macromolecular complexes using only a 3D-electron microscopy density map and the atomic structures of the relevant components. For efficient sampling the method uses map feature points calculated by vector quantization. The fitness function combines a mutual information score that quantifies the goodness of fit with a penalty score that helps to avoid clashes between components. Testing the method on ten assemblies (containing 3–8 protein components) and simulated density maps at 10, 15, and 20 Å resolution resulted in identification of the correct topology in 90%, 70%, and 60% of the cases, respectively. We further tested it on four assemblies with experimental maps at 7.2–23.5 Å resolution, showing the ability of the method to identify the correct topology in all cases. We have also demonstrated the importance of the map feature-point quality on assembly fitting in the lack of additional experimental information.

[1]  Andrej Sali,et al.  Inferential optimization for simultaneous fitting of multiple components into a CryoEM map of their assembly. , 2009, Journal of molecular biology.

[2]  A. Sali,et al.  Comparative protein structure modeling by iterative alignment, model building and model assessment. , 2003, Nucleic acids research.

[3]  John D. Westbrook,et al.  EMDataBank.org: unified data resource for CryoEM , 2010, Nucleic Acids Res..

[4]  Yifan Cheng Single-Particle Cryo-EM at Crystallographic Resolution , 2015, Cell.

[5]  J. Mccammon,et al.  Situs: A package for docking crystal structures into low-resolution maps from electron microscopy. , 1999, Journal of structural biology.

[6]  D. Vasishtan,et al.  Extracellular Vesicles: A Platform for the Structure Determination of Membrane Proteins by Cryo-EM , 2014, Structure.

[7]  Min Xu,et al.  A fast mathematical programming procedure for simultaneous fitting of assembly components into cryoEM density maps , 2010, Bioinform..

[8]  Keren Lasker,et al.  Finding the right fit: chiseling structures out of cryo-electron microscopy maps. , 2014, Current opinion in structural biology.

[9]  P Chacón,et al.  Reconstruction of protein form with X-ray solution scattering and a genetic algorithm. , 2000, Journal of molecular biology.

[10]  Frank Alber,et al.  Conformational States of Macromolecular Assemblies Explored by Integrative Structure Calculation , 2013, Structure.

[11]  Bruno Contreras-Moreira,et al.  Novel use of a genetic algorithm for protein structure prediction: Searching template and sequence alignment space , 2003, Proteins.

[12]  T. Kawabata Multiple Subunit Fitting into a Low-Resolution Density Map of a Macromolecular Complex Using a Gaussian Mixture Model , 2008, Biophysical journal.

[13]  Ben M. Webb,et al.  Protein structure fitting and refinement guided by cryo-EM density. , 2008, Structure.

[14]  H. Wolfson,et al.  Determining macromolecular assembly structures by molecular docking and fitting into an electron density map , 2010, Proteins.

[15]  Haim J. Wolfson,et al.  DockStar: a novel ILP-based integrative method for structural modeling of multimolecular protein complexes , 2015, Bioinform..

[16]  Dominika Elmlund,et al.  Cryogenic electron microscopy and single-particle analysis. , 2015, Annual review of biochemistry.

[17]  Ken Shoemake,et al.  Uniform Random Rotations , 1992, Graphics Gems III.

[18]  Peter Willett,et al.  GAPDOCK: A genetic algorithm approach to protein docking in CAPRI round 1 , 2003, Proteins.

[19]  N Gautham,et al.  Protein structure prediction using mutually orthogonal Latin squares and a genetic algorithm. , 2006, Biochemical and biophysical research communications.

[20]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[21]  Gabriel C Lander,et al.  Go hybrid: EM, crystallography, and beyond. , 2012, Current opinion in structural biology.

[22]  Ben M. Webb,et al.  Putting the Pieces Together: Integrative Modeling Platform Software for Structure Determination of Macromolecular Assemblies , 2012, PLoS biology.

[23]  Daisuke Kihara,et al.  Computational methods for constructing protein structure models from 3D electron microscopy maps. , 2013, Journal of structural biology.

[24]  Thomas D. Goddard,et al.  Quantitative analysis of cryo-EM density map segmentation by watershed and scale-space filtering, and fitting of structures by alignment to regions. , 2010, Journal of structural biology.

[25]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[26]  Stefan Birmanns,et al.  Evolutionary tabu search strategies for the simultaneous registration of multiple atomic structures in cryo-EM reconstructions. , 2010, Journal of structural biology.

[27]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[28]  Daisuke Kihara,et al.  Fitting multimeric protein complexes into electron microscopy maps using 3D Zernike descriptors. , 2012, The journal of physical chemistry. B.

[29]  Agnel Praveen Joseph,et al.  TEMPy: a Python library for assessment of three-dimensional electron microscopy density fits , 2015, Journal of applied crystallography.

[30]  M. Topf,et al.  Scoring functions for cryoEM density fitting. , 2011, Journal of structural biology.

[31]  A. Bonvin,et al.  Integrative Modeling of Biomolecular Complexes: HADDOCKing with Cryo-Electron Microscopy Data. , 2015, Structure.

[32]  Stefan Birmanns,et al.  Using Sculptor and Situs for simultaneous assembly of atomic components into low-resolution shapes. , 2011, Journal of structural biology.

[33]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  A. Roseman Docking structures of domains into maps from cryo-electron microscopy using local correlation. , 2000, Acta crystallographica. Section D, Biological crystallography.