MOTIF-EM: an automated computational tool for identifying conserved regions in CryoEM structures

We present MOTIF-EM, a new, first-of-its-kind, fully automated computational tool for identifying regions, domains, or motifs in cryoEM maps of large macromolecular assemblies (such as chaperonins and viruses) that remain conformationally conserved. As a by-product, regions that are not conserved are revealed; these can indicate local molecular flexibility related to biological activity. MOTIF-EM takes cryoEM volumetric maps as inputs. The technique it uses to detect conserved sub-structures is inspired by a recent breakthrough in 2D object recognition. It works by constructing rotationally invariant, low-dimensional representations of local regions in the input cryoEM maps. Correspondences are established across the input maps by comparing these reduced representations using a simple metric. The correspondences are clustered using hash tables, and graph theory is used to retrieve conserved structural domains or motifs. MOTIF-EM has been used to extract domains in large macromolecular assembly maps, including those of the viruses P22 and epsilon 15, the 70S ribosome, and GroEL, that remain structurally conserved in different functional states. Our method can also be used to build atomic models for some maps. We also used MOTIF-EM to identify the conserved folds shared among the dsDNA bacteriophages HK97, epsilon 15, and φ29, even though they have low sequence similarity. Contact: mitul@cs.stanford.edu Supplementary information: Supplementary data are available at Bioinformatics online.
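The descriptor-and-matching idea described above can be illustrated with a minimal sketch. The abstract does not say how MOTIF-EM builds its representations, so everything here is an assumption made for illustration: the descriptor (mean density in concentric shells around a voxel), the Euclidean metric, and the names `radial_descriptor` and `match_descriptors` are hypothetical and are not taken from the tool. The hash-table clustering and graph-theoretic retrieval steps are omitted.

```python
import numpy as np


def radial_descriptor(volume, center, radius, n_shells=4):
    """Mean density in concentric shells around `center`.

    Hypothetical descriptor, not MOTIF-EM's: shell averages depend only on
    distance from the center, so they do not change when the map is rotated
    about that point (exactly so for 90-degree grid rotations).
    """
    cz, cy, cx = center
    z, y, x = np.ogrid[-radius:radius + 1, -radius:radius + 1, -radius:radius + 1]
    dist = np.sqrt(z**2 + y**2 + x**2)
    patch = volume[cz - radius:cz + radius + 1,
                   cy - radius:cy + radius + 1,
                   cx - radius:cx + radius + 1]
    inside = dist <= radius
    # Assign each voxel inside the sphere to a shell index 0..n_shells-1.
    shell = np.minimum((dist[inside] / radius * n_shells).astype(int), n_shells - 1)
    sums = np.bincount(shell, weights=patch[inside], minlength=n_shells)
    counts = np.bincount(shell, minlength=n_shells)
    return sums / np.maximum(counts, 1)


def match_descriptors(desc_a, desc_b, max_dist):
    """Pair each row of desc_a with its nearest row of desc_b (Euclidean distance).

    Pairs farther apart than `max_dist` are discarded.
    """
    d = np.linalg.norm(desc_a[:, None, :] - desc_b[None, :, :], axis=2)
    nearest = d.argmin(axis=1)
    return [(i, int(j)) for i, j in enumerate(nearest) if d[i, j] <= max_dist]
```

In a fuller pipeline, each accepted pair would vote for a rigid transformation, and consistent votes would then be grouped. That stage is left out here because the abstract gives no detail on it.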
