Bridging the information gap: computational tools for intermediate resolution structure interpretation.

Because of their large size and complexity, few large macromolecular complexes have been solved to atomic resolution. This has led to an under-representation of these structures, which are composed of novel and/or homologous folds, in the library of known structures and folds. While it is often difficult to achieve a high-resolution model for these structures, X-ray crystallography and electron cryomicroscopy are capable of determining structures of large assemblies at low to intermediate resolutions. To aid in the interpretation and analysis of such structures, we have developed two programs: helixhunter and foldhunter. Helixhunter is capable of reliably identifying helix position, orientation and length using a five-dimensional cross-correlation search of a three-dimensional density map followed by feature extraction. Helixhunter's results can in turn be used to probe a library of secondary structure elements derived from the structures in the Protein Data Bank (PDB). From this analysis, it is then possible to identify potential homologous folds or suggest novel folds based on the arrangement of alpha-helix elements, resulting in a structure-based recognition of folds containing alpha helices. Foldhunter uses a six-dimensional cross-correlation search that allows a probe structure to be fitted within a region or component of a target structure. The structural fitting therefore provides a quantitative means to further examine the architecture and organization of large, complex assemblies. These two methods have been successfully tested with simulated structures modeled from the PDB at resolutions between 6 and 12 Å. With the integration of helixhunter and foldhunter into sequence and structural informatics techniques, we have the potential to deduce or confirm known or novel folds in domains or components within large complexes.
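The cross-correlation searches described above can be illustrated with a minimal sketch. It is not the helixhunter or foldhunter implementation: it assumes a toy cubic grid, a binary rod as a stand-in for a helix template, circular (FFT-based) correlation for the three translational dimensions, and only three axis-aligned orientations in place of a full angular search. All function and variable names are hypothetical.

```python
import numpy as np

def cross_correlate(density, template):
    """Circular cross-correlation of two same-shape 3D arrays via FFT.

    Entry [s] of the result is the overlap between the template shifted
    by s and the density map (translational part of the search).
    """
    f_map = np.fft.fftn(density)
    f_tmpl = np.fft.fftn(template)
    return np.real(np.fft.ifftn(f_map * np.conj(f_tmpl)))

def search_orientations(density, templates):
    """Try each pre-rotated template and keep the best-scoring placement.

    `templates` maps an orientation label to a 3D array. A real search
    would sample rotation angles finely; this sketch uses a few labels.
    Returns (score, orientation_label, translation).
    """
    best = None
    for label, tmpl in templates.items():
        cc = cross_correlate(density, tmpl)
        idx = np.unravel_index(np.argmax(cc), cc.shape)
        score = float(cc[idx])
        if best is None or score > best[0]:
            best = (score, label, tuple(int(i) for i in idx))
    return best

# Toy helix stand-in: a 2x2x8 voxel rod along z in a 16^3 grid (assumption).
rod_z = np.zeros((16, 16, 16))
rod_z[2:4, 2:4, 2:10] = 1.0
templates = {
    "z": rod_z,
    "x": np.transpose(rod_z, (2, 1, 0)),  # same rod lying along x
    "y": np.transpose(rod_z, (0, 2, 1)),  # same rod lying along y
}

# Synthetic "density map": the x-oriented rod shifted by (3, 5, 2) voxels.
density = np.roll(templates["x"], shift=(3, 5, 2), axis=(0, 1, 2))
result = search_orientations(density, templates)
```

Here `result` holds the best score, the orientation label, and the translation at which the template best matches the map. Extending the orientation set to a fine sampling of rotation angles is what raises the dimensionality of the search beyond the three translational axes.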
