Functional insights from structural predictions: Analysis of the Escherichia coli genome

Fold assignments for proteins from the Escherichia coli genome are carried out using BASIC, a profile–profile alignment algorithm, recently tested on fold recognition benchmarks and on the Mycoplasma genitalium genome and PSI BLAST, the newest generation of the de facto standard in homology search algorithms. The fold assignments are followed by automated modeling and the resulting three‐dimensional models are analyzed for possible function prediction.

[1]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[2]  S. Karlin,et al.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[3]  A. Anderson,et al.  Occurrence, metabolism, metabolic role, and industrial uses of bacterial polyhydroxyalkanoates. , 1990, Microbiological reviews.

[4]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[5]  G. Gonnet,et al.  Exhaustive matching of the entire protein sequence database. , 1992, Science.

[6]  A. Godzik,et al.  Topology fingerprint approach to the inverse protein folding problem. , 1992, Journal of molecular biology.

[7]  G. Schulz Binding of nucleotides by proteins , 1992, Current Biology.

[8]  T. P. Flores,et al.  Recurring structural motifs in proteins with different functions , 1993, Current Biology.

[9]  S. Bryant,et al.  An empirical energy function for threading protein sequence through the folding motif , 1993, Proteins.

[10]  C Sander,et al.  Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures. , 1993, Journal of molecular biology.

[11]  E S Lander,et al.  Recognition of related proteins by iterative template refinement (ITR) , 1994, Protein science : a publication of the Protein Society.

[12]  John P. Overington,et al.  Derivation of rules for comparative protein modeling from a database of protein structure alignments , 1994, Protein science : a publication of the Protein Society.

[13]  Y. Matsuo,et al.  Protein structural similarities predicted by a sequence‐structure compatibility method , 1994, Protein science : a publication of the Protein Society.

[14]  D Eisenberg,et al.  Inverse protein folding by the residue pair preference profile method: estimating the correctness of alignments of structurally compatible sequences. , 1995, Protein engineering.

[15]  Michael S. Waterman,et al.  Introduction to computational biology , 1995 .

[16]  F. Quiocho,et al.  Atomic structure and specificity of bacterial periplasmic receptors for active transport and chemotaxis: variation of common themes , 1996, Molecular microbiology.

[17]  G. Barton,et al.  Protein fold recognition by mapping predicted secondary structures. , 1996, Journal of molecular biology.

[18]  R. Reusch,et al.  Poly(3-hydroxybutyrate) Is Associated with Specific Proteins in the Cytoplasm and Membranes of Escherichia coli* , 1996, The Journal of Biological Chemistry.

[19]  T. Gibson,et al.  Applying motif and profile searches. , 1996, Methods in enzymology.

[20]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[21]  E. Chiancone,et al.  A Novel Non-heme Iron-binding Ferritin Related to the DNA-binding Proteins of the Dps Family in Listeria innocua* , 1997, The Journal of Biological Chemistry.

[22]  G N Murshudov,et al.  The structure of the cofactor-binding fragment of the LysR family member, CysB: a familiar fold with a surprising subunit arrangement. , 1997, Structure.

[23]  D. Fischer,et al.  Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[24]  M. L. Jones,et al.  PDBsum: a Web-based database of summaries and analyses of all PDB structures. , 1997, Trends in biochemical sciences.

[25]  E. Moxon,et al.  E. coligenome sequence: A blueprint for life , 1997, Nature.

[26]  H. Mewes,et al.  Protein structural classes in five complete genomes , 1997, Nature Structural Biology.

[27]  S. Lee,et al.  Production of poly(3-hydroxybutyrate) by fed-batch culture of filamentation-suppressed recombinant Escherichia coli , 1997, Applied and environmental microbiology.

[28]  J Skolnick,et al.  Functional analysis of the Escherichia coli genome using the sequence-to-structure-to-function paradigm: identification of proteins exhibiting the glutaredoxin/thioredoxin disulfide oxidoreductase activity. , 1998, Journal of molecular biology.

[29]  Leszek Rychlewski,et al.  Fold prediction by a hierarchy of sequence, threading, and modeling methods , 1998, Protein science : a publication of the Protein Society.

[30]  A. Godzik,et al.  Fold and function predictions for Mycoplasma genitalium proteins. , 1998, Folding & design.