Structural constraints and emergence of sequence patterns in protein evolution.

The aim of this work was to study the relationship between structure conservation and sequence divergence in protein evolution. To this end, we developed a model of structurally constrained protein evolution (SCPE) in which trial sequences, generated by random mutations at gene level, are selected against departure from a reference three-dimensional structure. Since at the mutational level SCPE is completely unbiased, any emergent sequence pattern will be due exclusively to structural constraints. In this first report, it is shown that SCPE correctly predicts the characteristic hexapeptide motif of the left-handed parallel beta helix (LbetaH) domain of UDP-N-acetylglucosamine acyltransferases (LpxA).

[1]  M. Perutz Species adaptation in a protein molecule. , 1983, Molecular biology and evolution.

[2]  L. Mirny,et al.  Universally conserved positions in protein folds: reading evolutionary signals about stability, folding kinetics and function. , 1999, Journal of molecular biology.

[3]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[4]  T. P. Flores,et al.  Comparison of conformational characteristics in structurally similar protein pairs , 1993, Protein science : a publication of the Protein Society.

[5]  W. Li,et al.  Selective constraints, amino acid composition, and the rate of protein evolution. , 2000, Molecular biology and evolution.

[6]  M. Vaara,et al.  The novel hexapeptide motif found in the acyltransferases LpxA and LpxD of lipid A biosynthesis is conserved in various bacteria , 1994, FEBS letters.

[7]  John P. Overington,et al.  Tertiary structural constraints on protein evolutionary diversity: templates, key residues and structure prediction , 1990, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[8]  Eugene I. Shakhnovich,et al.  Kinetics, thermodynamics and evolution of non-native interactions in a protein folding nucleus , 2000, Nature Structural Biology.

[9]  P. Stadler,et al.  Neutral networks in protein space: a computational study based on knowledge-based potentials of mean force. , 1997, Folding & design.

[10]  C. Raetz,et al.  A Left-Handed Parallel β Helix in the Structure of UDP-N-Acetylglucosamine Acyltransferase , 1995, Science.

[11]  M. Billeter,et al.  MOLMOL: a program for display and analysis of macromolecular structures. , 1996, Journal of molecular graphics.

[12]  P. Lio’,et al.  Models of molecular evolution and phylogeny. , 1998, Genome research.

[13]  Xuhua Xia,et al.  What Amino Acid Properties Affect Protein Evolution? , 1998, Journal of Molecular Evolution.

[14]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[15]  D. Baker,et al.  A surprising simplicity to protein folding , 2000, Nature.

[16]  A. Dean,et al.  The structural basis of molecular adaptation. , 1998, Molecular biology and evolution.

[17]  M. Levitt,et al.  De novo protein design. II. Plasticity in sequence space. , 1999, Journal of molecular biology.

[18]  W. Brown,et al.  Structural biology and phylogenetic estimation , 1997, Nature.

[19]  C. Cambillau,et al.  Crystal structure of the bifunctional N‐acetylglucosamine 1‐phosphate uridyltransferase from Escherichia coli: a paradigm for the related pyrophosphorylase superfamily , 1999, The EMBO journal.

[20]  J. Echave,et al.  Evolutionary Analysis of γ-Carbonic Anhydrase and Structurally Related Proteins , 2000 .

[21]  R A Goldstein,et al.  Models of natural mutations including site heterogeneity , 1998, Proteins.

[22]  M. Gerstein,et al.  Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores. , 2000, Journal of molecular biology.

[23]  H. Schindelin,et al.  A left‐hand beta‐helix revealed by the crystal structure of a carbonic anhydrase from the archaeon Methanosarcina thermophila. , 1996, The EMBO journal.

[24]  M. Levitt,et al.  De novo protein design. I. In search of stability and specificity. , 1999, Journal of molecular biology.

[25]  J. Blanchard,et al.  Three-dimensional structure of tetrahydrodipicolinate N-succinyltransferase. , 1997, Biochemistry.

[26]  E. Shakhnovich,et al.  Conserved residues and the mechanism of protein folding , 1996, Nature.

[27]  M. Vaara,et al.  Eight bacterial proteins, including UDP-N-acetylglucosamine acyltransferase (LpxA) and three other transferases of Escherichia coli, consist of a six-residue periodicity theme. , 1992, FEMS microbiology letters.

[28]  M. Sugantino,et al.  Structure of the hexapeptide xenobiotic acetyltransferase from Pseudomonas aeruginosa. , 1998, Biochemistry.

[29]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[30]  T. Blundell,et al.  Evolution and the tertiary structure of proteins. , 1984, Annual review of biophysics and bioengineering.

[31]  W. Pearson,et al.  Evolution of protein sequences and structures. , 1999, Journal of molecular biology.

[32]  M. Kimura,et al.  The neutral theory of molecular evolution. , 1983, Scientific American.

[33]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[34]  M. Kimura The Neutral Theory of Molecular Evolution: Introduction , 1983 .