Protein tolerance to random amino acid change.

Mutagenesis of protein-encoding sequences occurs ubiquitously; it enables evolution, accumulates during aging, and is associated with disease. Many biotechnological methods exploit random mutations to evolve novel proteins. To quantitate protein tolerance to random change, it is vital to understand the probability that a random amino acid replacement will lead to a protein's functional inactivation. We define this probability as the "x factor." Here, we develop a broadly applicable approach to calculate x factors and demonstrate this method using the human DNA repair enzyme 3-methyladenine DNA glycosylase (AAG). Three gene-wide mutagenesis libraries were created, each with 10(5) diversity and averaging 2.2, 4.6, and 6.2 random amino acid changes per mutant. After determining the percentage of functional mutants in each library using high-stringency selection (>19,000-fold), the x factor was found to be 34% +/- 6%. Remarkably, reanalysis of data from studies of diverse proteins reveals similar inactivation probabilities. To delineate the nature of tolerated amino acid substitutions, we sequenced 244 surviving AAG mutants. The 920 tolerated substitutions were characterized by substitutability index and mapped onto the AAG primary, secondary, and known tertiary structures. Evolutionarily conserved residues show low substitutability indices. In AAG, beta strands are on average less substitutable than alpha helices; and surface loops that are not involved in DNA binding are the most substitutable. Our results are relevant to such diverse topics as applied molecular evolution, the rate of introduction of deleterious alleles into genomes in evolutionary history, and organisms' tolerance of mutational burden.

[1]  L. Samson,et al.  Imbalanced Base Excision Repair Increases Spontaneous Mutation and Alkylation Sensitivity inEscherichia coli , 1999, Journal of bacteriology.

[2]  L. Loeb,et al.  Random Sequence Mutagenesis and Resistance to 5-Fluorouridine in Human Thymidylate Synthases* , 1998, The Journal of Biological Chemistry.

[3]  M. Black,et al.  Identification of important residues within the putative nucleoside binding site of HSV-1 thymidine kinase by random sequence selection: analysis of selected mutants in vitro. , 1993, Biochemistry.

[4]  Michael H. Hecht,et al.  Protein Design: The Choice of de Novo Sequences* , 1997, The Journal of Biological Chemistry.

[5]  K. Loeb,et al.  Multiple mutations and cancer , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  J. Mullins,et al.  Lethal mutagenesis of HIV with mutagenic nucleoside analogs. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[7]  J. Drake,et al.  Rates of spontaneous mutation. , 1998, Genetics.

[8]  Peter D. Keightley,et al.  High genomic deleterious mutation rates in hominids , 1999, Nature.

[9]  John Maynard Smith,et al.  Natural Selection and the Concept of a Protein Space , 1970, Nature.

[10]  C Cruz,et al.  Genetic studies of the lac repressor. XIV. Analysis of 4000 altered Escherichia coli lac repressors reveals essential and non-essential residues, as well as "spacers" which do not require a specific sequence. , 1994, Journal of molecular biology.

[11]  M. Zaccolo,et al.  The effect of high-frequency random mutagenesis on in vitro protein evolution: a study on TEM-1 beta-lactamase. , 1999, Journal of molecular biology.

[12]  S. Henikoff,et al.  Predicting deleterious amino acid substitutions. , 2001, Genome research.

[13]  L. Loeb,et al.  Distribution of Mutations in Human Thymidylate Synthase Yielding Resistance to 5-Fluorodeoxyuridine* , 2002, Journal of Biological Chemistry.

[14]  L. Hood,et al.  Random mutagenesis of Thermus aquaticus DNA polymerase I: concordance of immutable sites in vivo with the crystal structure. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[15]  S. Bouvier,et al.  Systematic mutation of bacteriophage T4 lysozyme. , 1991, Journal of molecular biology.

[16]  A. Fersht,et al.  A search for single substitutions that eliminate enzymatic function in a bacterial ribonuclease. , 1998, Biochemistry.

[17]  Christopher T. Saunders,et al.  Evaluation of structural and evolutionary contributions to deleterious mutation prediction. , 2002, Journal of molecular biology.

[18]  L. Loeb,et al.  Mutations in human DNA polymerase η motif II alter bypass of DNA lesions , 2001, The EMBO journal.

[19]  M D Wyatt,et al.  Molecular basis for discriminating between normal and damaged bases by the human alkyladenine glycosylase, AAG. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[20]  F. Taddei,et al.  Evolution of Evolvability a , 1999 .

[21]  L. Samson,et al.  Crystal Structure of a Human Alkylbase-DNA Repair Enzyme Complexed to DNA Mechanisms for Nucleotide Flipping and Base Excision , 1998, Cell.

[22]  P H Patel,et al.  DNA polymerase active site is highly mutable: evolutionary consequences. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[23]  L. Loeb,et al.  Human Immunodeficiency Virus Reverse Transcriptase , 1996, The Journal of Biological Chemistry.

[24]  R. Sauer,et al.  Additivity of mutant effects assessed by binomial mutagenesis. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[25]  W. Lim,et al.  Deciphering the message in protein sequences: tolerance to amino acid substitutions. , 1990, Science.

[26]  T. O'Connor,et al.  Purification and characterization of human 3-methyladenine-DNA glycosylase. , 1993, Nucleic acids research.

[27]  James Allan,et al.  3‐methyladenine DNA glycosylases: structure, function, and biological importance , 1999, BioEssays : news and reviews in molecular, cellular and developmental biology.

[28]  L. Samson,et al.  Cloning and characterization of a 3-methyladenine DNA glycosylase cDNA from human cells whose gene maps to chromosome 16. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[29]  M. Malim,et al.  DNA deamination: not just a trigger for antibody diversification but also a mechanism for defense against retroviruses , 2003, Nature Immunology.