Reconstructed evolutionary adaptive paths give polymerases accepting reversible terminators for sequencing and SNP detection

Any system, natural or human-made, is better understood if we analyze both its history and its structure. Here we combine structural analyses with a “Reconstructed Evolutionary Adaptive Path” (REAP) analysis, which uses the evolutionary and functional history of DNA polymerases to select amino acid replacements that enable polymerases to accept a new class of triphosphate substrates: those whose 3′-OH ends are blocked as a 3′-ONH2 group (dNTP-ONH2). Like the widely used 2′,3′-dideoxynucleoside triphosphates (ddNTPs), dNTP-ONH2s terminate primer extension. Unlike with ddNTPs, however, primer extension can be resumed by cleaving the O-N bond to restore an -OH group at the 3′-end of the primer. REAP combined with crystallographic analyses identified 35 sites where replacements might improve the ability of Taq polymerase to accept dNTP-ONH2s. A library of 93 Taq variants, each having replacements at three or four of these sites, contained eight variants with improved ability to accept dNTP-ONH2 substrates. Two of these (A597T, L616A, F667Y, E745H; and E520G, K540I, L616A) performed notably well. The second variant incorporated both dNTP-ONH2s and ddNTPs faithfully and efficiently, supporting extension-cleavage-extension cycles applicable in parallel sequencing and in SNP detection through competition between reversible and irreversible terminators. Dissecting these results showed that one replacement (L616A), not previously identified, allows Taq to incorporate both reversible and irreversible terminators. Modeling showed how L616A might open space behind Phe-667, allowing it to move to accommodate the larger 3′-substituent. This work provides polymerases for DNA analyses and shows how evolutionary analyses help explore relationships between structure and function in proteins.
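
The replacement notation used above (wild-type residue, position, new residue, as in L616A) lends itself to a simple set comparison. The following is a minimal sketch, not the authors' analysis pipeline: the names `Replacement` and `parse_variant` are hypothetical, and the only data used are the two variants listed in the abstract. It shows how the replacement shared by the two best-performing variants can be picked out.

```python
import re
from typing import FrozenSet, NamedTuple


class Replacement(NamedTuple):
    """One amino acid replacement, e.g. L616A (hypothetical helper type)."""
    wild_type: str   # one-letter code of the original residue
    position: int    # residue number in the polymerase
    variant: str     # one-letter code of the replacing residue


# One-letter residue, digits, one-letter residue (e.g. "F667Y").
_PATTERN = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_variant(text: str) -> FrozenSet[Replacement]:
    """Parse a comma-separated list of replacements into a set."""
    parsed = set()
    for token in text.split(","):
        match = _PATTERN.match(token.strip())
        if match is None:
            raise ValueError(f"not a replacement label: {token!r}")
        parsed.add(Replacement(match.group(1), int(match.group(2)), match.group(3)))
    return frozenset(parsed)


# The two variants named in the abstract.
first = parse_variant("A597T, L616A, F667Y, E745H")
second = parse_variant("E520G, K540I, L616A")

# Replacements present in both variants.
shared = first & second
shared_labels = sorted(f"{r.wild_type}{r.position}{r.variant}" for r in shared)
print(shared_labels)
```

A shared replacement found this way is only a candidate; the abstract's conclusion about L616A rests on experimentally dissecting the variants, which this comparison does not replace.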
