Did DNA replication evolve twice independently?

DNA replication is central to all extant cellular organisms. There are substantial functional similarities between the bacterial and the archaeal/eukaryotic replication machineries, including but not limited to defined origins, replication bidirectionality, RNA primers and leading and lagging strand synthesis. However, several core components of the bacterial replication machinery are unrelated or only distantly related to the functionally equivalent components of the archaeal/eukaryotic replication apparatus. This is in sharp contrast to the principal proteins involved in transcription and translation, which are highly conserved in all divisions of life. We performed detailed sequence comparisons of the proteins that fulfill indispensable functions in DNA replication and classified them into four main categories with respect to the conservation in bacteria and archaea/eukaryotes: (i) non-homologous, such as replicative polymerases and primases; (ii) containing homologous domains but apparently non-orthologous and conceivably independently recruited to function in replication, such as the principal replicative helicases or proofreading exonucleases; (iii) apparently orthologous but poorly conserved, such as the sliding clamp proteins or DNA ligases; (iv) orthologous and highly conserved, such as clamp-loader ATPases or 5'-->3' exonucleases (FLAP nucleases). The universal conservation of some components of the DNA replication machinery and enzymes for DNA precursor biosynthesis but not the principal DNA polymerases suggests that the last common ancestor (LCA) of all modern cellular life forms possessed DNA but did not replicate it the way extant cells do. We propose that the LCA had a genetic system that contained both RNA and DNA, with the latter being produced by reverse transcription. Consequently, the modern-type system for double-stranded DNA replication likely evolved independently in the bacterial and archaeal/eukaryotic lineages.

[1]  Michael Y. Galperin,et al.  Comparative genomics of the Archaea (Euryarchaeota): evolution of conserved protein families, the stable core, and the variable shell. , 1999, Genome research.

[2]  E. Koonin,et al.  Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches. , 1999, Journal of molecular biology.

[3]  E. Koonin,et al.  Conserved domains in DNA repair proteins and evolution of repair systems. , 1999, Nucleic acids research.

[4]  A. Wolffe,et al.  Chromatin disruption and modification. , 1999, Nucleic acids research.

[5]  D. Wigley,et al.  Structure of the adenylation domain of an NAD+-dependent DNA ligase. , 1999, Structure.

[6]  E V Koonin,et al.  AAA+: A class of chaperone-like ATPases associated with the assembly, operation, and disassembly of protein complexes. , 1999, Genome research.

[7]  H. Toh,et al.  A heterodimeric DNA polymerase: evidence that members of Euryarchaeota possess a distinct DNA polymerase. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[8]  C. Ban,et al.  Crystal Structure and ATPase Activity of MutL Implications for DNA Repair and Mutagenesis , 1998, Cell.

[9]  N. Brown,et al.  DNA polymerase III of Gram-positive eubacteria is a zinc metalloprotein conserving an essential finger-like domain. , 1998, Biochemistry.

[10]  S F Altschul,et al.  Iterated profile searches with PSI-BLAST--a tool for discovery in protein databases. , 1998, Trends in biochemical sciences.

[11]  B. Snel,et al.  Conservation of gene order: a fingerprint of proteins that physically interact. , 1998, Trends in biochemical sciences.

[12]  Detlef D. Leipe,et al.  Toprim--a conserved catalytic domain in type IA and II topoisomerases, DnaG-type primases, OLD family nucleases and RecR proteins. , 1998, Nucleic acids research.

[13]  Yunje Cho,et al.  The crystal structure of flap endonuclease-1 from Methanococcus jannaschii , 1998, Nature Structural &Molecular Biology.

[14]  F. Chédin,et al.  Novel homologs of replication protein A in archaea: implications for the evolution of ssDNA-binding proteins. , 1998, Trends in biochemical sciences.

[15]  E V Koonin,et al.  Phosphoesterase domains associated with DNA polymerases of diverse origins. , 1998, Nucleic acids research.

[16]  C. Woese The universal ancestor. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[17]  J. Tainer,et al.  Flap endonuclease homologs in archaebacteria exist as independent proteins. , 1998, Trends in biochemical sciences.

[18]  K. Komori,et al.  Copyright © 1998, American Society for Microbiology A Novel DNA Polymerase Family Found in Archaea , 1997 .

[19]  Tania A Baker,et al.  Polymerases and the Replisome: Machines within Machines , 1998, Cell.

[20]  P. Forterre,et al.  Archaea: what can we learn from their sequences? , 1997, Current opinion in genetics & development.

[21]  Michael Y. Galperin,et al.  Prokaryotic genomes: the emerging paradigm of genome-based microbiology. , 1997, Current opinion in genetics & development.

[22]  I S Mian,et al.  The proofreading domain of Escherichia coli DNA polymerase I and other DNA and/or RNA exonuclease domains. , 1997, Nucleic acids research.

[23]  S. Biswas,et al.  Purification and characterization of DNA polymerase alpha-associated replication protein A-dependent yeast DNA helicase A. , 1997, Biochemistry.

[24]  D. Lipman,et al.  A genomic perspective on protein families. , 1997, Science.

[25]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[26]  C. Kang,et al.  A common core for binding single‐stranded DNA: structural comparison of the single‐stranded DNA‐binding proteins (SSB) from E. coli and human mitochondria , 1997, FEBS letters.

[27]  T. Steitz,et al.  Crystal Structure of a pol α Family Replication DNA Polymerase from Bacteriophage RB69 , 1997, Cell.

[28]  W. Doolittle,et al.  Archaea and the Origin(s) of DNA Replication Proteins , 1997, Cell.

[29]  G. Waksman,et al.  Crystal structure of the homo-tetrameric DNA binding domain of Escherichia coli single-stranded DNA-binding protein determined by multiwavelength x-ray diffraction on the selenomethionyl protein at 2.9-A resolution. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Eugene V. Koonin,et al.  SEALS: A System for Easy Analysis of Lots of Sequences , 1997, ISMB.

[31]  Samuel Karlin,et al.  Evolutionary Comparisons of RecA-Like Proteins Across All Major Kingdoms of Living Organisms , 1997, Journal of Molecular Evolution.

[32]  A. Nicolas,et al.  An atypical topoisomerase II from archaea with implications for meiotic recombination , 1997, Nature.

[33]  H. Jäck,et al.  Cloning and characterization of HUPF1, a human homolog of the Saccharomyces cerevisiae nonsense mRNA-reducing UPF1 protein. , 1997, Nucleic acids research.

[34]  Alexey Bochkarev,et al.  Structure of the single-stranded-DNA-binding domain of replication protein A bound to DNA , 1997, Nature.

[35]  S. Benner,et al.  The B12-dependent ribonucleotide reductase from the archaebacterium Thermoplasma acidophila: an evolutionary solution to the ribonucleotide reductase conundrum. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Peer Bork,et al.  A superfamily of conserved domains in DNA damage‐ responsive cell cycle checkpoint proteins , 1997, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[37]  E. Koonin,et al.  A minimal gene set for cellular life derived by comparison of complete bacterial genomes. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[38]  T. Ceska,et al.  A helical arch allowing single-stranded DNA to thread through T5 5'-exonuclease , 1996, Nature.

[39]  Chris P. Ponting,et al.  The helix-hairpin-helix DNA-binding motif: a structural basis for non- sequence-specific recognition of DNA , 1996, Nucleic Acids Res..

[40]  T. Mueser,et al.  Structure of Bacteriophage T4 RNase H, a 5′ to 3′ RNA–DNA and DNA–DNA Exonuclease with Sequence Similarity to the RAD2 Family of Eukaryotic Proteins , 1996, Cell.

[41]  P. Bork,et al.  Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli , 1996, Current Biology.

[42]  J. Wootton,et al.  Analysis of compositionally biased regions in sequence databases. , 1996, Methods in enzymology.

[43]  Z. Kelman,et al.  Structural and functional similarities of prokaryotic and eukaryotic DNA polymerase sliding clamps. , 1995, Nucleic acids research.

[44]  Dae-Sil Lee,et al.  Crystal structure of Thermus aquaticus DNA polymerase , 1995, Nature.

[45]  A. Murzin A ribosomal protein module in EF-G and DNA gyrase , 1995, Nature Structural Biology.

[46]  John Kuriyan,et al.  Crystal structure of the eukaryotic DNA polymerase processivity factor PCNA , 1994, Cell.

[47]  S. Altschul,et al.  Issues in searching molecular sequence databases , 1994, Nature Genetics.

[48]  Yong Je Chung,et al.  Crystal structure of bacteriophage T7 RNA polymerase at 3.3 Å resolution , 1993, Nature.

[49]  E V Koonin,et al.  A common set of conserved motifs in a vast variety of putative nucleic acid-dependent ATPases including MCM proteins involved in the initiation of eukaryotic DNA replication. , 1993, Nucleic acids research.

[50]  Eugene V. Koonin,et al.  Helicases: amino acid sequence comparisons and structure-function relationships , 1993 .

[51]  A. Murzin OB(oligonucleotide/oligosaccharide binding)‐fold: common structural and functional solution for non‐homologous sequences. , 1993, The EMBO journal.

[52]  Mei Chen,et al.  Homology in accessory proteins of replicative polymerases--E. coli to humans , 1993, Nucleic Acids Res..

[53]  T. Steitz,et al.  Crystal structure at 3.5 A resolution of HIV-1 reverse transcriptase complexed with an inhibitor. , 1992, Science.

[54]  John Kuriyan,et al.  Three-dimensional structure of the β subunit of E. coli DNA polymerase III holoenzyme: A sliding DNA clamp , 1992, Cell.

[55]  D. Demarini,et al.  SEN1, a positive effector of tRNA-splicing endonuclease in Saccharomyces cerevisiae , 1992, Molecular and cellular biology.

[56]  P. Slonimski,et al.  NAM7 nuclear gene encodes a novel member of a family of helicases with a Zn-ligand motif and is involved in mitochondrial functions in Saccharomyces cerevisiae. , 1992, Journal of molecular biology.

[57]  P. Forterre,et al.  The nature of the last universal ancestor and the root of the tree of life, still open questions. , 1992, Bio Systems.

[58]  J. Ito,et al.  Compilation and alignment of DNA polymerase sequences. , 1991, Nucleic acids research.

[59]  P Argos,et al.  An attempt to unify the structure of polymerases. , 1990, Protein engineering.

[60]  E. Koonin,et al.  Viral proteins containing the purine NTP-binding sequence pattern. , 1989, Nucleic acids research.

[61]  L. Blanco,et al.  A conserved 3′→5′ exonuclease active site in prokaryotic and eukaryotic DNA polymerases , 1989, Cell.

[62]  P Argos,et al.  A sequence motif in many polymerases. , 1988, Nucleic acids research.

[63]  E. Wintersberger,et al.  RNA makes DNA: a speculative view of the evolution of DNA replication mechanisms , 1987 .

[64]  T. Steitz,et al.  Structure of large fragment of Escherichia coli DNA polymerase I complexed with dTMP , 2020, Nature.