A DNA repair system specific for thermophilic Archaea and bacteria predicted by genomic context analysis.

During a systematic analysis of conserved gene context in prokaryotic genomes, a previously undetected, complex, partially conserved neighborhood consisting of more than 20 genes was discovered in most Archaea (with the exception of Thermoplasma acidophilum and Halobacterium NRC-1) and some bacteria, including the hyperthermophiles Thermotoga maritima and Aquifex aeolicus. The gene composition and gene order in this neighborhood vary greatly between species, but all versions have a stable, conserved core that consists of five genes. One of the core genes encodes a predicted DNA helicase, often fused to a predicted HD-superfamily hydrolase, and another encodes a RecB family exonuclease; three core genes remain uncharacterized, but one of these might encode a nuclease of a new family. Two more genes that belong to this neighborhood and are present in most of the genomes in which the neighborhood was detected encode, respectively, a predicted HD-superfamily hydrolase (possibly a nuclease) of a distinct family and a predicted, novel DNA polymerase. Another characteristic feature of this neighborhood is the expansion of a superfamily of paralogous, uncharacterized proteins, which are encoded by at least 20-30% of the genes in the neighborhood. The functional features of the proteins encoded in this neighborhood suggest that they comprise a previously undetected DNA repair system, which, to our knowledge, is the first repair system largely specific for thermophiles to be identified. This hypothetical repair system might be functionally analogous to the bacterial-eukaryotic system of translesion, mutagenic repair whose central components are DNA polymerases of the UmuC-DinB-Rad30-Rev1 superfamily, which typically are missing in thermophiles.
