Simple sequence repeats in the Helicobacter pylori genome

We describe an integrated system for the analysis of DNA sequence motifs within complete bacterial genome sequences. This system is based around ACeDB, a genome database with an integrated graphical user interface; we identify and display motifs in the context of genetic, sequence and bibliographic data. Tomb et al. (1997) previously reported the identification of contingency genes in Helicobacter pylori through their association with homopolymeric tracts and dinucleotide repeats. With this as a starting point, we validated the system by a search for this type of repeat and used the contextual information to assess the likelihood that they mediate phase variation in the associated open reading frames (ORFs). We found all of the repeats previously described, and identified 27 putative phase‐variable genes (including 17 previously described). These could be divided into three groups: lipopolysaccharide (LPS) biosynthesis, cell‐surface‐associated proteins and DNA restriction/modification systems. Five of the putative genes did not have obvious homologues in any of the public domain sequence databases. The reading frame of some ORFs was disrupted by the presence of the repeats, including the alpha(1‐2) fucosyltransferase gene, necessary for the synthesis of the Lewis Y epitope. An additional benefit of this approach is that the results of each search can be analysed further and compared with those from other genomes. This revealed that H. pylori has an unusually high frequency of homopurine:homopyrimidine repeats suggesting mechanistic biases that favour their presence and instability.

[1]  Cathy H. Wu,et al.  The PIR-International Protein Sequence Database , 1999, Nucleic Acids Res..

[2]  B. Appelmelk,et al.  Phase variation in Helicobacter pylori lipopolysaccharide. , 1998, Infection and immunity.

[3]  Mark Borodovsky,et al.  The complete genome sequence of the gastric pathogen Helicobacter pylori , 1997, Nature.

[4]  D. Gordenin,et al.  Hypermutability of homonucleotide runs in mismatch repair and DNA polymerase proofreading yeast mutants , 1997, Molecular and cellular biology.

[5]  E. Kuipers,et al.  Molecular mimicry between Helicobacter pylori and the host. , 1997, Trends in microbiology.

[6]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[7]  R. Fleischmann,et al.  DNA repeats identify novel virulence genes in Haemophilus influenzae. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[8]  D. Hood,et al.  Tetrameric repeat units associated with virulence factor phase variation in Haemophilus also occur in Neisseria spp. and Moraxella catarrhalis. , 1996, FEMS microbiology letters.

[9]  Patricia Rodriguez-Tomé,et al.  The European Bioinformatics Institute (EBI) databases , 1994, Nucleic Acids Res..

[10]  Hans-Werner Mewes,et al.  The PIR-International Protein Sequence Database , 1992, Nucleic Acids Res..

[11]  R. Macnab,et al.  Flagella and motility , 1996 .

[12]  D. Hood,et al.  Molecular analysis of a locus for the biosynthesis and phase‐variable expression of the lacto‐N‐neotetraose terminal lipopolysaccharide structure in Neisseria meningitidis , 1995, Molecular microbiology.

[13]  V. Deretic,et al.  Pseudomonas aeruginosa, mucoidy and the chronic infection phenotype in cystic fibrosis. , 1995, Trends in microbiology.

[14]  E. Gotschlich Genetic locus for the biosynthesis of the variable portion of Neisseria gonorrhoeae lipooligosaccharide , 1994, The Journal of experimental medicine.

[15]  E. Hansen,et al.  Identification of a new locus involved in expression of Haemophilus influenzae type b lipooligosaccharide , 1994, Infection and immunity.

[16]  H. Bujard,et al.  Context-dependent effects of upstream A-tracts. Stimulation or inhibition of Escherichia coli promoter function. , 1994, Journal of molecular biology.

[17]  J. Cannon,et al.  Multiple gonococcal opacity proteins are expressed during experimental urethral infection in the male , 1994, The Journal of experimental medicine.

[18]  John M. Hancock,et al.  SIMPLE34: an improved and enhanced implementation for VAX and Sun computers of the SIMPLE algorithm for analysis of clustered repetitive motifs in nucleotide sequences , 1994, Comput. Appl. Biosci..

[19]  M. Nowak,et al.  Adaptive evolution of highly mutable loci in pathogenic bacteria , 1994, Current Biology.

[20]  S. Normark,et al.  Attachment of Helicobacter pylori to human gastric epithelium mediated by blood group antigens. , 1993, Science.

[21]  J. Putten,et al.  Phase variation of lipopolysaccharide directs interconversion of invasive and immuno‐resistant phenotypes of Neisseria gonorrhoeae. , 1993 .

[22]  F. Mooi,et al.  Phase variation of H. influenzae fimbriae: Transcriptional control of two divergent genes through a variable combined promoter region , 1993, Cell.

[23]  Thure Etzold,et al.  SRS - an indexing and retrieval tool for flat file data libraries , 1993, Comput. Appl. Biosci..

[24]  T. Kuroki,et al.  Variable opacity (Opa) outer membrane proteins account for the cell tropisms displayed by Neisseria gonorrhoeae for human leukocytes and epithelial cells. , 1993, The EMBO journal.

[25]  J. Griffiss,et al.  Effect of exogenous sialylation of the lipooligosaccharide of Neisseria gonorrhoeae on opsonophagocytosis , 1992, Infection and immunity.

[26]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[27]  E. Moxon,et al.  The molecular mechanism of phase variation of H. influenzae lipopolysaccharide , 1989, Cell.

[28]  G. Skjåk-Bræk,et al.  Effect of acetylation on some solution and gelling properties of alginates , 1989 .

[29]  A. Varki,et al.  Acetyl-coenzyme A:polysialic acid O-acetyltransferase from K1-positive Escherichia coli. The enzyme responsible for the O-acetyl plus phenotype and for O-acetyl form variation. , 1988, The Journal of biological chemistry.

[30]  Ian R. Booth,et al.  A physiological role for DNA supercoiling in the osmotic regulation of gene expression in S. typhimurium and E. coli , 1988, Cell.

[31]  T. Meyer,et al.  The repertoire of silent pilus genes in neisseria gonorrhoeae: Evidence for gene conversion , 1986, Cell.

[32]  J. Abraham,et al.  An invertible element of DNA controls phase variation of type 1 fimbriae of Escherichia coli. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[33]  A. Sutton,et al.  Form variation in Escherichia coli K1: determined by O-acetylation of the capsular polysaccharide , 1979, The Journal of experimental medicine.