Gain and Loss of Multiple Genes During the Evolution of Helicobacter pylori

Sequence diversity and gene content distinguish most isolates of Helicobacter pylori. Even greater sequence differences separate distinct populations of H. pylori from different continents, but it was not clear whether these populations also differ in gene content. To address this question, we tested 56 globally representative strains of H. pylori and four strains of Helicobacter acinonychis with whole-genome microarrays. Of the weighted average of 1,531 genes present in the two sequenced genomes, 25% were absent in at least one strain of H. pylori and 21% were absent or variable in H. acinonychis. We extrapolate that the core genome present in all isolates of H. pylori contains 1,111 genes. Variable genes tend to be small and to have unusual GC content; many of them have probably been imported by horizontal gene transfer. Phylogenetic trees based on the microarray data differ from those based on sequences of seven genes from the core genome. These discrepancies are due to homoplasies that result from independent gene loss by deletion or recombination in multiple strains and that distort phylogenetic patterns. Comparing the pattern of these discrepancies with population structure allows a reconstruction of when variable genes were acquired within this species. Variable genes located within the cag pathogenicity island were apparently first acquired en bloc after speciation. In contrast, most other variable genes are of unknown function or encode restriction/modification enzymes, transposases, or outer membrane proteins. These seem to have been acquired before speciation of H. pylori and were subsequently lost by convergent evolution within individual strains. Thus, microarrays can reveal patterns of gene gain or loss when examined within a phylogenetic context that is based on sequences of core genes.
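
As a rough illustration of the distinction between core and variable genes drawn above, the following minimal sketch splits a gene presence/absence table into genes called present in every strain and genes absent from at least one. The function name, gene names, and presence calls are all hypothetical toy values, not data from the study, and the sketch does not reproduce the extrapolation used to estimate the core genome, which the abstract does not describe.

```python
from typing import Dict, List, Tuple


def split_core_variable(presence: Dict[str, List[bool]]) -> Tuple[List[str], List[str]]:
    """Split genes into core (present in every strain) and variable.

    `presence` maps a gene name to one presence call per strain
    (True = present, False = absent). Hypothetical helper for illustration.
    """
    core = sorted(g for g, calls in presence.items() if all(calls))
    variable = sorted(g for g, calls in presence.items() if not all(calls))
    return core, variable


# Toy presence calls for four hypothetical strains (not real microarray data).
toy_calls = {
    "geneA": [True, True, True, True],
    "geneB": [True, False, True, True],
    "geneC": [True, True, True, True],
    "geneD": [False, False, True, False],
}

core, variable = split_core_variable(toy_calls)

# Fraction of genes that are absent in at least one strain.
variable_fraction = len(variable) / len(toy_calls)

print("core:", core)
print("variable:", variable)
print("variable fraction:", variable_fraction)
```

The observed core can only stay the same or shrink as more strains are added to such a table, which is why an estimate for all isolates has to be extrapolated rather than read directly from a finite sample.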
