Describing ancient horizontal gene transfers at the nucleotide and gene levels by comparative pathogenicity island genometrics

MOTIVATION Lateral gene transfer is a major mechanism contributing to bacterial genome dynamics and pathovar emergence via pathogenicity island (PAI) spreading. However, since few of these genomic exchanges are experimentally reproducible, it is difficult to establish evolutionary scenarios for the successive PAI transmissions between bacterial genera. Methods initially developed at the gene and/or nucleotide level for genomics, i.e. comparisons of concatenated sequences, ortholog frequency, gene order or dinucleotide usage, were combined and applied here to homologous PAIs: we call this approach comparative PAI genometrics. RESULTS YAPI, a Yersinia PAI, and related islands were compared with measure evolutionary relationships between related modules. Through use of our genometric approach designed for tracking codon usage adaptation and gene phylogeny, an ancient inter-genus PAI transfer was oriented for the first time by characterizing the genomic environment in which the ancestral island emerged and its subsequent transfers to other bacterial genera.

[1]  S Karlin,et al.  Detecting anomalous gene clusters and pathogenicity islands in diverse bacterial genomes. , 2001, Trends in microbiology.

[2]  D. Sankoff,et al.  Gene Order Breakpoint Evidence in Animal Mitochondrial Phylogeny , 1999, Journal of Molecular Evolution.

[3]  A. Danchin,et al.  The genome sequence of the entomopathogenic bacterium Photorhabdus luminescens , 2003, Nature Biotechnology.

[4]  Micah Acinapura,et al.  Computational DNA Sequence Analysis , 2003 .

[5]  V. Escuyer,et al.  Yersinia pseudotuberculosis Harbors a Type IV Pilus Gene Cluster That Contributes to Pathogenicity , 2002, Infection and Immunity.

[6]  J. Hacker,et al.  Pathogenicity islands and the evolution of microbes. , 2000, Annual review of microbiology.

[7]  N. Grishin,et al.  Genome trees constructed using five different approaches suggest new major bacterial clades , 2001, BMC Evolutionary Biology.

[8]  B. Snel,et al.  Genome phylogeny based on gene content , 1999, Nature Genetics.

[9]  E. Herniou,et al.  The genome sequence and evolution of baculoviruses. , 2003, Annual review of entomology.

[10]  D. McGeoch,et al.  Toward a Comprehensive Phylogeny for Mammalian and Avian Herpesviruses , 2000, Journal of Virology.

[11]  H. Ochman,et al.  Lateral gene transfer and the nature of bacterial innovation , 2000, Nature.

[12]  Su-ryang Kim,et al.  Nucleotide sequence of the R721 shufflon , 1992, Journal of bacteriology.

[13]  R. Wilson,et al.  Complete genome sequence of Salmonella enterica serovar Typhimurium LT2 , 2001, Nature.

[14]  M Achtman,et al.  Yersinia pestis, the cause of plague, is a recently emerged clone of Yersinia pseudotuberculosis. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[15]  C. Hutchison,et al.  Gene content phylogeny of herpesviruses. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[17]  J. Wain,et al.  Composition, Acquisition, and Distribution of the Vi Exopolysaccharide-Encoding Salmonella enterica Pathogenicity Island SPI-7 , 2003, Journal of bacteriology.

[18]  Roderic D. M. Page,et al.  TreeView: an application to display phylogenetic trees on personal computers , 1996, Comput. Appl. Biosci..

[19]  S. Karlin,et al.  Genome signature comparisons among prokaryote, plasmid, and mitochondrial DNA. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[20]  B. Snel,et al.  The identification of functional modules from the genomic association of genes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[21]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[22]  M. P. Cummings PHYLIP (Phylogeny Inference Package) , 2004 .

[23]  B. Dujon,et al.  The genomic tree as revealed from whole proteome comparisons. , 1999, Genome research.

[24]  L. Ling,et al.  Proteome-wide analysis of protein function composition reveals the clustering and phylogenetic properties of organisms. , 2002, Molecular phylogenetics and evolution.

[25]  B. J. Hinnebusch,et al.  Yersinia: molecular and cellular biology. , 2004 .

[26]  S. Karlin,et al.  Comparative DNA analysis across diverse genomes. , 1998, Annual review of genetics.

[27]  Gilbert Greub,et al.  A genomic island present along the bacterial chromosome of the Parachlamydiaceae UWE25, an obligate amoebal endosymbiont, encodes a potentially functional F-like conjugative DNA transfer system , 2004, BMC Microbiology.

[28]  Wen-Hsiung Li,et al.  Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[29]  A. Billault,et al.  YAPI, a New Yersinia pseudotuberculosis Pathogenicity Island , 2004, Infection and Immunity.

[30]  S. Fitz-Gibbon,et al.  Whole genome-based phylogenetic analysis of free-living microorganisms. , 1999, Nucleic acids research.

[31]  J. Gower Some distance properties of latent root and vector methods used in multivariate analysis , 1966 .

[32]  S Karlin,et al.  Molecular evolution of herpesviruses: genomic and protein sequence comparisons , 1994, Journal of virology.

[33]  J Hacker,et al.  Whole genome plasticity in pathogenic bacteria. , 2001, Current opinion in microbiology.

[34]  P. Sharp,et al.  Variation in the strength of selected codon usage bias among bacteria , 2005, Nucleic acids research.