Detection and characterization of horizontal transfers in prokaryotes using genomic signature

Horizontal DNA transfer is an important factor of evolution and participates in biological diversity. Unfortunately, the location and length of horizontal transfers (HTs) are known for very few species. The usage of short oligonucleotides in a sequence (the so-called genomic signature) has been shown to be species-specific even in DNA fragments as short as 1 kb. The genomic signature is therefore proposed as a tool to detect HTs. Since DNA transfers originate from species with a signature different from those of the recipient species, the analysis of local variations of signature along recipient genome may allow for detecting exogenous DNA. The strategy consists in (i) scanning the genome with a sliding window, and calculating the corresponding local signature (ii) evaluating its deviation from the signature of the whole genome and (iii) looking for similar signatures in a database of genomic signatures. A total of 22 prokaryote genomes are analyzed in this way. It has been observed that atypical regions make up ∼6% of each genome on the average. Most of the claimed HTs as well as new ones are detected. The origin of putative DNA transfers is looked for among ∼12 000 species. Donor species are proposed and sometimes strongly suggested, considering similarity of signatures. Among the species studied, Bacillus subtilis, Haemophilus Influenzae and Escherichia coli are investigated by many authors and give the opportunity to perform a thorough comparison of most of the bioinformatics methods used to detect HTs.

[1]  S. Karlin,et al.  Dinucleotide relative abundance extremes: a genomic signature. , 1995, Trends in genetics : TIG.

[2]  H. Matsuda,et al.  Biased biological functions of horizontally transferred genes in prokaryotic genomes , 2004, Nature Genetics.

[3]  P. Deschavanne,et al.  Genomic signature: characterization and classification of species assessed by chaos game representation of sequences. , 1999, Molecular biology and evolution.

[4]  F. de la Cruz,et al.  Horizontal gene transfer and the origin of species: lessons from bacteria. , 2000, Trends in microbiology.

[5]  S. Karlin,et al.  Comparative DNA analysis across diverse genomes. , 1998, Annual review of genetics.

[6]  H. J. Jeffrey Chaos game representation of gene structure. , 1990, Nucleic acids research.

[7]  N. Sueoka,et al.  Asymmetric directional mutation pressures in bacteria , 2002, Genome Biology.

[8]  A. Goffeau,et al.  The complete genome sequence of the Gram-positive bacterium Bacillus subtilis , 1997, Nature.

[9]  S Karlin,et al.  Compositional biases of bacterial genomes and evolutionary implications , 1997, Journal of bacteriology.

[10]  M. Syvanen Horizontal gene transfer: evidence and possible consequences. , 1994, Annual review of genetics.

[11]  J. Eisen Horizontal gene transfer among microbial genomes: new insights from complete genome analysis. , 2000, Current opinion in genetics & development.

[12]  A. Danchin,et al.  Evidence for horizontal gene transfer in Escherichia coli speciation. , 1991, Journal of molecular biology.

[13]  S Guindon,et al.  Intragenomic base content variation is a potential source of biases when searching for horizontally transferred genes. , 2001, Molecular biology and evolution.

[14]  Howard Ochman,et al.  Reconciling the many faces of lateral gene transfer. , 2002, Trends in microbiology.

[15]  E. Denamur,et al.  Escherichia coli molecular phylogeny using the incongruence length difference test. , 1998, Molecular biology and evolution.

[16]  S Karlin,et al.  Detecting anomalous gene clusters and pathogenicity islands in diverse bacterial genomes. , 2001, Trends in microbiology.

[17]  Guy Perrière,et al.  G+C3 structuring along the genome: a common feature in prokaryotes. , 2003, Molecular biology and evolution.

[18]  P. Reeves,et al.  Gene transfer is a major factor in bacterial evolution. , 1996, Molecular biology and evolution.

[19]  M. Blaser,et al.  Evolutionary implications of microbial genome tetranucleotide frequency biases. , 2003, Genome research.

[20]  Antoine Danchin,et al.  Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling , 2003, BMC Bioinformatics.

[21]  A Danchin,et al.  Oligonucleotide bias in Bacillus subtilis: general trends and taxonomic comparisons. , 1998, Nucleic acids research.

[22]  W. Doolittle,et al.  Lateral genomics. , 1999, Trends in cell biology.

[23]  S. Karlin,et al.  Global dinucleotide signatures and analysis of genomic heterogeneity. , 1998, Current opinion in microbiology.

[24]  Doolittle Wf Phylogenetic Classification and the Universal Tree , 1999 .

[25]  M. Ragan On surrogate methods for detecting lateral gene transfer. , 2001, FEMS microbiology letters.

[26]  S Karlin,et al.  Detecting Alien Genes in Bacterial Genomes a , 1999, Annals of the New York Academy of Sciences.

[27]  S. Karlin,et al.  Strand compositional asymmetry in bacterial and large viral genomes. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Alain Giron,et al.  Genomic signature is preserved in short DNA fragments , 2000, Proceedings IEEE International Symposium on Bio-Informatics and Biomedical Engineering.

[29]  J. Felsenstein,et al.  A Hidden Markov Model approach to variation among sites in rate of evolution. , 1996, Molecular biology and evolution.

[30]  J. Lobry Asymmetric substitution patterns in the two DNA strands of bacteria. , 1996, Molecular biology and evolution.

[31]  H. Ochman,et al.  Lateral gene transfer and the nature of bacterial innovation , 2000, Nature.

[32]  S M Payne,et al.  Complete Genome Sequence and Comparative Genomics of Shigella flexneri Serotype 2a Strain 2457T , 2003, Infection and Immunity.

[33]  S Karlin,et al.  Heterogeneity of genomes: measures and values. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[34]  M. Borodovsky,et al.  How to interpret an anonymous bacterial genome: machine learning approach to gene identification. , 1998, Genome research.

[35]  James R. Brown Ancient horizontal gene transfer , 2003, Nature Reviews Genetics.

[36]  Borisas Bursteinas,et al.  Tree structured classifiers, interconnected data, and predictive accuracy , 2000, Intell. Data Anal..

[37]  Bin Wang,et al.  Limitations of Compositional Approach to Identifying Horizontally Transferred Genes , 2001, Journal of Molecular Evolution.

[38]  J. M. Smith,et al.  Detecting recombination from gene trees. , 1998, Molecular biology and evolution.

[39]  Mark Hoebeke,et al.  Mining Bacillus subtilis chromosome heterogeneities using hidden Markov models. , 2002, Nucleic acids research.

[40]  F. Blattner,et al.  Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[41]  J. Lake,et al.  Horizontal gene transfer among genomes: the complexity hypothesis. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[42]  A Danchin,et al.  Codon usage and lateral gene transfer in Bacillus subtilis. , 1999, Current opinion in microbiology.

[43]  M. Ledoux The concentration of measure phenomenon , 2001 .

[44]  G. Perrière,et al.  The source of laterally transferred genes in bacterial genomes , 2003, Genome Biology.

[45]  G. Perrière,et al.  Use and misuse of correspondence analysis in codon usage studies. , 2002, Nucleic acids research.

[46]  R. Sandberg,et al.  Capturing whole-genome characteristics in short sequences using a naïve Bayesian classifier. , 2001, Genome research.

[47]  C. Dutta,et al.  Horizontal gene transfer and bacterial diversity , 2002, Journal of Biosciences.

[48]  Alain Giron,et al.  A genomic schism in birds revealed by phylogenetic analysis of DNA strings. , 2002, Systematic biology.

[49]  L. Koski,et al.  Codon bias and base composition are poor indicators of horizontally transferred genes. , 2001, Molecular biology and evolution.

[50]  J A Eisen,et al.  Assessing evolutionary relationships among microbes from whole-genome analysis. , 2000, Current opinion in microbiology.

[51]  Santiago Garcia-Vallvé,et al.  HGT-DB: a database of putative horizontally transferred genes in prokaryotic complete genomes , 2003, Nucleic Acids Res..

[52]  S. Karlin Bacterial DNA strand compositional asymmetry. , 1999, Trends in microbiology.

[53]  R F Doolittle,et al.  Evolution by acquisition: the case for horizontal gene transfers. , 1992, Trends in biochemical sciences.

[54]  A. Jeltsch,et al.  Horizontal gene transfer contributes to the wide distribution and evolution of type II restriction-modification systems , 1996, Journal of Molecular Evolution.

[55]  E V Koonin,et al.  Evolution of aminoacyl-tRNA synthetases--analysis of unique domain architectures and phylogenetic trees reveals a complex history of horizontal gene transfer events. , 1999, Genome research.

[56]  C R Woese,et al.  An archaeal genomic signature. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[57]  S. Karlin,et al.  Statistical significance of sequence patterns in proteins. , 1995, Current opinion in structural biology.

[58]  J. Fleiss Statistical methods for rates and proportions , 1974 .

[59]  S Karlin,et al.  Similarities and dissimilarities of phage genomes. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[60]  H. Ochman,et al.  Amelioration of Bacterial Genomes: Rates of Change and Exchange , 1997, Journal of Molecular Evolution.

[61]  L. Orgel,et al.  Phylogenetic Classification and the Universal Tree , 1999 .

[62]  H. Ochman,et al.  Molecular archaeology of the Escherichia coli genome. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[63]  N. W. Davis,et al.  Genome sequence of enterohaemorrhagic Escherichia coli O157:H7 , 2001, Nature.

[64]  S. Garcia-Vallvé,et al.  Horizontal gene transfer in bacterial and archaeal complete genomes. , 2000, Genome research.