Identification of Prophages in Bacterial Genomes by Dinucleotide Relative Abundance Difference

Background Prophages are integrated viral forms in bacterial genomes that have been found to contribute to interstrain genetic variability. Many virulence-associated genes are reported to be prophage encoded. Present computational methods to detect prophages are either by identifying possible essential proteins such as integrases or by an extension of this technique, which involves identifying a region containing proteins similar to those occurring in prophages. These methods suffer due to the problem of low sequence similarity at the protein level, which suggests that a nucleotide based approach could be useful. Methodology Earlier dinucleotide relative abundance (DRA) have been used to identify regions, which deviate from the neighborhood areas, in genomes. We have used the difference in the dinucleotide relative abundance (DRAD) between the bacterial and prophage DNA to aid location of DNA stretches that could be of prophage origin in bacterial genomes. Prophage sequences which deviate from bacterial regions in their dinucleotide frequencies are detected by scanning bacterial genome sequences. The method was validated using a subset of genomes with prophage data from literature reports. A web interface for prophage scan based on this method is available at http://bicmku.in:8082/prophagedb/dra.html. Two hundred bacterial genomes which do not have annotated prophages have been scanned for prophage regions using this method. Conclusions The relative dinucleotide distribution difference helps detect prophage regions in genome sequences. The usefulness of this method is seen in the identification of 461 highly probable loci pertaining to prophages which have not been annotated so earlier. This work emphasizes the need to extend the efforts to detect and annotate prophage elements in genome sequences.

[1]  P. Reeves,et al.  Gene transfer is a major factor in bacterial evolution. , 1996, Molecular biology and evolution.

[2]  S. Casjens,et al.  Prophages and bacterial genomics: what have we learned so far? , 2003, Molecular microbiology.

[3]  A. Gallagher,et al.  Alteration of a single amino acid residue reverses fosfomycin resistance of recombinant MurA from Mycobacterium tuberculosis. , 1999, Microbiology.

[4]  V. Makarov Computer programs for eukaryotic gene prediction , 2002, Briefings Bioinform..

[5]  Emmanuelle Lerat,et al.  The relative abundance of dinucleotides in transposable elements in five species. , 2002, Molecular biology and evolution.

[6]  H. Brüssow,et al.  Phage-Host Interaction: an Ecological Perspective , 2004, Journal of bacteriology.

[7]  S. Karlin,et al.  Dinucleotide relative abundance extremes: a genomic signature. , 1995, Trends in genetics : TIG.

[8]  Rekha R Meyer,et al.  Comparison of genome degradation in Paratyphi A and Typhi, human-restricted serovars of Salmonella enterica that cause typhoid , 2004, Nature Genetics.

[9]  S Karlin,et al.  Heterogeneity of genomes: measures and values. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Wolf-Dietrich Hardt,et al.  Phages and the Evolution of Bacterial Pathogens: from Genomic Rearrangements to Lysogenic Conversion , 2004, Microbiology and Molecular Biology Reviews.

[11]  K. Kurokawa,et al.  Diversification of Escherichia coli genomes: are bacteriophages the major contributors? , 2001, Trends in microbiology.

[12]  C. Bloch,et al.  "Black holes" and bacterial pathogenicity: a large genomic deletion that enhances the virulence of Shigella spp. and enteroinvasive Escherichia coli. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[13]  S Karlin,et al.  Similarities and dissimilarities of phage genomes. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Sayera Banu,et al.  Are the PE‐PGRS proteins of Mycobacterium tuberculosis variable surface antigens? , 2002, Molecular microbiology.

[15]  X. Nassif,et al.  Bacteriophages and pathogenicity: more than just providing a toxin? , 2006, Microbes and infection.

[16]  C. Dutta,et al.  Horizontal gene transfer and bacterial diversity , 2002, Journal of Biosciences.

[17]  Robert Barber,et al.  Prophage Finder: A Prophage Loci Prediction Tool for Prokaryotic Genome Sequences , 2006, Silico Biol..

[18]  D. Fouts Phage_Finder: Automated identification and classification of prophage regions in complete bacterial genome sequences , 2006, Nucleic acids research.

[19]  N. Federspiel,et al.  Granuloma-specific expression of Mycobacterium virulence proteins from the glycine-rich PE-PGRS family. , 2000, Science.

[20]  Herbert Schmidt,et al.  Pathogenicity Islands in Bacterial Pathogenesis , 2004, Clinical Microbiology Reviews.

[21]  Sung Ho Yoon,et al.  A computational approach for identifying pathogenicity islands in prokaryotic genomes , 2005, BMC Bioinformatics.

[22]  Yuansha Chen,et al.  Comparative Genomic Analyses of the Vibrio Pathogenicity Island and Cholera Toxin Prophage Regions in Nonepidemic Serogroup Strains of Vibrio cholerae , 2003, Applied and Environmental Microbiology.

[23]  J. W. Wilson,et al.  Mechanisms of bacterial pathogenicity , 2002, Postgraduate medical journal.

[24]  S Karlin,et al.  Compositional biases of bacterial genomes and evolutionary implications , 1997, Journal of bacteriology.

[25]  R. Rappuoli,et al.  Did the inheritance of a pathogenicity island modify the virulence of Helicobacter pylori? , 1997, Trends in microbiology.

[26]  B. Davis,et al.  Bacteriophage-bacteriophage interactions in the evolution of pathogenic bacteria. , 2001, Trends in microbiology.

[27]  S. Karlin,et al.  Global dinucleotide signatures and analysis of genomic heterogeneity. , 1998, Current opinion in microbiology.

[28]  Jun Yu,et al.  VFDB: a reference database for bacterial virulence factors , 2004, Nucleic Acids Res..

[29]  A. Fraser,et al.  Genome sequence of the enterobacterial phytopathogen Erwinia carotovora subsp. atroseptica and characterization of virulence factors. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Matthew K. Waldor,et al.  Bacteriophage Control of Bacterial Virulence , 2002, Infection and Immunity.

[31]  Jaime Prilusky,et al.  Database and Comparative Identification of Prophages , 2006 .

[32]  Richard H. Price,et al.  Black Holes , 1997 .

[33]  G. Fournous,et al.  Phage as agents of lateral gene transfer. , 2003, Current opinion in microbiology.

[34]  D. Fouts,et al.  The Bacillus anthracis chromosome contains four conserved, excision-proficient, putative prophages , 2006, BMC Microbiology.

[35]  Songnian Hu,et al.  The genome sequence of Salmonella enterica serovar Choleraesuis, a highly invasive and resistant zoonotic pathogen , 2005, Nucleic acids research.

[36]  J Hacker,et al.  Pathogenicity islands of virulent bacteria: structure, function and impact on microbial evolution , 1997, Molecular microbiology.

[37]  M. Waldor,et al.  Molecular Analyses of a Putative CTXφ Precursor and Evidence for Independent Acquisition of Distinct CTXφs by Toxigenic Vibrio cholerae , 2000, Journal of bacteriology.

[38]  M. Waldor,et al.  Infectious CTXΦ and the Vibrio Pathogenicity Island Prophage in Vibrio mimicus: Evidence for Recent Horizontal Transfer between V. mimicus and V. cholerae , 2000, Infection and Immunity.

[39]  Jeff Smith The social evolution of bacterial pathogenesis , 2001, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[40]  A Protein Similarity Approach For Detecting Prophage Regions In Bacterial Genomes , 2005, Genome Biology.

[41]  W Arber,et al.  Genetic variation: molecular mechanisms and impact on microbial evolution. , 2000, FEMS microbiology reviews.

[42]  Ghislain Fournous,et al.  The impact of prophages on bacterial chromosomes , 2004, Molecular microbiology.

[43]  Ghislain Fournous,et al.  Prophage Genomics , 2003, Microbiology and Molecular Biology Reviews.

[44]  R. Hendrix,et al.  Evolutionary relationships among diverse bacteriophages and prophages: all the world's a phage. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[45]  J. Hacker,et al.  Ecological fitness, genomic islands and bacterial pathogenicity , 2001, EMBO reports.

[46]  Santiago Garcia-Vallvé,et al.  HGT-DB: a database of putative horizontally transferred genes in prokaryotic complete genomes , 2003, Nucleic Acids Res..