A computational approach for identifying pathogenicity islands in prokaryotic genomes

BackgroundPathogenicity islands (PAIs), distinct genomic segments of pathogens encoding virulence factors, represent a subgroup of genomic islands (GIs) that have been acquired by horizontal gene transfer event. Up to now, computational approaches for identifying PAIs have been focused on the detection of genomic regions which only differ from the rest of the genome in their base composition and codon usage. These approaches often lead to the identification of genomic islands, rather than PAIs.ResultsWe present a computational method for detecting potential PAIs in complete prokaryotic genomes by combining sequence similarities and abnormalities in genomic composition. We first collected 207 GenBank accessions containing either part or all of the reported PAI loci. In sequenced genomes, strips of PAI-homologs were defined based on the proximity of the homologs of genes in the same PAI accession. An algorithm reminiscent of sequence-assembly procedure was then devised to merge overlapping or adjacent genomic strips into a large genomic region. Among the defined genomic regions, PAI-like regions were identified by the presence of homolog(s) of virulence genes. Also, GIs were postulated by calculating G+C content anomalies and codon usage bias. Of 148 prokaryotic genomes examined, 23 pathogenic and 6 non-pathogenic bacteria contained 77 candidate PAIs that partly or entirely overlap GIs.ConclusionSupporting the validity of our method, included in the list of candidate PAIs were thirty four PAIs previously identified from genome sequencing papers. Furthermore, in some instances, our method was able to detect entire PAIs for those only partial sequences are available. Our method was proven to be an efficient method for demarcating the potential PAIs in our study. Also, the function(s) and origin(s) of a candidate PAI can be inferred by investigating the PAI queries comprising it. Identification and analysis of potential PAIs in prokaryotic genomes will broaden our knowledge on the structure and properties of PAIs and the evolution of bacterial pathogenesis.

[1]  C. Buchrieser,et al.  Analysis of Genome Plasticity in Pathogenic and Commensal Escherichia coli Isolates by Use of DNA Arrays , 2003, Journal of bacteriology.

[2]  H. Ochman,et al.  Amelioration of Bacterial Genomes: Rates of Change and Exchange , 1997, Journal of Molecular Evolution.

[3]  Y. Nakamura,et al.  Complete genome structure of the nitrogen-fixing symbiotic bacterium Mesorhizobium loti. , 2000, DNA research : an international journal for rapid publication of reports on genes and genomes.

[4]  M. Hensel,et al.  Molecular and functional analysis indicates a mosaic structure of Salmonella pathogenicity island 2 , 1999, Molecular microbiology.

[5]  Jonathan A. Eisen,et al.  Microbial genome sequencing , 2000, Nature.

[6]  Eugene W. Myers,et al.  Whole-genome DNA sequencing , 1999, Comput. Sci. Eng..

[7]  S. Garcia-Vallvé,et al.  Horizontal gene transfer in bacterial and archaeal complete genomes. , 2000, Genome research.

[8]  Pietro Liò,et al.  Finding pathogenicity islands and gene transfer events in genome data , 2000, Bioinform..

[9]  Ulrich Dobrindt,et al.  Genomic islands in pathogenic and environmental microorganisms , 2004, Nature Reviews Microbiology.

[10]  J. F. Kim Revisiting the chlamydial type III protein secretion system: clues to the origin of type III protein secretion. , 2001, Trends in genetics : TIG.

[11]  S Karlin,et al.  Detecting anomalous gene clusters and pathogenicity islands in diverse bacterial genomes. , 2001, Trends in microbiology.

[12]  K. van Dijk,et al.  The Pseudomonas syringae Hrp pathogenicity island has a tripartite mosaic structure composed of a cluster of type III secretion genes bounded by exchangeable effector and conserved effector loci that contribute to parasitic fitness and pathogenicity in plants. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Ulrich Dobrindt,et al.  Genetic Structure and Distribution of Four Pathogenicity Islands (PAI I536 to PAI IV536) of Uropathogenic Escherichia coli Strain 536 , 2002, Infection and Immunity.

[14]  M. Ragan On surrogate methods for detecting lateral gene transfer. , 2001, FEMS microbiology letters.

[15]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[16]  K. Rajakumar,et al.  Ferric Dicitrate Transport System (Fec) of Shigella flexneri 2a YSH6000 Is Encoded on a Novel Pathogenicity Island Carrying Multiple Antibiotic Resistance Genes , 2001, Infection and Immunity.

[17]  Antonio Marín,et al.  Preference for guanosine at first codon position in highly expressed Escherichia coli genes. A relationship with translational efficiency , 1996, Nucleic Acids Res..

[18]  Qiang Tu,et al.  Detecting pathogenicity islands and anomalous gene clusters by iterative discriminant analysis. , 2003, FEMS microbiology letters.

[19]  J. Eisen Horizontal gene transfer among microbial genomes: new insights from complete genome analysis. , 2000, Current opinion in genetics & development.

[20]  J. Hacker,et al.  Pathogenicity islands and other mobile virulence elements , 1999 .

[21]  Rainer Merkl,et al.  SIGI: score-based identification of genomic islands , 2004, BMC Bioinformatics.

[22]  Bin Wang,et al.  Limitations of Compositional Approach to Identifying Horizontally Transferred Genes , 2001, Journal of Molecular Evolution.

[23]  J. Hacker,et al.  Pathogenicity Islands and the Evolution of Pathogenic Microbes , 2002, Current Topics in Microbiology and Immunology.

[24]  C. Hueck,et al.  Type III Protein Secretion Systems in Bacterial Pathogens of Animals and Plants , 1998, Microbiology and Molecular Biology Reviews.

[25]  Herbert Schmidt,et al.  Pathogenicity Islands in Bacterial Pathogenesis , 2004, Clinical Microbiology Reviews.

[26]  W. Goebel,et al.  Listeria Pathogenesis and Molecular Virulence Determinants , 2001, Clinical Microbiology Reviews.

[27]  Guy Plunkett,et al.  Genome Sequence of Yersinia pestis KIM , 2002, Journal of bacteriology.

[28]  A. Danchin,et al.  Unique physiological and pathogenic features of Leptospira interrogans revealed by whole-genome sequencing , 2003, Nature.

[29]  Kelly P. Williams,et al.  Islander: a database of integrative islands in prokaryotic genomes, the associated integrases and their DNA site specificities , 2004, Nucleic Acids Res..