ACLAME: A CLAssification of Mobile genetic Elements

The ACLAME database (http://aclame.ulb.ac.be) is a collection and classification of prokaryotic mobile genetic elements (MGEs) from various sources, comprising all known phage genomes, plasmids and transposons. In addition to providing information on the full genomes and genetic entities, it aims to build a comprehensive classification of the functional modules of MGEs at the protein, gene and higher levels. This first version contains a comprehensive classification of 5069 proteins from 119 DNA bacteriophages into over 400 functional families. The classification was produced automatically using TRIBE-MCL, a graph-theory-based Markov clustering algorithm that takes sequence similarity measures as input, and was then manually curated. Manual curation was aided by annotations from public databases, retrieved through additional sequence similarity searches using PSI-BLAST and hidden Markov models. The database is publicly accessible and open to expert volunteers willing to participate in its curation. Its web interface allows both browsing and querying of the classification. The main objectives are to collect and rationally organize the complexity inherent to MGEs, to extend and improve the inadequate annotation currently associated with MGEs, and to screen known genomes for the validation and discovery of new MGEs.
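
To make the clustering step more concrete, the following is a minimal sketch of the general Markov clustering idea (alternating expansion and inflation on a similarity graph). It is not the TRIBE-MCL implementation used to build ACLAME: the function names, the inflation value of 2.0, the convergence tolerance and the toy similarity scores are all assumptions chosen for illustration.

```python
def normalize_columns(matrix):
    """Rescale each column so that it sums to 1 (column-stochastic matrix)."""
    n = len(matrix)
    result = [row[:] for row in matrix]
    for j in range(n):
        total = sum(matrix[i][j] for i in range(n))
        if total > 0:
            for i in range(n):
                result[i][j] = matrix[i][j] / total
    return result


def multiply(a, b):
    """Plain square-matrix product, used for the expansion step."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def markov_cluster(similarity, inflation=2.0, max_iterations=100, tolerance=1e-9):
    """Group items by Markov clustering on a symmetric similarity matrix.

    Hypothetical helper for illustration; parameter defaults are assumptions.
    """
    n = len(similarity)
    # Add self-loops so that every node keeps some probability mass on itself.
    matrix = [[float(similarity[i][j]) + (1.0 if i == j else 0.0)
               for j in range(n)] for i in range(n)]
    matrix = normalize_columns(matrix)
    for _ in range(max_iterations):
        # Expansion: let the random walk take two steps.
        expanded = multiply(matrix, matrix)
        # Inflation: raise entries to a power and renormalize, which
        # strengthens strong links and weakens weak ones.
        inflated = normalize_columns(
            [[value ** inflation for value in row] for row in expanded])
        change = max(abs(inflated[i][j] - matrix[i][j])
                     for i in range(n) for j in range(n))
        matrix = inflated
        if change < tolerance:
            break
    # Each row that still carries weight lists the members of one family.
    families = []
    for i in range(n):
        members = frozenset(j for j in range(n) if matrix[i][j] > 1e-6)
        if members and members not in families:
            families.append(members)
    return sorted(sorted(family) for family in families)


# Toy input (made-up scores): proteins 0-2 and 3-5 are mutually similar,
# with one weak similarity between protein 2 and protein 3.
similarity = [
    [0.0, 1.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 0.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.1, 0.0, 0.0],
    [0.0, 0.0, 0.1, 0.0, 1.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 1.0, 0.0],
]
families = markov_cluster(similarity)
print(families)
```

In a real pipeline the similarity matrix would be derived from pairwise sequence comparison scores rather than hand-written values, and the inflation parameter would control how finely the proteins are split into families.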
