GenColors-based comparative genome databases for small eukaryotic genomes

Many sequence data repositories can give a quick and easily accessible overview on genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources.

[1]  Mary Goldman,et al.  The UCSC Genome Browser database: extensions and updates 2011 , 2011, Nucleic Acids Res..

[2]  J. Townsend,et al.  Horizontal gene transfer, genome innovation and evolution , 2005, Nature Reviews Microbiology.

[3]  W. Fitch Distinguishing homologous from analogous proteins. , 1970, Systematic zoology.

[4]  M. Kanehisa,et al.  The KEGG databases and tools facilitating omics analysis: latest developments involving human diseases and pharmaceuticals. , 2012, Methods in molecular biology.

[5]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[6]  Mary Goldman,et al.  The UCSC Genome Browser database: extensions and updates 2013 , 2012, Nucleic Acids Res..

[7]  Matthias Platzer,et al.  GenColors: annotation and comparative genomics of prokaryotes made easy. , 2007, Methods in molecular biology.

[8]  Matthias Platzer,et al.  Comparative genome analysis: selection pressure on the Borrelia vls cassettes is essential for infectivity , 2006, BMC Genomics.

[9]  T. C. White,et al.  Comparative and functional genomics provide insights into the pathogenicity of dermatophytic fungi , 2011, Genome Biology.

[10]  Jürgen Sühnel,et al.  GenColors: accelerated comparative analysis and annotation of prokaryotic genomes at various stages of completeness , 2005, Bioinform..

[11]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[12]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[13]  Matthew J. Gonzales,et al.  Synonymous-non-synonymous mutation rates between sequences containing ambiguous nucleotides (Syn-SCAN) , 2002, Bioinform..

[14]  María Martín,et al.  The Gene Ontology: enhancements for 2011 , 2011, Nucleic Acids Res..

[15]  Matthias Platzer,et al.  Phylogeny-wide analysis of social amoeba genomes highlights ancient origins for complex intercellular communication. , 2011, Genome research.

[16]  J Sühnel,et al.  Comparative analysis of the Borrelia garinii genome. , 2004, Nucleic acids research.

[17]  M. Nei,et al.  Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. , 1986, Molecular biology and evolution.

[18]  Carsten Damm,et al.  Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models , 2006, BMC Bioinformatics.

[19]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[20]  J. Silberg,et al.  A transposase strategy for creating libraries of circularly permuted proteins , 2012, Nucleic acids research.

[21]  J. Parkhill,et al.  Comparative genomic structure of prokaryotes. , 2004, Annual review of genetics.

[22]  Gautier Koscielny,et al.  Ensembl Genomes: an integrative resource for genome-scale data from non-vertebrate species , 2011, Nucleic Acids Res..

[23]  The UniProt Consortium,et al.  Reorganizing the protein space at the Universal Protein Resource (UniProt) , 2011, Nucleic Acids Res..

[24]  Amos Bairoch,et al.  PROSITE, a protein domain database for functional characterization and annotation , 2009, Nucleic Acids Res..

[25]  Eric M. Just,et al.  dictyBase update 2011: web 2.0 functionality and the initial steps towards a genome portal for the Amoebozoa , 2010, Nucleic Acids Res..

[26]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[27]  Rolf Apweiler,et al.  InterProScan: protein domains identifier , 2005, Nucleic Acids Res..