Ensembl Genomes: an integrative resource for genome-scale data from non-vertebrate species

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrative resource for genome-scale data from non-vertebrate species. The project exploits and extends technology (for genome annotation, analysis and dissemination) developed in the context of the (vertebrate-focused) Ensembl project and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. Since its launch in 2009, Ensembl Genomes has undergone rapid expansion, with the goal of providing coverage of all major experimental organisms, and additionally including taxonomic reference points to provide the evolutionary context in which genes can be understood. Against the backdrop of a continuing increase in genome sequencing activities in all parts of the tree of life, we seek to work, wherever possible, with the communities actively generating and using data, and are participants in a growing range of collaborations involved in the annotation and analysis of genomes.
