MonarchBase: the monarch butterfly genome database

The monarch butterfly (Danaus plexippus) is emerging as a model organism to study the mechanisms of circadian clocks and animal navigation, and the genetic underpinnings of long-distance migration. The initial assembly of the monarch genome was released in 2011, and the biological interpretation of the genome focused on the butterfly’s migration biology. To make the extensive data associated with the genome accessible to the general biological and lepidopteran communities, we established MonarchBase (available at http://monarchbase.umassmed.edu). The database is an open-access, web-available portal that integrates all available data associated with the monarch butterfly genome. Moreover, MonarchBase provides access to an updated version of genome assembly (v3) upon which all data integration is based. These include genes with systematic annotation, as well as other molecular resources, such as brain expressed sequence tags, migration expression profiles and microRNAs. MonarchBase utilizes a variety of retrieving methods to access data conveniently and for integrating biological interpretations.

[1]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[2]  Colin N. Dewey,et al.  Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution , 2004, Nature.

[3]  Robert D. Finn,et al.  Rfam: Wikipedia, clans and the “decimal” release , 2010, Nucleic Acids Res..

[4]  Kazuei Mita,et al.  The genome of a lepidopteran model insect, the silkworm Bombyx mori. , 2009, Insect biochemistry and molecular biology.

[5]  L. Brower,et al.  Monarch butterfly orientation: missing pieces of a magnificent puzzle , 1996, The Journal of experimental biology.

[6]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[7]  Rolf Apweiler,et al.  InterProScan: protein domains identifier , 2005, Nucleic Acids Res..

[8]  Matthew R. Pocock,et al.  The Bioperl toolkit: Perl modules for the life sciences. , 2002, Genome research.

[9]  Doina Caragea,et al.  BeetleBase in 2010: revisions to provide comprehensive genomic information for Tribolium castaneum , 2009, Nucleic Acids Res..

[10]  Walter Pirovano,et al.  BIOINFORMATICS APPLICATIONS , 2022 .

[11]  Jim Thurmond,et al.  FlyBase 101 – the basics of navigating FlyBase , 2011, Nucleic Acids Res..

[12]  Keith Bradnam,et al.  CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes , 2007, Bioinform..

[13]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[14]  C. Stoeckert,et al.  OrthoMCL: identification of ortholog groups for eukaryotic genomes. , 2003, Genome research.

[15]  Nansheng Chen,et al.  Genblasta: Enabling Blast to Identify Homologous Gene Sequences , 2022 .

[16]  Steven M. Reppert,et al.  Navigational mechanisms of migrating monarch butterflies , 2010, Trends in Neurosciences.

[17]  Robert D. Finn,et al.  InterPro in 2011: new developments in the family and domain prediction database , 2011, Nucleic acids research.

[18]  Mark Borodovsky,et al.  Eukaryotic Gene Prediction Using GeneMark.hmm‐E and GeneMark‐ES , 2011, Current protocols in bioinformatics.

[19]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[20]  Maureen J Donlin,et al.  Using the Generic Genome Browser (GBrowse) , 2007, Current protocols in bioinformatics.

[21]  Gerard Talavera,et al.  Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. , 2007, Systematic biology.

[22]  Steven Salzberg,et al.  TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders , 2004, Bioinform..

[23]  Ruiqiang Li,et al.  SilkDB v2.0: a platform for silkworm (Bombyx mori ) genome biology , 2009, Nucleic Acids Res..

[24]  Peter F. Hallin,et al.  RNAmmer: consistent and rapid annotation of ribosomal RNA genes , 2007, Nucleic acids research.

[25]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[26]  Peter B. McGarvey,et al.  UniRef: comprehensive and non-redundant UniProt reference clusters , 2007, Bioinform..

[27]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[28]  S. Kanginakudru,et al.  Defining behavioral and molecular differences between summer and migratory monarch butterflies , 2009, BMC Biology.

[29]  Mark Borodovsky,et al.  Eukaryotic Gene Prediction Using GeneMark.hmm , 2003, Current protocols in bioinformatics.

[30]  María Martín,et al.  The Gene Ontology: enhancements for 2011 , 2011, Nucleic Acids Res..

[31]  Dong He,et al.  SpBase: the sea urchin genome database and web site , 2008, Nucleic Acids Res..

[32]  Simon H. Martin,et al.  Butterfly genome reveals promiscuous exchange of mimicry adaptations among species , 2012, Nature.

[33]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[34]  Burkhard Morgenstern,et al.  AUGUSTUS: ab initio prediction of alternative transcripts , 2006, Nucleic Acids Res..

[35]  Sofia M. C. Robb,et al.  MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. , 2007, Genome research.

[36]  Steven M Reppert,et al.  A Colorful Model of the Circadian Clock , 2006, Cell.

[37]  Shuai Zhan,et al.  The Monarch Butterfly Genome Yields Insights into Long-Distance Migration , 2011, Cell.

[38]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[39]  Ian Korf,et al.  Gene finding in novel genomes , 2004, BMC Bioinformatics.

[40]  Tatiana A. Tatusova,et al.  NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy , 2011, Nucleic Acids Res..

[41]  S. Reppert,et al.  Chasing Migration Genes: A Brain Expressed Sequence Tag Resource for Summer and Migratory Monarch Butterflies (Danaus plexippus) , 2008, PloS one.

[42]  Benjamin M. Wheeler,et al.  The dynamic genome of Hydra , 2010, Nature.

[43]  P. Hebert,et al.  Genome size variation in lepidopteran insects , 2003 .

[44]  G. Weinstock,et al.  Creating a honey bee consensus gene set , 2007, Genome Biology.