InsectBase: a resource for insect genomes and transcriptomes

The genomes and transcriptomes of hundreds of insects have been sequenced. However, insect community lacks an integrated, up-to-date collection of insect gene data. Here, we introduce the first release of InsectBase, available online at http://www.insect-genome.com. The database encompasses 138 insect genomes, 116 insect transcriptomes, 61 insect gene sets, 36 gene families of 60 insects, 7544 miRNAs of 69 insects, 96 925 piRNAs of Drosophila melanogaster and Chilo suppressalis, 2439 lncRNA of Nilaparvata lugens, 22 536 pathways of 78 insects, 678 881 untranslated regions (UTR) of 84 insects and 160 905 coding sequences (CDS) of 70 insects. This release contains over 12 million sequences and provides search functionality, a BLAST server, GBrowse, insect pathway construction, a Facebook-like network for the insect community (iFacebook), and phylogenetic analysis of selected genes.

[1]  Fei Li,et al.  Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine , 2005, BMC Bioinformatics.

[2]  Yoshiaki Nagamura,et al.  KAIKObase: An integrated silkworm genome database and data mining tool , 2009, BMC Genomics.

[3]  Mark Gerstein,et al.  Closure of the NCBI SRA and implications for the long-term future of genomics data storage , 2011, Genome Biology.

[4]  Evan Bolton,et al.  Database resources of the National Center for Biotechnology Information , 2017, Nucleic Acids Res..

[5]  Fei Li,et al.  Prediction of piRNAs using transposon interaction and a support vector machine , 2014, BMC Bioinformatics.

[6]  Tian Xia,et al.  BmTEdb: a collective database of transposable elements in the silkworm genome , 2013, Database J. Biol. Databases Curation.

[7]  C. Alex Buerkle,et al.  Stick Insect Genomes Reveal Natural Selection’s Role in Parallel Speciation , 2014, Science.

[8]  Fei Li,et al.  iPathCons and iPathDB: an improved insect pathway construction tool and the database , 2014, Database J. Biol. Databases Curation.

[9]  Doina Caragea,et al.  BeetleBase in 2010: revisions to provide comprehensive genomic information for Tribolium castaneum , 2009, Nucleic Acids Res..

[10]  Sandra Gesing,et al.  VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases , 2014, Nucleic Acids Res..

[11]  Ana Kozomara,et al.  miRBase: annotating high confidence microRNAs using deep sequencing data , 2013, Nucleic Acids Res..

[12]  Jim Thurmond,et al.  FlyBase: introduction of the Drosophila melanogaster Release 6 reference genome assembly and large-scale migration of genome annotations , 2014, Nucleic Acids Res..

[13]  Evgeny M. Zdobnov,et al.  The Newick utilities: high-throughput phylogenetic tree processing in the Unix shell , 2010, Bioinform..

[14]  Gary Moore,et al.  The i5k Workspace@NAL—enabling genomic data access, visualization and curation of arthropod genomes , 2014, Nucleic Acids Res..

[15]  Shuai Zhan,et al.  MonarchBase: the monarch butterfly genome database , 2012, Nucleic Acids Res..

[16]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[17]  M. Ashburner,et al.  The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective , 2002, Genome Biology.

[18]  Feng Chen,et al.  OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups , 2005, Nucleic Acids Res..

[19]  Valentin Guignon,et al.  Chado Controller: advanced annotation management with a community annotation system , 2012, Bioinform..

[20]  Ruiqiang Li,et al.  SilkDB v2.0: a platform for silkworm (Bombyx mori ) genome biology , 2009, Nucleic Acids Res..

[21]  Guang Yang,et al.  DBM-DB: the diamondback moth genome database , 2014, Database J. Biol. Databases Curation.

[22]  Mark L. Blaxter,et al.  ButterflyBase: a platform for lepidopteran genomics , 2007, Nucleic Acids Res..

[23]  Christine G. Elsik,et al.  Hymenoptera Genome Database: integrated community resources for insect species of the order Hymenoptera , 2010, Nucleic Acids Res..

[24]  Akiyasu C. Yoshizawa,et al.  KAAS: an automatic genome annotation and pathway reconstruction server , 2007, Environmental health perspectives.

[25]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[26]  Ernesto Picardi,et al.  UTRdb and UTRsite (RELEASE 2010): a collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs , 2009, Nucleic Acids Res..

[27]  Fei Li,et al.  ChiloDB: a genomic and transcriptome database for an important rice insect pest Chilo suppressalis , 2014, Database J. Biol. Databases Curation.

[28]  Fei Li,et al.  OMIGA: Optimized Maker-Based Insect Genome Annotation , 2014, Molecular Genetics and Genomics.

[29]  O. Collin,et al.  AphidBase: a centralized bioinformatic resource for annotation of the pea aphid genome , 2010, Insect molecular biology.

[30]  Lincoln Stein,et al.  Using GBrowse 2.0 to visualize and share next-generation sequence data , 2013, Briefings Bioinform..

[31]  Colin N. Dewey,et al.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis , 2013, Nature Protocols.

[32]  Matthew Fraser,et al.  InterProScan 5: genome-scale protein function classification , 2014, Bioinform..

[33]  David W Mount,et al.  Using the Basic Local Alignment Search Tool (BLAST). , 2007, CSH protocols.

[34]  Akiya Jouraku,et al.  KONAGAbase: a genomic and transcriptomic database for the diamondback moth, Plutella xylostella , 2013, BMC Genomics.