Motivation Along with the increasing accessibility to comprehensive sequence information, such as whole genomes and transcriptomes, the demand for assessing their quality has been multiplied. To this end, metrics based on sequence lengths, such as N50, have become a standard, but they only evaluate one aspect of assembly quality. Conversely, analyzing the coverage of pre‐selected reference protein‐coding genes provides essential content‐based quality assessment, but the currently available pipelines for this purpose, CEGMA and BUSCO, do not have a user‐friendly interface to serve as a uniform environment for assembly completeness assessment. Results Here, we introduce a brand‐new web server, gVolante, which provides an online tool for (i) on‐demand completeness assessment of sequence sets by means of the previously developed pipelines CEGMA and BUSCO and (ii) browsing pre‐computed completeness scores for publicly available data in its database section. Completeness assessments performed on gVolante report scores based on not just the coverage of reference genes but also on sequence lengths (e.g. N50 scaffold length), allowing quality control in multiple aspects. Using gVolante, one can compare the quality of original assemblies between their multiple versions (obtained through program choice and parameter tweaking, for example) and evaluate them in comparison to the scores of public resources found in the database section. Availability and implementation gVoalte is freely available at https://gvolante.riken.jp/. Contact shigehiro.kuraku@riken.jp
[1]
Evgeny M. Zdobnov,et al.
BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs
,
2015,
Bioinform..
[2]
Yuichiro Hara,et al.
Optimizing and benchmarking de novo transcriptome sequencing: from library preparation to assembly evaluation
,
2015,
BMC Genomics.
[3]
Osamu Nishimura,et al.
aLeaves facilitates on-demand exploration of metazoan gene family trees on MAFFT sequence alignment server with enhanced interactivity
,
2013,
Nucleic Acids Res..
[4]
Inanç Birol,et al.
Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species
,
2013,
GigaScience.
[5]
Keith Bradnam,et al.
CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes
,
2007,
Bioinform..
[6]
Keith Bradnam,et al.
Assessing the gene space in draft genomes
,
2008,
Nucleic acids research.