BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics

Genomics promises comprehensive surveying of genomes and metagenomes, but rapidly changing technologies and expanding data volumes make evaluating completeness a challenging task. Technical sequencing quality metrics can be complemented by quantifying completeness in terms of expected gene content, using Benchmarking Universal Single-Copy Orthologs (BUSCO, http://busco.ezlab.org). Now in its third release, BUSCO offers utilities that extend beyond quality control to applications in comparative genomics, gene predictor training, metagenomics, and phylogenomics.
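
As a rough illustration of what quantifying completeness in terms of expected gene content can look like, the sketch below turns counts of expected single-copy orthologs into percentages. It is not BUSCO's own code: the function names `busco_summary` and `format_summary` are hypothetical, the counts in the usage example are invented, and the four categories (complete single-copy, complete duplicated, fragmented, missing) are assumed to partition the expected set.

```python
def busco_summary(single, duplicated, fragmented, missing):
    """Convert raw ortholog counts into percentage scores.

    Hypothetical helper for illustration only. Assumes each expected
    ortholog falls into exactly one of the four categories.
    """
    total = single + duplicated + fragmented + missing
    if total == 0:
        raise ValueError("at least one expected ortholog is required")

    def pct(count):
        # Percentage of the expected set, rounded to one decimal place.
        return round(100.0 * count / total, 1)

    return {
        "C": pct(single + duplicated),  # complete (single-copy + duplicated)
        "S": pct(single),               # complete and single-copy
        "D": pct(duplicated),           # complete and duplicated
        "F": pct(fragmented),           # fragmented
        "M": pct(missing),              # missing
        "n": total,                     # size of the expected set
    }


def format_summary(scores):
    """Render the scores as a compact one-line string."""
    return "C:{C}%[S:{S}%,D:{D}%],F:{F}%,M:{M}%,n:{n}".format(**scores)


if __name__ == "__main__":
    # Invented counts, used only to exercise the functions.
    print(format_summary(busco_summary(900, 50, 30, 20)))
```

A high share of complete single-copy orthologs suggests a more complete assembly or annotation, while a large duplicated or missing share points to possible redundancy or gaps.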
