Composite Metagenome-Assembled Genomes Reduce the Quality of Public Genome Repositories

In their recent study, Espinoza et al. employ genome-resolved metagenomics to investigate supragingival plaque metagenomes of 88 individuals (1). The 34 metagenome-assembled genomes (MAGs) that the authors report include those that resolve to clades that have largely evaded cultivation efforts, such as Gracilibacteria (formerly GN02) and Saccharibacteria (formerly TM7) of the recently described Candidate Phyla Radiation (2). Generating new genomic insights into the understudied members of the human oral cavity is of critical importance for a comprehensive understanding of the microbial ecology and functioning of this biome, and we acknowledge the contribution of the authors on this front. However, the redundant occurrence of bacterial single-copy core genes suggests that more than half of the MAGs that Espinoza et al. report are composite genomes that do not meet the recent quality guidelines suggested by the community (3). Composite genomes that aggregate sequences originating from multiple distinct populations can yield misleading insights when treated and reported as single genomes (4). To briefly demonstrate their composite nature, we refined some of the key …

[1]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[2]  K. Nelson,et al.  Supragingival Plaque Microbiome Ecology and Functional Potential in the Context of Health and Disease , 2018, mBio.

[3]  Chao Xie,et al.  Fast and sensitive protein alignment using DIAMOND , 2014, Nature Methods.

[4]  Tom O. Delmont,et al.  Anvi’o: an advanced analysis and visualization platform for ‘omics data , 2015, PeerJ.

[5]  A. von Haeseler,et al.  IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies , 2014, Molecular biology and evolution.

[6]  Toni Gabaldón,et al.  trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses , 2009, Bioinform..

[7]  Robert D. Finn,et al.  A new genomic blueprint of the human gut microbiota , 2019, Nature.

[8]  Donovan H. Parks,et al.  Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life , 2017, Nature Microbiology.

[9]  Adam Gudys,et al.  FAMSA: Fast and accurate multiple sequence alignment of huge protein families , 2016, Scientific Reports.

[10]  Brian C. Thomas,et al.  Unusual biology across a group comprising more than 15% of domain Bacteria , 2015, Nature.

[11]  Lewis Stevens,et al.  No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini , 2016, Proceedings of the National Academy of Sciences.

[12]  Tom O. Delmont,et al.  Identifying contamination with advanced visualization and analysis practices: metagenomic approaches for eukaryotic genome assemblies , 2016, PeerJ.

[13]  Brian C. Thomas,et al.  A new view of the tree of life , 2016, Nature Microbiology.

[14]  Natalia N. Ivanova,et al.  Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea , 2017, Nature Biotechnology.

[15]  Stijn van Dongen,et al.  Using MCL to extract clusters from networks. , 2012, Methods in molecular biology.

[16]  Edoardo Pasolli,et al.  Extensive Unexplored Human Microbiome Diversity Revealed by Over 150,000 Genomes from Metagenomes Spanning Age, Geography, and Lifestyle , 2019, Cell.

[17]  D. Söll,et al.  UGA is an additional glycine codon in uncultured SR1 bacteria from the human microbiota , 2013, Proceedings of the National Academy of Sciences.