Assessment of the cPAS-based BGISEQ-500 platform for metagenomic sequencing

Abstract

Background: More extensive use of metagenomic shotgun sequencing in microbiome research relies on the development of high-throughput, cost-effective sequencing. Here we present a comprehensive evaluation of the performance of the new high-throughput sequencing platform BGISEQ-500 for metagenomic shotgun sequencing and compare its performance with that of 2 Illumina platforms.

Findings: Using fecal samples from 20 healthy individuals, we evaluated the intra-platform reproducibility of metagenomic sequencing on the BGISEQ-500 platform in a setup comprising 8 library replicates and 8 sequencing replicates. Cross-platform consistency was evaluated by comparing 20 pairwise replicates on the BGISEQ-500 platform vs the Illumina HiSeq 2000 platform and the Illumina HiSeq 4000 platform. In addition, we compared the performance of the 2 Illumina platforms against each other. Using a newly developed overall accuracy quality control method, we obtained an average of 82.45 million high-quality reads (96.06% of raw reads) per sample on the BGISEQ-500 platform, with 90.56% of bases scoring Q30 or above. Quantitative analyses revealed extremely high reproducibility between BGISEQ-500 intra-platform replicates. Cross-platform replicates differed slightly more than intra-platform replicates, yet consistency remained high. Only a low percentage (2.02%–3.25%) of genes differed significantly in relative abundance between the BGISEQ-500 and HiSeq platforms, with genes of higher GC content tending to be enriched on the HiSeq platforms.

Conclusions: Our study provides the first set of performance metrics for human gut metagenomic sequencing data using BGISEQ-500. The high accuracy and technical reproducibility confirm the applicability of the new platform for metagenomic studies, though caution is still warranted when combining metagenomic data from different platforms.
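For readers unfamiliar with the Q30 figure quoted above, the sketch below illustrates the standard Phred relationship, in which a quality score Q corresponds to a base-calling error probability of 10^(-Q/10), so Q30 means roughly 1 error in 1,000 bases. It is only an illustration and is not the overall accuracy quality control method used in the study. The function names and the Phred+33 ASCII offset are assumptions made for this example.

```python
# Illustrative sketch only: not the quality control method used in the study.
# Assumption: quality strings use the Phred+33 ASCII encoding.

def phred_to_error_prob(q: float) -> float:
    """Convert a Phred quality score to a base-calling error probability."""
    return 10 ** (-q / 10)


def fraction_at_or_above(quality: str, threshold: int = 30, offset: int = 33) -> float:
    """Return the fraction of bases whose Phred score is >= threshold.

    `quality` is the quality line of a single FASTQ record.
    """
    if not quality:
        return 0.0
    scores = [ord(ch) - offset for ch in quality]
    return sum(score >= threshold for score in scores) / len(scores)


if __name__ == "__main__":
    # Q30 corresponds to an error probability of about 0.001.
    print(phred_to_error_prob(30))
    # 'I' encodes Q40 and '5' encodes Q20 under Phred+33.
    print(fraction_at_or_above("IIII5555"))
```

Under these assumptions, a statement such as "90.56% of bases scoring Q30 or above" corresponds to this fraction computed over all bases in a sample rather than over a single read.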
