Phasing amplicon sequencing on Illumina Miseq for robust environmental microbial community analysis

BackgroundAlthough high-throughput sequencing, such as Illumina-based technologies (e.g. MiSeq), has revolutionized microbial ecology, adaptation of amplicon sequencing for environmental microbial community analysis is challenging due to the problem of low base diversity.ResultsA new phasing amplicon sequencing approach (PAS) was developed by shifting sequencing phases among different community samples from both directions via adding various numbers of bases (0–7) as spacers to both forward and reverse primers. Our results first indicated that the PAS method substantially ameliorated the problem of unbalanced base composition. Second, the PAS method substantially improved the sequence read base quality (an average of 10 % higher of bases above Q30). Third, the PAS method effectively increased raw sequence throughput (~15 % more raw reads). In addition, the PAS method significantly increased effective reads (9–47 %) and the effective read sequence length (16–96 more bases) after quality trim at Q30 with window 5. In addition, the PAS method reduced half of the sequencing errors (0.54–1.1 % less). Finally, two-step PCR amplification of the PAS method effectively ameliorated the amplification biases introduced by the long barcoded PCR primers.ConclusionThe developed strategy is robust for 16S rRNA gene amplicon sequencing. In addition, a similar strategy could also be used for sequencing other genes important to ecosystem functional processes

[1]  Marti J. Anderson,et al.  A new method for non-parametric multivariate analysis of variance in ecology , 2001 .

[2]  Steven Salzberg,et al.  BIOINFORMATICS ORIGINAL PAPER , 2004 .

[3]  Gregory B. Gloor,et al.  Deep Sequencing of the Vaginal Microbiota of Women with HIV , 2010, PloS one.

[4]  Jizhong Zhou,et al.  Reproducibility and quantitation of amplicon sequencing-based detection , 2011, The ISME Journal.

[5]  Sharon L. Grim,et al.  Analysis, Optimization and Verification of Illumina-Generated 16S rRNA Gene Amplicon Surveys , 2014, PloS one.

[6]  Rob Knight,et al.  UCHIME improves sensitivity and speed of chimera detection , 2011, Bioinform..

[7]  Jizhong Zhou,et al.  Evaluation of PCR-Generated Chimeras, Mutations, and Heteroduplexes with 16S rRNA Gene-Based Cloning , 2001, Applied and Environmental Microbiology.

[8]  J. Clemente,et al.  The Long-Term Stability of the Human Gut Microbiota , 2013 .

[9]  W. Liesack,et al.  The ammonia monooxygenase structural gene amoA as a functional marker: molecular fine-scale analysis of natural ammonia-oxidizing populations , 1997, Applied and environmental microbiology.

[10]  K. R. Clarke,et al.  Non‐parametric multivariate analyses of changes in community structure , 1993 .

[11]  William A. Walters,et al.  Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample , 2010, Proceedings of the National Academy of Sciences.

[12]  Fei Zou,et al.  BIPES, a cost-effective high-throughput method for assessing microbial diversity , 2011, The ISME Journal.

[13]  H. Ochman,et al.  Illumina-based analysis of microbial community diversity , 2011, The ISME Journal.

[14]  Ashish Choudhary,et al.  Targeted, high-depth, next-generation sequencing of cancer genes in formalin-fixed, paraffin-embedded and fine-needle aspiration tumor specimens. , 2013, The Journal of molecular diagnostics : JMD.

[15]  Mehrdad Hajibabaei,et al.  Simultaneous assessment of the macrobiome and microbiome in a bulk sample of tropical arthropods through DNA metasystematics , 2014, Proceedings of the National Academy of Sciences.

[16]  Jizhong Zhou,et al.  Microbial community structure and function of nitrobenzene reduction biocathode in response to carbon source switchover. , 2014, Water research.

[17]  L. Raskin,et al.  PCR Biases Distort Bacterial and Archaeal Community Structure in Pyrosequencing Datasets , 2012, PloS one.

[18]  M. Schloter,et al.  Temporal Dynamics of Abundance and Composition of Nitrogen-Fixing Communities across Agricultural Soils , 2013, PloS one.

[19]  Jean M. Macklaim,et al.  Vaginal Microbiome and Epithelial Gene Array in Post-Menopausal Women with Moderate to Severe Dryness , 2011, PloS one.

[20]  Jean M. Macklaim,et al.  Microbiome Profiling by Illumina Sequencing of Combinatorial Sequence-Tagged PCR Products , 2010, PLoS ONE.

[21]  Jacques Ravel,et al.  An improved dual-indexing approach for multiplexed 16S rRNA gene sequencing on the Illumina MiSeq platform , 2014, Microbiome.

[22]  W. Whitman,et al.  Prokaryotes: the unseen majority. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Paul W. Mielke,et al.  Use of an Improved Statistical Method for Group Comparisons to Study Effects of Prairie Fire , 1985 .

[24]  P. Mieczkowski,et al.  Practical innovations for high-throughput amplicon sequencing , 2013, Nature Methods.

[25]  B. Roe,et al.  Elevated Carbon Dioxide Alters the Structure of Soil Microbial Communities , 2012, Applied and Environmental Microbiology.

[26]  R. Sinke,et al.  Targeted Next‐Generation Sequencing can Replace Sanger Sequencing in Clinical Diagnostics , 2013, Human mutation.

[27]  Brian C. Thomas,et al.  Microbes in the neonatal intensive care unit resemble those found in the gut of premature infants , 2014, Microbiome.

[28]  M. Wagner,et al.  Barcoded Primers Used in Multiplex Amplicon Pyrosequencing Bias Amplification , 2011, Applied and Environmental Microbiology.

[29]  Sandra L. McLellan,et al.  Analysis of the Gull Fecal Microbial Community Reveals the Dominance of Catellicoccus marimammalium in Relation to Culturable Enterococci , 2013, Applied and Environmental Microbiology.

[30]  P. Qian,et al.  Conservative Fragments in Bacterial 16S rRNA Genes and Primer Design for 16S Ribosomal DNA Amplicons in Metagenomic Studies , 2009, PloS one.

[31]  William A. Walters,et al.  Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms , 2012, The ISME Journal.

[32]  Cameron S. Osborne,et al.  Large Scale Loss of Data in Low-Diversity Illumina Sequencing Libraries Can Be Recovered by Deferred Cluster Calling , 2011, PloS one.

[33]  J. Tiedje,et al.  DNA recovery from soils of diverse composition , 1996, Applied and environmental microbiology.

[34]  Carbon dioxide and hydrogen sulfide associations with regional bacterial diversity patterns in microbially induced concrete corrosion. , 2014, Environmental science & technology.

[35]  A. Kohlmann,et al.  Robustness of amplicon deep sequencing underlines its utility in clinical applications. , 2013, The Journal of molecular diagnostics : JMD.

[36]  Yong Kong,et al.  Btrim: A fast, lightweight adapter and quality trimming program for next-generation sequencing technologies , 2011, Genomics.

[37]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[38]  José Costa,et al.  PicoGreen quantitation of DNA: effective evaluation of samples pre- or post-PCR , 1996, Nucleic Acids Res..

[39]  Sarah L. Westcott,et al.  Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform , 2013, Applied and Environmental Microbiology.

[40]  Jizhong Zhou,et al.  Improvements on environmental DNA extraction and purification procedures for matagenomic analysis , 2012 .

[41]  Andrea K. Bartram,et al.  Generation of Multimillion-Sequence 16S rRNA Gene Libraries from Complex Microbial Communities by Assembling Paired-End Illumina Reads , 2011, Applied and Environmental Microbiology.

[42]  J. Tiedje,et al.  Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy , 2007, Applied and Environmental Microbiology.