Quantitative and qualitative assessment of pollen DNA metabarcoding using constructed species mixtures

Pollen DNA metabarcoding—marker‐based genetic identification of potentially mixed‐species pollen samples—has applications across a variety of fields. While basic species‐level pollen identification using standard DNA barcode markers is established, the extent to which metabarcoding (a) correctly assigns species identities to mixes (qualitative matching) and (b) generates sequence reads proportionally to their relative abundance in a sample (quantitative matching) is unclear, as these have not been assessed relative to known standards. We tested the quantitative and qualitative robustness of metabarcoding in constructed pollen mixtures varying in species richness (1–9 species), taxonomic relatedness (within genera to across class) and rarity (5%–100% of grains), using Illumina MiSeq with the markers rbcL and ITS2. Qualitatively, species composition determinations were largely correct, but false positives and negatives occurred. False negatives were typically driven by lack of a barcode gap or rarity in a sample. Species richness and taxonomic relatedness, however, did not strongly impact correct determinations. False positives were likely driven by contamination, chimeric sequences and/or misidentification by the bioinformatics pipeline. Quantitatively, the proportion of reads for each species was only weakly correlated with its relative abundance, in contrast to suggestions from some other studies. Quantitative mismatches are not correctable by consistent scaling factors, but instead are context‐dependent on the other species present in a sample. Together, our results show that metabarcoding is largely robust for determining pollen presence/absence but that sequence reads should not be used to infer relative abundance of pollen grains.

[1]  P. Hebert,et al.  Barcoding animal life: cytochrome c oxidase subunit 1 divergences among closely related species , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[2]  Jun Ying Lim,et al.  Estimating and mitigating amplification bias in qualitative and quantitative arthropod metabarcoding , 2017, Scientific Reports.

[3]  M. Doebeli,et al.  Correcting for 16S rRNA gene copy numbers in microbiome surveys remains an unsolved problem , 2018, Microbiome.

[4]  Y. Yarom,et al.  Replacing Sanger with Next Generation Sequencing to improve coverage and quality of reference DNA barcodes for plants , 2017, Scientific Reports.

[5]  B. Brosi,et al.  An rbcL reference library to aid in the identification of plant species mixtures by DNA metabarcoding1 , 2017, Applications in Plant Sciences.

[6]  V. Bryant,et al.  Forensic palynology: current status of a rarely used technique in the United States of America. , 2006, Forensic science international.

[7]  R. Bock,et al.  Chloroplast DNA in Mature and Senescing Leaves: A Reappraisal[W][OPEN] , 2014, Plant Cell.

[8]  B. Deagle,et al.  Quantitative DNA metabarcoding: improved estimates of species proportional biomass using correction factors derived from control material , 2016, Molecular ecology resources.

[9]  Lingang Zhang,et al.  A Conserved, Mg2+-Dependent Exonuclease Degrades Organelle DNA during Arabidopsis Pollen Development[C][W] , 2011, Plant Cell.

[10]  R. Mathewes Forensic palynology in Canada: an overview with emphasis on archaeology and anthropology. , 2006, Forensic science international.

[11]  N. Yoccoz,et al.  Plant DNA metabarcoding of lake sediments: How does it represent the contemporary vegetation , 2018, PloS one.

[12]  Eske Willerslev,et al.  Environmental DNA - An emerging tool in conservation for monitoring past and present biodiversity , 2015 .

[13]  B. Brosi,et al.  Pollen DNA barcoding: current applications and future prospects. , 2016, Genome.

[14]  Jean M. Macklaim,et al.  Microbiome Datasets Are Compositional: And This Is Not Optional , 2017, Front. Microbiol..

[15]  S. Creer,et al.  Using DNA metabarcoding to investigate honey bee foraging reveals limited flower use despite high floral availability , 2017, Scientific Reports.

[16]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[17]  K. Stoof-Leichsenring,et al.  A comparison of sedimentary DNA and pollen from lake sediments in recording vegetation composition at the Siberian treeline , 2017, Molecular ecology resources.

[18]  U. Herzschuh,et al.  Relative pollen productivity estimates for common taxa of the northern Siberian Arctic , 2015 .

[19]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[20]  Berry J. Brosi,et al.  Applying pollen DNA metabarcoding to the study of plant–pollinator interactions , 2017, Applications in Plant Sciences.

[21]  Jonathan A. Eisen,et al.  Incorporating 16S Gene Copy Number Information Improves Estimates of Microbial Diversity and Abundance , 2012, PLoS Comput. Biol..

[22]  T. Porter,et al.  Scaling up: A guide to high‐throughput genomic approaches for biodiversity analysis , 2018, Molecular ecology.

[23]  A. J. Bendich,et al.  Changes in the structure of DNA molecules and the amount of DNA per plastid during chloroplast development in maize. , 2004, Journal of molecular biology.

[24]  J. Pettis,et al.  A Comparison of Honey Bee-Collected Pollen From Working Agricultural Lands Using Light Microscopy and ITS Metabarcoding , 2016, Environmental Entomology.

[25]  J. Bosch,et al.  Plant-pollinator networks: adding the pollinator's perspective. , 2009, Ecology letters.

[26]  Michael Bunce,et al.  From Benchtop to Desktop: Important Considerations when Designing Amplicon Sequencing Workflows , 2015, PloS one.

[27]  J. Louveaux,et al.  Methods of Melissopalynology: International Commission for BEE Botany of I.U.B.S. , 1970 .

[28]  Ting Gao,et al.  Validation of the ITS2 Region as a Novel DNA Barcode for Identifying Medicinal Plant Species , 2010, PloS one.

[29]  P. Taberlet,et al.  A comparative study of ancient sedimentary DNA, pollen and macrofossils from permafrost sediments of northern Siberia reveals long‐term vegetational stability , 2012, Molecular ecology.

[30]  J. Vallès,et al.  Genome size in 21 Artemisia L. species (Asteraceae, Anthemideae): systematic, evolutionary, and ecological implications. , 2001, Genome.

[31]  Shiliang Zhou,et al.  Barcoding the kingdom Plantae: new PCR primers for ITS regions of plants with improved universality and specificity , 2016, Molecular ecology resources.

[32]  Yang Liu,et al.  Examination of the cytoplasmic DNA in male reproductive cells to determine the potential for cytoplasmic inheritance in 295 angiosperm species. , 2003, Plant & cell physiology.

[33]  Sarah L. Westcott,et al.  Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform , 2013, Applied and Environmental Microbiology.

[34]  J. Bengtsson-Palme,et al.  Evaluating and optimizing the performance of software commonly used for the taxonomic classification of DNA metabarcoding sequence data , 2017, Molecular ecology resources.

[35]  S. Sugita,et al.  Theory of quantitative reconstruction of vegetation I: pollen from large sites REVEALS regional vegetation composition , 2007 .

[36]  J. S. Heslop-Harrison,et al.  Nuclear dna amounts in angiosperms. , 1976, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[37]  Berry J Brosi,et al.  Review and future prospects for DNA barcoding methods in forensic palynology. , 2016, Forensic science international. Genetics.

[38]  Nuclear DNA amounts in angiosperms , 1982, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[39]  Jeremy R. deWaard,et al.  Biological identifications through DNA barcodes , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[40]  D. Relman,et al.  Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data , 2017, Microbiome.

[41]  W. John Kress,et al.  A DNA barcode for land plants , 2009, Proceedings of the National Academy of Sciences.

[42]  W. Symondson,et al.  New universal ITS2 primers for high-resolution herbivory analyses using DNA metabarcoding in both tropical and temperate zones , 2018, Scientific Reports.

[43]  D. Baird,et al.  Large-Scale Monitoring of Plants through Environmental DNA Metabarcoding of Soil: Recovery, Resolution, and Annotation of Four DNA Markers , 2016, PloS one.

[44]  Damon P. Little,et al.  Choosing and Using a Plant DNA Barcode , 2011, PloS one.

[45]  M. Wilkinson,et al.  Quantitative evaluation of bias in PCR amplification and next-generation sequencing derived from metabarcoding samples , 2015, Analytical and Bioanalytical Chemistry.

[46]  V. Grant The Systematic and Geographical Distribution of Hawkmoth Flowers in the Temperate North American Flora , 1983, Botanical Gazette.

[47]  Y. Salmaki,et al.  Pollen morphology of Stachys (Lamiaceae) in Iran and its systematic implication , 2008 .

[48]  L. Baillie,et al.  Using DNA Metabarcoding to Identify the Floral Composition of Honey: A New Tool for Investigating Honey Bee Foraging Preferences , 2015, PloS one.

[49]  Y. Salmaki,et al.  Pollen morphology of Campanula (Campanulaceae) and allied genera in Iran with special focus on its systematic implication , 2012 .

[50]  M. Cristescu,et al.  From barcoding single individuals to metabarcoding biological communities: towards an integrative approach to the study of global biodiversity. , 2014, Trends in ecology & evolution.

[51]  Ingolf Steffan-Dewenter,et al.  Increased efficiency in identifying mixed pollen samples by meta-barcoding with a dual-indexing approach , 2015, BMC Ecology.

[52]  J. Tiedje,et al.  Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy , 2007, Applied and Environmental Microbiology.

[53]  P. Taberlet,et al.  Towards next‐generation biodiversity assessment using DNA metabarcoding , 2012, Molecular ecology.

[54]  Y. Suyama,et al.  DNA sequence from a fossil pollen of Abies spp. from Pleistocene peat. , 1996, Genes & genetic systems.

[55]  Aurélien Ginolhac,et al.  A comparative study of ancient environmental DNA to pollen and macrofossils from lake sediments reveals taxonomic overlap and additional plant taxa , 2013 .

[56]  J. T. Dunnen,et al.  Efficient and sensitive identification and quantification of airborne pollen using next‐generation DNA sequencing , 2015, Molecular ecology resources.

[57]  Pierre Taberlet,et al.  Using metabarcoding to reveal and quantify plant-pollinator interactions , 2016, Scientific Reports.

[58]  G. Ficetola,et al.  Ancient plant DNA in lake sediments. , 2017, The New phytologist.

[59]  W. Alverson,et al.  New reports of nuclear DNA content for 407 vascular plant taxa from the United States. , 2012, Annals of botany.

[60]  Andrea Galimberti,et al.  A DNA Barcoding Approach to Characterize Pollen Collected by Honeybees , 2014, PloS one.

[61]  J. Pettis,et al.  Taxonomic Characterization of Honey Bee (Apis mellifera) Pollen Foraging Based on Non-Overlapping Paired-End Sequencing of Nuclear Ribosomal Loci , 2015, PloS one.

[62]  D. Southworth SOLUBILITY OF POLLEN EXINES , 1974 .

[63]  P. Taberlet,et al.  DNA Barcoding for Honey Biodiversity , 2010 .

[64]  Douglas B. Sponsler,et al.  Application of ITS2 metabarcoding to determine the provenance of pollen collected by honey bees in an agroecosystem , 2015, Applications in plant sciences.

[65]  Rodney,et al.  APPLICATION OF ITS 2 METABARCODING TO DETERMINE THE PROVENANCE OF POLLEN COLLECTED BY HONEY BEES IN AN AGROECOSYSTEM 1 , 2019 .

[66]  A. Galimberti,et al.  A DNA barcoding approach to identify plant species in multiflower honey. , 2015, Food chemistry.

[67]  Hugh J. Beckie,et al.  The biology of Canadian weeds. 138. Kochia scoparia (L.) Schrad. , 2009 .

[68]  I. Leitch,et al.  Nuclear DNA Amounts in Angiosperms and their Modern Uses—807 New Estimates , 2000 .

[69]  C. Zeyl,et al.  Organelle inheritance in plants , 1994, Heredity.

[70]  Jennifer M. Fettweis,et al.  The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies , 2015, BMC Microbiology.

[71]  Peter M Hollingsworth,et al.  Selecting barcoding loci for plants: evaluation of seven candidate loci with species‐level sampling in three divergent groups of land plants , 2009, Molecular ecology resources.

[72]  R. Fyfe,et al.  Pollen productivity estimates of key European plant taxa for quantitative reconstruction of past vegetation: a review , 2008 .

[73]  W. Sakamoto,et al.  Chloroplast Biogenesis: Control of Plastid Development, Protein Import, Division and Inheritance , 2008, The arabidopsis book.

[74]  J. Suda,et al.  Naturalized plants have smaller genomes than their non-invading relatives: a flow cytometric analysis of the Czech alien flora. , 2010 .

[75]  A. Brown,et al.  Sedimentary ancient DNA from Lake Skartjørna, Svalbard: Assessing the resilience of arctic flora to Holocene climate change , 2016 .

[76]  P. Taberlet,et al.  Who is eating what: diet assessment using next generation sequencing , 2012, Molecular ecology.

[77]  Reed M. Johnson,et al.  Rank-based characterization of pollen assemblages collected by honey bees using a multi-locus metabarcoding approach , 2015, Applications in plant sciences.

[78]  R. M. Hodgson,et al.  Progress towards an automated trainable pollen location and classifier system for use in the palynology laboratory , 2011 .

[79]  L. Giongo,et al.  Soft Fruit Traceability in Food Matrices using Real-Time PCR , 2009, Nutrients.

[80]  R. Qu,et al.  Nuclear dna content of thirteen turfgrass species by flow cytometry , 1999, Crop Science.

[81]  I. Steffan‐Dewenter,et al.  Evaluating multiplexed next-generation sequencing as a method in palynology for mixed pollen samples. , 2015, Plant biology.

[82]  Beth Shapiro,et al.  Minimizing polymerase biases in metabarcoding. , 2018, Molecular ecology resources.

[83]  A. Kumar,et al.  Nuclear DNA Amounts in Some Tropical Hardwoods , 1986 .

[84]  K. Karatzas,et al.  Monitoring, Modelling and Forecasting of the Pollen Season , 2013 .

[85]  J. Wilmshurst,et al.  Use of Pollen and Ancient DNA as Conservation Baselines for Offshore Islands in New Zealand , 2014, Conservation biology : the journal of the Society for Conservation Biology.