Increased efficiency in identifying mixed pollen samples by meta-barcoding with a dual-indexing approach

BackgroundMeta-barcoding of mixed pollen samples constitutes a suitable alternative to conventional pollen identification via light microscopy. Current approaches however have limitations in practicability due to low sample throughput and/or inefficient processing methods, e.g. separate steps for amplification and sample indexing.ResultsWe thus developed a new primer-adapter design for high throughput sequencing with the Illumina technology that remedies these issues. It uses a dual-indexing strategy, where sample-specific combinations of forward and reverse identifiers attached to the barcode marker allow high sample throughput with a single sequencing run. It does not require further adapter ligation steps after amplification. We applied this protocol to 384 pollen samples collected by solitary bees and sequenced all samples together on a single Illumina MiSeq v2 flow cell. According to rarefaction curves, 2,000–3,000 high quality reads per sample were sufficient to assess the complete diversity of 95% of the samples. We were able to detect 650 different plant taxa in total, of which 95% were classified at the species level. Together with the laboratory protocol, we also present an update of the reference database used by the classifier software, which increases the total number of covered global plant species included in the database from 37,403 to 72,325 (93% increase).ConclusionsThis study thus offers improvements for the laboratory and bioinformatical workflow to existing approaches regarding data quantity and quality as well as processing effort and cost-effectiveness. Although only tested for pollen samples, it is furthermore applicable to other research questions requiring plant identification in mixed and challenging samples.

[1]  M. Lascoux,et al.  Ancient DNA from pollen: a genetic record of population history in Scots pine , 2005, Molecular ecology.

[2]  R. Knight,et al.  The influence of sex, handedness, and washing on the diversity of hand surface bacteria , 2008, Proceedings of the National Academy of Sciences.

[3]  W. Meek,et al.  Assessing the value of annual and perennial forage mixtures for bumblebees by direct observation and pollen analysis , 2006 .

[4]  A. Schwabe,et al.  Analysis of pollen loads in a wild bee community (Hymenoptera: Apidae) — a method for elucidating habitat use and foraging distances , 2008, Apidologie.

[5]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[6]  R. Tipping,et al.  Sensing small-scale human activity in the palaeoecological record: fine spatial resolution pollen analyses from Glen Affric, northern Scotland , 2004 .

[7]  J. T. Dunnen,et al.  Efficient and sensitive identification and quantification of airborne pollen using next‐generation DNA sequencing , 2015, Molecular ecology resources.

[8]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[9]  P. Taberlet,et al.  New perspectives in diet analysis based on DNA barcoding and parallel pyrosequencing: the trnL approach , 2009, Molecular ecology resources.

[10]  Pierre Taberlet,et al.  Analysing diet of small herbivores: the efficiency of DNA barcoding coupled with high-throughput pyrosequencing for deciphering the composition of complex plant mixtures , 2009, Frontiers in Zoology.

[11]  Achim Gathmann,et al.  Foraging ranges of solitary bees , 2002 .

[12]  J. Tiedje,et al.  Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy , 2007, Applied and Environmental Microbiology.

[13]  D. Holway,et al.  Pollen foraging behaviour of solitary Hawaiian bees revealed through molecular pollen analysis , 2010, Molecular ecology.

[14]  Gladys K. Andino,et al.  Multiple Routes of Pesticide Exposure for Honey Bees Living Near Agricultural Fields , 2012, PloS one.

[15]  S. Dorn,et al.  Host recognition in a pollen-specialist bee: evidence for a genetic basis , 2008, Apidologie.

[16]  Ting Gao,et al.  Validation of the ITS2 Region as a Novel DNA Barcode for Identifying Medicinal Plant Species , 2010, PloS one.

[17]  N. Koeniger,et al.  Comparison of pollen spectra collected by four different subspecies of the honey bee Apis mellifera , 2007, Apidologie.

[18]  Robert C. Edgar,et al.  UPARSE: highly accurate OTU sequences from microbial amplicon reads , 2013, Nature Methods.

[19]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[20]  Sarah L. Westcott,et al.  Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform , 2013, Applied and Environmental Microbiology.

[21]  T. White Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics , 1990 .

[22]  Felix Gugerli,et al.  Ancient plant DNA: review and prospects. , 2005, The New phytologist.

[23]  N. Williams,et al.  Resource distributions among habitats determine solitary bee offspring production in a mosaic landscape. , 2007, Ecological applications : a publication of the Ecological Society of America.

[24]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[25]  S. Primrose,et al.  Food forensics: using DNA technology to combat misdescription and fraud. , 2004, Trends in biotechnology.

[26]  Alexander Keller,et al.  The ITS2 Database III—sequences and structures for phylogeny , 2009, Nucleic Acids Res..

[27]  Susan Holmes,et al.  phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data , 2013, PloS one.

[28]  I. Steffan‐Dewenter,et al.  Evaluating multiplexed next-generation sequencing as a method in palynology for mixed pollen samples. , 2015, Plant biology.

[29]  Thomas J. White,et al.  PCR protocols: a guide to methods and applications. , 1990 .

[30]  P. Dixon VEGAN, a package of R functions for community ecology , 2003 .

[31]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[32]  Keith Bennett,et al.  DNA from pollen: principles and potential , 2006 .

[33]  Andrea Galimberti,et al.  A DNA Barcoding Approach to Characterize Pollen Collected by Honeybees , 2014, PloS one.

[34]  Zhiyong Lu,et al.  Database resources of the National Center for Biotechnology Information , 2010, Nucleic Acids Res..

[35]  P. Taberlet,et al.  DNA Barcoding for Honey Biodiversity , 2010 .

[36]  Douglas B. Sponsler,et al.  Application of ITS2 metabarcoding to determine the provenance of pollen collected by honey bees in an agroecosystem , 2015, Applications in plant sciences.

[37]  A. Galimberti,et al.  A DNA barcoding approach to identify plant species in multiflower honey. , 2015, Food chemistry.

[38]  Thomas Dandekar,et al.  5.8S-28S rRNA interaction and HMM-based ITS2 annotation. , 2009, Gene.

[39]  H. Behling,et al.  Late Quaternary Araucaria forest, grassland (Campos), fire and climate dynamics, studied by high-resolution pollen, charcoal and multivariate analysis of the Cambará do Sul core in southern Brazil , 2004 .