Targeted gene enrichment and high‐throughput sequencing for environmental biomonitoring: a case study using freshwater macroinvertebrates

Recent studies have advocated biomonitoring using DNA techniques. In this study, two high‐throughput sequencing (HTS)‐based methods were evaluated: amplicon metabarcoding of the cytochrome C oxidase subunit I (COI) mitochondrial gene and gene enrichment using MYbaits (targeting nine different genes including COI). The gene‐enrichment method does not require PCR amplification and thus avoids biases associated with universal primers. Macroinvertebrate samples were collected from 12 New Zealand rivers. Macroinvertebrates were morphologically identified and enumerated, and their biomass determined. DNA was extracted from all macroinvertebrate samples and HTS undertaken using the illumina miseq platform. Macroinvertebrate communities were characterized from sequence data using either six genes (three of the original nine were not used) or just the COI gene in isolation. The gene‐enrichment method (all genes) detected the highest number of taxa and obtained the strongest Spearman rank correlations between the number of sequence reads, abundance and biomass in 67% of the samples. Median detection rates across rare (<1% of the total abundance or biomass), moderately abundant (1–5%) and highly abundant (>5%) taxa were highest using the gene‐enrichment method (all genes). Our data indicated primer biases occurred during amplicon metabarcoding with greater than 80% of sequence reads originating from one taxon in several samples. The accuracy and sensitivity of both HTS methods would be improved with more comprehensive reference sequence databases. The data from this study illustrate the challenges of using PCR amplification‐based methods for biomonitoring and highlight the potential benefits of using approaches, such as gene enrichment, which circumvent the need for an initial PCR step.

[1]  X. Pochon,et al.  Accurate assessment of the impact of salmon farming on benthic sediment enrichment using foraminiferal metabarcoding. , 2015, Marine pollution bulletin.

[2]  J. Spouge,et al.  CBOL Protist Working Group: Barcoding Eukaryotic Richness beyond the Animal, Plant, and Fungal Kingdoms , 2012, PLoS biology.

[3]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[4]  Mehrdad Hajibabaei,et al.  Next‐generation sequencing technologies for environmental DNA research , 2012, Molecular ecology.

[5]  P. Bork,et al.  Eukaryotic plankton diversity in the sunlit ocean , 2015, Science.

[6]  B. Deagle,et al.  Quantifying sequence proportions in a DNA‐based diet study using Ion Torrent amplicon sequencing: which counts count? , 2013, Molecular ecology resources.

[7]  Mehrdad Hajibabaei,et al.  Assessing biodiversity of a freshwater benthic macroinvertebrate community through non-destructive environmental barcoding of DNA from preservative ethanol , 2012, BMC Ecology.

[8]  François Pompanon,et al.  DNA metabarcoding and the cytochrome c oxidase subunit I marker: not a perfect match , 2014, Biology Letters.

[9]  Shane S. Sturrock,et al.  Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data , 2012, Bioinform..

[10]  T. Dallman,et al.  Performance comparison of benchtop high-throughput sequencing platforms , 2012, Nature Biotechnology.

[11]  R. Daniel,et al.  Metagenomic Analyses: Past and Future Trends , 2010, Applied and Environmental Microbiology.

[12]  V. Ranwez,et al.  A new versatile primer set targeting a short fragment of the mitochondrial COI region for metabarcoding metazoan diversity: application for characterizing coral reef fish gut contents , 2013, Frontiers in Zoology.

[13]  S. Pääbo,et al.  Multiplexed DNA Sequence Capture of Mitochondrial Genomes Using PCR Products , 2010, PloS one.

[14]  C. Veltman,et al.  Predicting dry weight of New Zealand aquatic macroinvertebrates from linear dimensions , 1994 .

[15]  L. Weyrich,et al.  Environmental metabarcodes for insects: in silico PCR reveals potential for taxonomic bias , 2014, Molecular ecology resources.

[16]  D. Baird,et al.  Environmental Barcoding: A Next-Generation Sequencing Approach for Biomonitoring Applications Using River Benthos , 2011, PloS one.

[17]  Susanne Horn,et al.  Target enrichment via DNA hybridization capture. , 2012, Methods in molecular biology.

[18]  N. Blow Genomics: catch me if you can , 2009, Nature Methods.

[19]  I. Duggan,et al.  Do rotifers have potential as bioindicators of lake trophic state? , 2001 .

[20]  Travis C. Glenn,et al.  A Phylogeny of Birds Based on Over 1,500 Loci Collected by Target Enrichment and High-Throughput Sequencing , 2012, PloS one.

[21]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[22]  M. Winterbourn,et al.  Guide to the aquatic insects of New Zealand. 3rd edition. , 1981 .

[23]  Douglas W. Yu,et al.  Using metabarcoding to ask if easily collected soil and leaf-litter samples can be used as a general biodiversity indicator , 2014 .

[24]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[25]  P. Taberlet,et al.  Using next‐generation sequencing for molecular reconstruction of past Arctic vegetation and climate , 2010, Molecular ecology resources.

[26]  Timothy S. Newman,et al.  Performance Comparison , 2021, Satellite Formation Flying.

[27]  R. Henrik Nilsson,et al.  Taxonomic Reliability of DNA Sequences in Public Sequence Databases: A Fungal Perspective , 2006, PloS one.

[28]  Jing Wang,et al.  Environmental bio-monitoring with high-throughput sequencing , 2013, Briefings Bioinform..

[29]  Louis A. Tremblay,et al.  Molecular genetic tools for environmental monitoring of New Zealand's aquatic habitats, past, present and the future , 2013 .

[30]  Margaret R. Caldwell,et al.  Harnessing DNA to improve environmental management , 2014, Science.

[31]  Mehrdad Hajibabaei,et al.  Simultaneous assessment of the macrobiome and microbiome in a bulk sample of tropical arthropods through DNA metasystematics , 2014, Proceedings of the National Academy of Sciences.

[32]  Alain Franc,et al.  A Next-Generation Sequencing Approach to River Biomonitoring Using Benthic Diatoms , 2014, Freshwater Science.

[33]  P. Miller,et al.  Does DNA Barcoding Improve Performance of Traditional Stream Bioassessment Metrics? , 2013, Freshwater Science.

[34]  P. Taberlet,et al.  DNA metabarcoding multiplexing and validation of data accuracy for diet assessment: application to omnivorous diet , 2014, Molecular ecology resources.

[35]  Anastasija Zaiko,et al.  Metabarcoding approach for the ballast water surveillance--an advantageous solution or an awkward challenge? , 2015, Marine pollution bulletin.

[36]  R. Giblin-Davis,et al.  Reproducibility of read numbers in high‐throughput sequencing analysis of nematode community composition and structure , 2009, Molecular ecology resources.

[37]  Qing Yang,et al.  Ultra-deep sequencing enables high-fidelity recovery of biodiversity for bulk arthropod samples without PCR amplification , 2013, GigaScience.

[38]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[39]  Anastasija Zaiko,et al.  Early detection of eukaryotic communities from marine biofilm using high-throughput sequencing: an assessment of different sampling devices , 2015, Biofouling.

[40]  M. Winterbourn The freshwater insects of Australasia and their affinities , 1980 .

[41]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[42]  Marcus J. Claesson,et al.  Comparison of two next-generation sequencing technologies for resolving highly complex microbiota composition using tandem variable 16S rRNA gene regions , 2010, Nucleic acids research.

[43]  L. Orlando,et al.  Meta‐barcoding of ‘dirt’ DNA from soil reflects vertebrate biodiversity , 2012, Molecular ecology.

[44]  Steven R. Head,et al.  Next-generation sequencing , 2010, Nature Reviews Drug Discovery.

[45]  Philippe Esling,et al.  Environmental monitoring through protist next‐generation sequencing metabarcoding: assessing the impact of fish farming on benthic foraminifera communities , 2014, Molecular ecology resources.

[46]  Eske Willerslev,et al.  Environmental DNA - An emerging tool in conservation for monitoring past and present biodiversity , 2015 .

[47]  Matthew J. Colloff,et al.  Ecological assessment of estuarine sediments by pyrosequencing eukaryotic ribosomal DNA , 2010 .

[48]  V. Pettigrove,et al.  Molecular identification of Chironomus spp. (Diptera) for biomonitoring of aquatic ecosystems , 2004 .

[49]  Douglas W. Yu,et al.  Reliable, verifiable and efficient monitoring of biodiversity via metabarcoding. , 2013, Ecology letters.

[50]  X. Pochon,et al.  Evaluating Detection Limits of Next-Generation Sequencing for the Surveillance and Monitoring of International Marine Pests , 2013, PloS one.

[51]  I. Hodkinson,et al.  Terrestrial and Aquatic Invertebrates as Bioindicators for Environmental Monitoring, with Particular Reference to Mountain Ecosystems , 2005, Environmental management.

[52]  R. Giblin-Davis,et al.  Ultrasequencing of the meiofaunal biosphere: practice, pitfalls and promises , 2010, Molecular ecology.

[53]  Atte Moilanen,et al.  Genetic diversity in widespread species is not congruent with species richness in alpine plant communities. , 2012, Ecology letters.

[54]  Mehrdad Hajibabaei,et al.  Biomonitoring 2.0: a new paradigm in ecosystem assessment made possible by next‐generation DNA sequencing , 2012, Molecular ecology.

[55]  P. J. van der Zaag,et al.  Targeted enrichment of genomic DNA regions for next-generation sequencing , 2011, Briefings in functional genomics.

[56]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[57]  B. W. Sweeney,et al.  Can DNA barcodes of stream macroinvertebrates improve descriptions of community structure and water quality? , 2011, Journal of the North American Benthological Society.

[58]  Holly M. Bik,et al.  Sequencing our way towards understanding global eukaryotic biodiversity. , 2012, Trends in ecology & evolution.

[59]  Xavier Pochon,et al.  Assessing the effects of salmon farming seabed enrichment using bacterial community diversity and high-throughput sequencing. , 2015, FEMS microbiology ecology.

[60]  Jacob A. Esselstyn,et al.  The Challenges of Resolving a Rapid, Recent Radiation: Empirical and Simulated Phylogenomics of Philippine Shrews. , 2015, Systematic biology.

[61]  Philippe Esling,et al.  Environmental Monitoring: Inferring the Diatom Index from Next-Generation Sequencing Data. , 2015, Environmental science & technology.

[62]  W. Richard McCombie,et al.  High-Throughput Sequencing , 2011 .

[63]  John D. Stark,et al.  Performance of the macroinvertebrate community index: effects of sampling method, sample replication, water depth, current velocity, and substratum on index values , 1993 .

[64]  R. Vrijenhoek,et al.  DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. , 1994, Molecular marine biology and biotechnology.

[65]  P. Somerfield,et al.  Next Generation Sequencing Reveals the Hidden Diversity of Zooplankton Assemblages , 2013, PloS one.

[66]  M. Lafont,et al.  Molecular Barcoding of Aquatic Oligochaetes: Implications for Biomonitoring , 2015, PloS one.

[67]  Adrian W. Briggs,et al.  Primer Extension Capture: Targeted Sequence Retrieval from Heavily Degraded DNA Sources , 2009, Journal of visualized experiments : JoVE.

[68]  T. Fleituch,et al.  Macroinvertebrates as indicators of water quality in rivers: a scientific basis for Polish standard method , 2002 .

[69]  I. Landscape Freshwater Biodiversity in the , 2013 .

[70]  Patrick J. Biggs,et al.  SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data , 2010, BMC Bioinformatics.

[71]  K. Katoh,et al.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. , 2002, Nucleic acids research.

[72]  S. Wood,et al.  Successional Change in Microbial Communities of Benthic Phormidium-Dominated Biofilms , 2014, Microbial Ecology.

[73]  Philippe Esling,et al.  High-throughput sequencing and morphology perform equally well for benthic monitoring of marine ecosystems , 2015, Scientific Reports.

[74]  Gareth Jones,et al.  Taxon‐specific PCR for DNA barcoding arthropod prey in bat faeces , 2011, Molecular ecology resources.

[75]  J. Landry,et al.  A universal DNA mini-barcode for biodiversity analysis , 2008, BMC Genomics.

[76]  Andrea Buffagni,et al.  The AQEM Multimetric System for the Southern Italian Apennines: Assessing the Impact of Water Quality and Habitat Degradation on Pool Macroinvertebrates in Mediterranean Rivers , 2004 .

[77]  J. Geller,et al.  Redesign of PCR primers for mitochondrial cytochrome c oxidase subunit I for marine invertebrates and application in all‐taxa biotic surveys , 2013, Molecular ecology resources.

[78]  N. Baeshen,et al.  Biological Identifications Through DNA Barcodes , 2012 .

[79]  D. Janzen,et al.  Ten species in one: DNA barcoding reveals cryptic species in the neotropical skipper butterfly Astraptes fulgerator. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[80]  B. Chessman,et al.  New sensitivity grades for Australian river macroinvertebrates , 2003 .

[81]  J. Stark,et al.  A biotic index for New Zealand's soft‐bottomed streams , 2007 .

[82]  B. Deagle,et al.  Analysis of Australian fur seal diet by pyrosequencing prey DNA in faeces , 2009, Molecular ecology.

[83]  A. Buffagni,et al.  The AQEM multimetric system for the southern Italian Apennines: assessing the impact of water quality and habitat degradation on pool macroinvertebrates in Mediterranean rivers , 2004, Hydrobiologia.

[84]  R. Cooper,et al.  Estimation of insect biomass by length and width , 1993 .