Mitochondrial capture enriches mito‐DNA 100 fold, enabling PCR‐free mitogenomics biodiversity analysis

Biodiversity analyses based on next‐generation sequencing (NGS) platforms have developed by leaps and bounds in recent years. A PCR‐free strategy, which can alleviate taxonomic bias, was considered as a promising approach to delivering reliable species compositions of targeted environments. The major impediment of such a method is the lack of appropriate mitochondrial DNA enrichment ways. Because mitochondrial genomes (mitogenomes) make up only a small proportion of total DNA, PCR‐free methods will inevitably result in a huge excess of data (>99%). Furthermore, the massive volume of sequence data is highly demanding on computing resources. Here, we present a mitogenome enrichment pipeline via a gene capture chip that was designed by virtue of the mitogenome sequences of the 1000 Insect Transcriptome Evolution project (1KITE, www.1kite.org). A mock sample containing 49 species was used to evaluate the efficiency of the mitogenome capture method. We demonstrate that the proportion of mitochondrial DNA can be increased by approximately 100‐fold (from the original 0.47% to 42.52%). Variation in phylogenetic distances of target taxa to the probe set could in principle result in bias in abundance. However, the frequencies of input taxa were largely maintained after capture (R2 = 0.81). We suggest that our mitogenome capture approach coupled with PCR‐free shotgun sequencing could provide ecological researchers an efficient NGS method to deliver reliable biodiversity assessment.

[1]  A. Vogler,et al.  Validating the power of mitochondrial metagenomics for community ecology and phylogenetics of complex assemblages , 2015 .

[2]  Guanliang Meng,et al.  High‐throughput monitoring of wild bee diversity and abundance via mitogenomics , 2015, Methods in ecology and evolution.

[3]  Thomas K. F. Wong,et al.  Phylogenomics resolves the timing and pattern of insect evolution , 2014, Science.

[4]  Aibing Zhang,et al.  Multiplex sequencing of pooled mitochondrial genomes—a crucial step toward biodiversity analysis using mito-metagenomics , 2014, Nucleic acids research.

[5]  A. Vogler,et al.  Bulk De Novo Mitogenome Assembly from Pooled Total DNA Elucidates the Phylogeny of Weevils (Coleoptera: Curculionoidea) , 2014, Molecular biology and evolution.

[6]  Jonathan E. Allen,et al.  Ancient pathogen DNA in archaeological samples detected with a Microbial Detection Array , 2014, Scientific Reports.

[7]  S. Cameron Insect mitochondrial genomics: implications for evolution and phylogeny. , 2014, Annual review of entomology.

[8]  Xun Xu,et al.  SOAPdenovo-Trans: de novo transcriptome assembly with short RNA-Seq reads , 2013, Bioinform..

[9]  P. Foster,et al.  The complete mitochondrial genome of a turbinid vetigastropod from MiSeq Illumina sequencing of genomic DNA and steps towards a resolved gastropod phylogeny. , 2014, Gene.

[10]  Qing Yang,et al.  SOAPBarcode: revealing arthropod biodiversity through assembly of Illumina shotgun sequences of PCR amplicons , 2013 .

[11]  Douglas W. Yu,et al.  Reliable, verifiable and efficient monitoring of biodiversity via metabarcoding. , 2013, Ecology letters.

[12]  M. Hofreiter,et al.  Capturing protein-coding genes across highly divergent species. , 2013, BioTechniques.

[13]  Nimrod D. Rubinstein,et al.  Deep Sequencing of Mixed Total DNA without Barcodes Allows Efficient Assembly of Highly Plastic Ascidian Mitochondrial Genomes , 2013, Genome biology and evolution.

[14]  Qing Yang,et al.  Ultra-deep sequencing enables high-fidelity recovery of biodiversity for bulk arthropod samples without PCR amplification , 2013, GigaScience.

[15]  V. Savolainen,et al.  Next-Generation Museomics Disentangles One of the Largest Primate Radiations , 2013, Systematic biology.

[16]  P. Dutton,et al.  Targeted multiplex next‐generation sequencing: advances in techniques of mitochondrial and nuclear DNA sequencing for population genomics , 2013, Molecular ecology resources.

[17]  Jian Wang,et al.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler , 2012, GigaScience.

[18]  A. Lemmon,et al.  Anchored hybrid enrichment for massively high-throughput phylogenomics. , 2012, Systematic biology.

[19]  Douglas W. Yu,et al.  Biodiversity soup: metabarcoding of arthropods for rapid biodiversity assessment and biomonitoring , 2012 .

[20]  Siu-Ming Yiu,et al.  IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth , 2012, Bioinform..

[21]  Eric Coissac,et al.  Bioinformatic challenges for DNA metabarcoding of plants and animals , 2012, Molecular ecology.

[22]  P. Taberlet,et al.  Towards next‐generation biodiversity assessment using DNA metabarcoding , 2012, Molecular ecology.

[23]  P. Taberlet,et al.  Tracking earthworm communities from soil DNA , 2012, Molecular ecology.

[24]  Mehrdad Hajibabaei,et al.  Biomonitoring 2.0: a new paradigm in ecosystem assessment made possible by next‐generation DNA sequencing , 2012, Molecular ecology.

[25]  Olivier David,et al.  Coalescent-Based DNA Barcoding: Multilocus Analysis and Robustness , 2012, J. Comput. Biol..

[26]  W. Pirovano,et al.  The complete mitogenome of Cylindrus obtusus (Helicidae, Ariantinae) using Illumina next generation sequencing , 2012, BMC Genomics.

[27]  J. Shendure,et al.  Exome sequencing as a tool for Mendelian disease gene discovery , 2011, Nature Reviews Genetics.

[28]  W. Murphy,et al.  Efficient cross-species capture hybridization and next-generation sequencing of mitochondrial genomes from noninvasively sampled museum specimens. , 2011, Genome research.

[29]  Alain Viari,et al.  ecoPrimers: inference of new DNA barcode markers from whole genome sequence analysis , 2011, Nucleic acids research.

[30]  E. Vallender Expanding whole exome resequencing into non-human primates , 2011, Genome Biology.

[31]  D. Baird,et al.  Environmental Barcoding: A Next-Generation Sequencing Approach for Biomonitoring Applications Using River Benthos , 2011, PloS one.

[32]  Greg Stuart-Hill,et al.  Effect of biodiversity on economic benefits from communal lands in Namibia , 2011 .

[33]  Kate E. Jones,et al.  Impacts of biodiversity on the emergence and transmission of infectious diseases , 2010, Nature.

[34]  A. Vogler,et al.  Why barcode? High-throughput multiplex sequencing of mitochondrial genomes for molecular systematics , 2010, Nucleic acids research.

[35]  Philip L. F. Johnson,et al.  Targeted Investigation of the Neandertal Genome by Array-Based Sequence Capture , 2010, Science.

[36]  R. Giblin-Davis,et al.  Reproducibility of read numbers in high‐throughput sequencing analysis of nematode community composition and structure , 2009, Molecular ecology resources.

[37]  Abraham E. Tucker,et al.  Evaluating high‐throughput sequencing as a method for metagenomic analysis of nematode diversity , 2009, Molecular ecology resources.

[38]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[39]  Sophie Lemoine,et al.  An evaluation of custom microarray applications: the oligonucleotide design challenge , 2009, Nucleic acids research.

[40]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[41]  Jeremy R. deWaard,et al.  An inexpensive, automation-friendly protocol for recovering high-quality DNA , 2006 .

[42]  Mehrdad Hajibabaei,et al.  A minimalist barcode can identify a specimen whose DNA is degraded , 2006 .

[43]  W. Reid,et al.  Millennium Ecosystem Assessment , 2005 .

[44]  John Quackenbush,et al.  TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets , 2003, Bioinform..

[45]  M. T. Ahmed Millennium ecosystem assessment , 2002, Environmental science and pollution research international.

[46]  J. SantaLucia,et al.  A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. , 1998, Proceedings of the National Academy of Sciences of the United States of America.