Targeted multiplex next‐generation sequencing: advances in techniques of mitochondrial and nuclear DNA sequencing for population genomics

Next‐generation sequencing (NGS) is emerging as an efficient and cost‐effective tool in population genomic analyses of nonmodel organisms, allowing simultaneous resequencing of many regions of multi‐genomic DNA from multiplexed samples. Here, we detail our synthesis of protocols for targeted resequencing of mitochondrial and nuclear loci by generating indexed genomic libraries for multiplexing up to 100 individuals in a single sequencing pool, and then enriching the pooled library using custom DNA capture arrays. Our use of DNA sequence from one species to capture and enrich the sequencing libraries of another species (i.e. cross‐species DNA capture) indicates that efficient enrichment occurs when sequences are up to about 12% divergent, allowing us to take advantage of genomic information in one species to sequence orthologous regions in related species. In addition to a complete mitochondrial genome on each array, we have included between 43 and 118 nuclear loci for low‐coverage sequencing of between 18 kb and 87 kb of DNA sequence per individual for single nucleotide polymorphisms discovery from 50 to 100 individuals in a single sequencing lane. Using this method, we have generated a total of over 500 whole mitochondrial genomes from seven cetacean species and green sea turtles. The greater variation detected in mitogenomes relative to short mtDNA sequences is helping to resolve genetic structure ranging from geographic to species‐level differences. These NGS and analysis techniques have allowed for simultaneous population genomic studies of mtDNA and nDNA with greater genomic coverage and phylogeographic resolution than has previously been possible in marine mammals and turtles.

[1]  I. Ishiyama,et al.  [Water-soluble eumelanin as a PCR-inhibitor and a simple method for its removal]. , 1993, Nihon hoigaku zasshi = The Japanese journal of legal medicine.

[2]  Yaping Zhang,et al.  Mitogenomic analysis of Chinese snub-nosed monkeys: Evidence of positive selection in NADH dehydrogenase genes in high-altitude adaptation. , 2011, Mitochondrion.

[3]  Jesse Dabney,et al.  Length and GC-biases during sequencing library amplification: a comparison of various polymerase-buffer systems with ancient and modern DNA sequencing libraries. , 2012, BioTechniques.

[4]  S. Mesnick,et al.  Characterization of 18 SNP markers for sperm whale (Physeter macrocephalus) , 2007 .

[5]  Zhenyu Xuan,et al.  Hybrid selection of discrete genomic intervals on custom-designed microarrays for massively parallel sequencing , 2009, Nature Protocols.

[6]  Jose V. Lopez,et al.  Numt, a recent transfer and tandem amplification of mitochondrial DNA to the nuclear genome of the domestic cat , 1994, Journal of Molecular Evolution.

[7]  R. Mueller Evolutionary rates, divergence dates, and the performance of mitochondrial genes in Bayesian phylogenetic analysis. , 2006, Systematic biology.

[8]  K. Bjorndal,et al.  Mitogenomic sequences better resolve stock structure of southern Greater Caribbean green turtle rookeries , 2012, Molecular ecology.

[9]  Hiroyuki Toh,et al.  Improvement in the accuracy of multiple sequence alignment program MAFFT. , 2005, Genome informatics. International Conference on Genome Informatics.

[10]  N. Aitken,et al.  Single nucleotide polymorphism (SNP) discovery in mammals: a targeted‐gene approach , 2004, Molecular ecology.

[11]  Elaine R Mardis,et al.  Direct genomic selection , 2005, Nature Methods.

[12]  A. Guggisberg,et al.  RAD in the realm of next‐generation sequencing technologies , 2011, Molecular ecology.

[13]  S. Pääbo,et al.  Multiplexed DNA Sequence Capture of Mitochondrial Genomes Using PCR Products , 2010, PloS one.

[14]  R. W. Baird,et al.  Sperm whale population structure in the eastern and central North Pacific inferred by the use of single‐nucleotide polymorphisms, microsatellites and mitochondrial DNA , 2011, Molecular ecology resources.

[15]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[16]  P. Dutton,et al.  Marine turtle mitogenome phylogenetics and evolution. , 2012, Molecular phylogenetics and evolution.

[17]  P. Dutton,et al.  Molecular phylogeny for marine turtles based on sequences of the ND4-leucine tRNA and control regions of mitochondrial DNA. , 1996, Molecular phylogenetics and evolution.

[18]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[19]  D. Hartl,et al.  Mitochondrial pseudogenes: evolution's misplaced witnesses. , 2001, Trends in ecology & evolution.

[20]  Sudhir Kumar,et al.  Evolution of modern birds revealed by mitogenomics: timing the radiation and origin of major orders. , 2011, Molecular biology and evolution.

[21]  P. Morin,et al.  Comparative mitochondrial and nuclear quantitative PCR of historical marine mammal tissue, bone, baleen, and tooth samples , 2007 .

[22]  B. Venkatesh,et al.  Evolutionary origin and phylogeny of the modern holocephalans (Chondrichthyes: Chimaeriformes): a mitogenomic perspective. , 2010, Molecular biology and evolution.

[23]  James Taylor,et al.  Next-generation sequencing data interpretation: enhancing reproducibility and accessibility , 2012, Nature Reviews Genetics.

[24]  H. Hoekstra,et al.  Double Digest RADseq: An Inexpensive Method for De Novo SNP Discovery and Genotyping in Model and Non-Model Species , 2012, PloS one.

[25]  P. Etter,et al.  Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers , 2008, PloS one.

[26]  B. Wielstra,et al.  Unraveling the rapid radiation of crested newts (Triturus cristatus superspecies) using complete mitogenomic sequences , 2011, BMC Evolutionary Biology.

[27]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[28]  A. Lemmon,et al.  Anchored hybrid enrichment for massively high-throughput phylogenomics. , 2012, Systematic biology.

[29]  J. Seeb,et al.  The genetic population structure of lacustrine sockeye salmon, Oncorhynchus nerka, in Japan as the endangered species , 2011, Environmental Biology of Fishes.

[30]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[31]  Frédéric Delsuc,et al.  Tunicate mitogenomics and phylogenetics: peculiarities of the Herdmania momus mitochondrial genome and support for the new chordate phylogeny , 2009, BMC Genomics.

[32]  Steven L Salzberg,et al.  Complete Columbian mammoth mitogenome suggests interbreeding with woolly mammoths , 2011, Genome Biology.

[33]  Joshua S. Paul,et al.  Genotype and SNP calling from next-generation sequencing data , 2011, Nature Reviews Genetics.

[34]  J. George,et al.  Empirical comparison of single nucleotide polymorphisms and microsatellites for population and demographic analyses of bowhead whales , 2012 .

[35]  Aakrosh Ratan,et al.  Intraspecific phylogenetic analysis of Siberian woolly mammoths using complete mitochondrial genomes , 2008, Proceedings of the National Academy of Sciences.

[36]  Shirley A. Miller,et al.  A simple salting out procedure for extracting DNA from human nucleated cells. , 1988, Nucleic acids research.

[37]  S. Ho,et al.  Mitogenomic phylogenetic analyses of the Delphinidae with an emphasis on the Globicephalinae , 2011, BMC Evolutionary Biology.

[38]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[39]  Travis C Glenn,et al.  Ultraconserved elements anchor thousands of genetic markers spanning multiple evolutionary timescales. , 2012, Systematic biology.

[40]  D. Reich,et al.  Cost-effective, high-throughput DNA sequencing libraries for multiplexed target capture , 2012, Genome research.

[41]  A. Liston,et al.  Mitochondrial genome sequences illuminate maternal lineages of conservation concern in a rare carnivore , 2011, BMC Ecology.

[42]  J. Maguire,et al.  Solution Hybrid Selection with Ultra-long Oligonucleotides for Massively Parallel Targeted Sequencing , 2009, Nature Biotechnology.

[43]  P. Wade,et al.  Positive selection on the killer whale mitogenome , 2011, Biology Letters.

[44]  Matthias Meyer,et al.  Illumina sequencing library preparation for highly multiplexed target capture and sequencing. , 2010, Cold Spring Harbor protocols.

[45]  Feng Chen,et al.  Sequencing and Analysis of Neanderthal Genomic DNA , 2006, Science.

[46]  M. Arnegard,et al.  Remarkable morphological stasis in an extant vertebrate despite tens of millions of years of divergence , 2011, Proceedings of the Royal Society B: Biological Sciences.

[47]  L. Seeb,et al.  Number of Alleles as a Predictor of the Relative Assignment Accuracy of Short Tandem Repeat (STR) and Single‐Nucleotide‐Polymorphism (SNP) Baselines for Chum Salmon , 2008 .

[48]  S. Palumbi,et al.  Big and slow: phylogenetic estimates of molecular evolution in baleen whales (suborder mysticeti). , 2009, Molecular biology and evolution.

[49]  Martin Kircher,et al.  Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform , 2011, Nucleic acids research.

[50]  Dennis C. Friedrich,et al.  A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries , 2011, Genome Biology.

[51]  Michael A. Thomas,et al.  Complete mitochondrial genome phylogeographic analysis of killer whales (Orcinus orca) indicates multiple species. , 2010, Genome research.

[52]  M. Hofreiter,et al.  Mitochondrial Genomes Reveal Slow Rates of Molecular Evolution and the Timing of Speciation in Beavers (Castor), One of the Largest Rodent Species , 2011, PloS one.

[53]  Beaked whale phylogeography SC / 64 / SM 14 1 Preliminary analysis of mitochondrial genome phylogeography of Blainville ’ s , Cuvier ’ s and Gervais ’ beaked whales , 2012 .

[54]  A. Amores,et al.  Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers. , 2007, Genome research.

[55]  Julia T. Vilstrup,et al.  Mitogenome Phylogenetics: The Impact of Using Single Regions and Partitioning Schemes on Topology, Substitution Rate and Divergence Time Estimation , 2011, PloS one.

[56]  H. Urlaub,et al.  Supplementary Table 3 , 2011 .

[57]  P. Morin,et al.  Highly accurate SNP genotyping from historical and low‐quality samples , 2007 .

[58]  Characterization of single nucleotide polymorphism markers for the green sea turtle (Chelonia mydas) , 2009, Molecular ecology resources.

[59]  B. Taylor,et al.  Assessing statistical power of SNPs for population structure and conservation studies , 2009, Molecular ecology resources.

[60]  M. Miya,et al.  Multiple Invasions into Freshwater by Pufferfishes (Teleostei: Tetraodontidae): A Mitogenomic Perspective , 2011, PloS one.

[61]  P. Donnelly,et al.  Genomic Tools for Evolution and Conservation in the Chimpanzee: Pan troglodytes ellioti Is a Genetically Distinct Population , 2012, PLoS genetics.

[62]  B. May,et al.  Application of a method for estimating effective population size and admixture using diagnostic single nucleotide polymorphisms (SNPs): implications for conservation of threatened Paiute cutthroat trout (Oncorhynchus clarkii seleniris) in Silver King Creek, California , 2011 .

[63]  J. George,et al.  SNPs for bowhead whale population genetics SC / 64 / AWMP 3 1 An empirical comparison of SNPs and microsatellites for population structure , assignment , and demographic analyses of bowhead whale populations , 2012 .