Systematic comparison of three genomic enrichment methods for massively parallel DNA sequencing.

Massively parallel DNA sequencing technologies have greatly increased our ability to generate large amounts of sequencing data rapidly. Several methods have been developed to enrich for genomic regions of interest for targeted sequencing. We compared three of these methods: Molecular Inversion Probes (MIP), Solution Hybrid Selection (SHS), and Microarray-based Genomic Selection (MGS). Using HapMap DNA samples, we assessed the ability of each method to capture an identical set of exons and evolutionarily conserved regions associated with 528 genes (2.61 Mb). For sequence analysis, we developed and used a novel Bayesian genotype-assigning algorithm, Most Probable Genotype (MPG). All three capture methods were effective, but their sensitivities (the percentage of targeted bases associated with high-quality genotypes) varied for an equivalent amount of pass-filtered sequence: for example, 70% (MIP), 84% (SHS), and 91% (MGS) for 400 Mb. In contrast, all methods yielded similar accuracies of >99.84% when compared to Infinium 1M SNP BeadChip-derived genotypes and >99.998% when compared to 30-fold coverage whole-genome shotgun sequencing data. We also observed a low false-positive rate with all three methods: of the heterozygous positions identified by each capture method, >99.57% agreed with the 1M SNP BeadChip genotypes, and >98.840% agreed with the whole-genome shotgun data. In addition, we successfully piloted the genomic enrichment of a set of 12 pooled samples via the MGS method using molecular bar codes. We find that these three genomic enrichment methods are highly accurate and practical, with sensitivities comparable to those of 30-fold coverage whole-genome shotgun data.
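
To make the two central quantities concrete, the following is a minimal sketch of a generic Bayesian diploid genotype call and of sensitivity as defined above. It is not the MPG algorithm itself, whose details are not given in this abstract. The function names, the heterozygosity prior (`prior_het`), the uniform base-error model, and the score threshold (`min_score`) are all assumptions chosen for illustration.

```python
import math
from itertools import combinations_with_replacement

BASES = "ACGT"
# The ten unordered diploid genotypes: AA, AC, AG, ..., TT
GENOTYPES = ["".join(g) for g in combinations_with_replacement(BASES, 2)]


def log_posteriors(reads, prior_het=0.001):
    """Unnormalized log posterior of each genotype at one position.

    reads: list of (observed_base, error_probability) pairs.
    prior_het: assumed prior probability that the site is heterozygous
               (hypothetical value, split evenly over the 6 het genotypes).
    """
    result = {}
    for g in GENOTYPES:
        is_het = g[0] != g[1]
        prior = prior_het / 6 if is_het else (1 - prior_het) / 4
        total = math.log(prior)
        for base, err in reads:
            # Each read samples one of the two alleles with probability 1/2;
            # an erroneous read is assumed equally likely to show any other base.
            p = sum(0.5 * ((1 - err) if base == allele else err / 3)
                    for allele in g)
            total += math.log(p)
        result[g] = total
    return result


def call_genotype(reads, min_score=10.0):
    """Return (genotype or None, score).

    score is the log-posterior gap between the best and second-best genotype;
    calls below min_score (an assumed cutoff) are treated as not high quality.
    """
    ranked = sorted(log_posteriors(reads).items(),
                    key=lambda kv: kv[1], reverse=True)
    (best, lp1), (_, lp2) = ranked[0], ranked[1]
    score = lp1 - lp2
    return (best if score >= min_score else None), score


def sensitivity(calls):
    """Percentage of targeted positions that received a high-quality genotype."""
    return 100.0 * sum(c is not None for c in calls) / len(calls)
```

For example, `call_genotype([("A", 0.01)] * 10 + [("G", 0.01)] * 10)` yields a confident heterozygous call, while a single read falls below the assumed cutoff and counts against sensitivity. Under this definition, deeper or more uniform coverage of the target raises sensitivity without necessarily changing accuracy, which matches the pattern reported above.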
