Validation of multiple single nucleotide variation calls by additional exome analysis with a semiconductor sequencer to supplement data of whole-genome sequencing of a human population

BackgroundValidation of single nucleotide variations in whole-genome sequencing is critical for studying disease-related variations in large populations. A combination of different types of next-generation sequencers for analyzing individual genomes may be an efficient means of validating multiple single nucleotide variations calls simultaneously.ResultsHere, we analyzed 12 independent Japanese genomes using two next-generation sequencing platforms: the Illumina HiSeq 2500 platform for whole-genome sequencing (average depth 32.4×), and the Ion Proton semiconductor sequencer for whole exome sequencing (average depth 109×). Single nucleotide polymorphism (SNP) calls based on the Illumina Human Omni 2.5-8 SNP chip data were used as the reference. We compared the variant calls for the 12 samples, and found that the concordance between the two next-generation sequencing platforms varied between 83% and 97%.ConclusionsOur results show the versatility and usefulness of the combination of exome sequencing with whole-genome sequencing in studies of human population genetics and demonstrate that combining data from multiple sequencing platforms is an efficient approach to validate and supplement SNP calls.

[1]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[2]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[3]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[4]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[5]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[6]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[7]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[8]  Masao Nagasaki,et al.  Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing , 2010, Nature Genetics.

[9]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[10]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[11]  Aleksandar Milosavljevic,et al.  An integrative variant analysis suite for whole exome next-generation sequencing data , 2012, BMC Bioinformatics.

[12]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer , 2011, Nature Biotechnology.

[13]  Tom H. Pringle,et al.  Genetic diversity and population structure of the endangered marsupial Sarcophilus harrisii (Tasmanian devil) , 2011, Proceedings of the National Academy of Sciences.

[14]  Hui Jiang,et al.  Comprehensive comparison of three commercial human whole-exome capture platforms , 2011, Genome Biology.

[15]  M. Spector,et al.  A comparative analysis of exome capture , 2011, Genome Biology.

[16]  Muin J Khoury,et al.  Deploying whole genome sequencing in clinical practice and public health: Meeting the challenge one bin at a time , 2011, Genetics in Medicine.

[17]  P. Bork,et al.  Enterotypes of the human gut microbiome , 2011, Nature.

[18]  Hugo Y. K. Lam,et al.  Performance comparison of exome DNA sequencing technologies , 2011, Nature Biotechnology.

[19]  Juliane C. Dohm,et al.  Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and Genome Analyzer systems , 2011, Genome Biology.

[20]  Bernard P. Puc,et al.  An integrated semiconductor device enabling non-optical genome sequencing , 2011, Nature.

[21]  G. Fischer,et al.  Next-Generation Ion Torrent Sequencing of Drug Resistance Mutations in Mycobacterium tuberculosis Strains , 2012, Journal of Clinical Microbiology.

[22]  H. Swerdlow,et al.  A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers , 2012, BMC Genomics.

[23]  W. Dinjens,et al.  Molecular diagnostics of a single multifocal non-small cell lung cancer case using targeted next generation sequencing , 2013, Virchows Archiv.

[24]  J. Lupski,et al.  Human genome sequencing in health and disease. , 2012, Annual review of medicine.

[25]  Puay Hoon Tan,et al.  Development of a next-generation sequencing method for BRCA mutation screening: a comparison between a high-throughput and a benchtop platform. , 2012, The Journal of molecular diagnostics : JMD.

[26]  Jens Stoye,et al.  Bacterial Community Shift in Treated Periodontitis Patients Revealed by Ion Torrent 16S rRNA Gene Amplicon Sequencing , 2012, PloS one.

[27]  A. Elliott,et al.  Rapid detection of the ACMG/ACOG-recommended 23 CFTR disease-causing mutations using ion torrent semiconductor sequencing. , 2012, Journal of biomolecular techniques : JBT.

[28]  N. Lennon,et al.  Characterizing and measuring bias in sequence data , 2013, Genome Biology.

[29]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[30]  T. Dallman,et al.  Performance comparison of benchtop high-throughput sequencing platforms , 2012, Nature Biotechnology.

[31]  Zhen Xuan Yeo,et al.  Improving Indel Detection Specificity of the Ion Torrent PGM Benchtop Sequencer , 2012, PloS one.

[32]  B. Marshall,et al.  Choosing a Benchtop Sequencing Machine to Characterise Helicobacter pylori Genomes , 2013, PloS one.

[33]  W. Miller,et al.  Comparison of Sequencing Platforms for Single Nucleotide Variant Calls in a Human Sample , 2013, PloS one.

[34]  S. Cook,et al.  Towards Clinical Molecular Diagnosis of Inherited Cardiac Conditions: A Comparison of Bench-Top Genome DNA Sequencers , 2013, PloS one.

[35]  D. Goldstein,et al.  Sequencing studies in human genetics: design and interpretation , 2013, Nature Reviews Genetics.

[36]  Ignacio Blanco,et al.  Next-generation sequencing meets genetic diagnostics: development of a comprehensive workflow for the analysis of BRCA1 and BRCA2 genes , 2012, European Journal of Human Genetics.

[37]  Timothy T Harkins,et al.  A Comprehensive Assay for CFTR Mutational Analysis Using Next‐Generation Sequencing , 2013, Clinical chemistry.

[38]  Peng Chen,et al.  Deep whole-genome sequencing of 100 southeast Asian Malays. , 2013, American journal of human genetics.

[39]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[40]  Ashish Choudhary,et al.  Targeted, high-depth, next-generation sequencing of cancer genes in formalin-fixed, paraffin-embedded and fine-needle aspiration tumor specimens. , 2013, The Journal of molecular diagnostics : JMD.

[41]  Rashmi Kanagal-Shamanna,et al.  Clinical validation of a next-generation sequencing screen for mutational hotspots in 46 cancer-related genes. , 2013, The Journal of molecular diagnostics : JMD.

[42]  Philip Hugenholtz,et al.  Shining a Light on Dark Sequencing: Characterising Errors in Ion Torrent PGM Data , 2013, PLoS Comput. Biol..

[43]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[44]  Stephen J. Chanock,et al.  The new sequencer on the block: comparison of Life Technology’s Proton sequencer to an Illumina HiSeq for whole-exome sequencing , 2013, Human Genetics.

[45]  Rashmi Kanagal-Shamanna,et al.  Next-generation sequencing-based multi-gene mutation profiling of solid tumors using fine needle aspiration samples: promises and challenges for routine clinical diagnostics , 2014, Modern Pathology.