Coming of age: ten years of next-generation sequencing technologies

Since the completion of the human genome project in 2003, extraordinary progress has been made in genome sequencing technologies, which has led to a decreased cost per megabase and an increase in the number and diversity of sequenced genomes. An astonishing complexity of genome architecture has been revealed, bringing these sequencing technologies to even greater advancements. Some approaches maximize the number of bases sequenced in the least amount of time, generating a wealth of data that can be used to understand increasingly complex phenotypes. Alternatively, other approaches now aim to sequence longer contiguous pieces of DNA, which are essential for resolving structurally complex regions. These and other strategies are providing researchers and clinicians a variety of tools to probe genomes in greater depth, leading to an enhanced understanding of how genome sequence variants underlie phenotype and disease.

[1]  F. Crick,et al.  The structure of DNA. , 1953, Cold Spring Harbor symposia on quantitative biology.

[2]  L. Augenlicht,et al.  Cloning and screening of sequences expressed in a mouse colon tumor. , 1982, Cancer research.

[3]  U Landegren,et al.  A ligase-mediated gene detection technique. , 1988, Science.

[4]  R. Abramson,et al.  Detection of specific polymerase chain reaction product by utilizing the 5'----3' exonuclease activity of Thermus aquaticus DNA polymerase. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[5]  D. Schwartz,et al.  Ordered restriction maps of Saccharomyces cerevisiae chromosomes constructed by optical mapping. , 1993, Science.

[6]  J. Tait,et al.  Challenges and opportunities. , 1996, Journal of psychiatric and mental health nursing.

[7]  L. Penland,et al.  Use of a cDNA microarray to analyse gene expression patterns in human cancer , 1996, Nature Genetics.

[8]  Ash A. Alizadeh,et al.  Genomic-scale gene expression profiling of normal and malignant immune cells. , 2000, Current opinion in immunology.

[9]  G L Andersen,et al.  Sequence-specific identification of 18 pathogenic microorganisms using microarray technology. , 2002, Molecular and cellular probes.

[10]  D. Dressman,et al.  Transforming single DNA molecules into fluorescent magnetic particles for detection and enumeration of genetic variations , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[11]  S. Turner,et al.  Zero-Mode Waveguides for Single-Molecule Analysis at High Concentrations , 2003, Science.

[12]  Jan Berka,et al.  A massively parallel PicoTiterPlate™ based platform for discrete picoliter‐scale polymerase chain reactions , 2003, Electrophoresis.

[13]  J. Lieb,et al.  ChIP-chip: considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments. , 2004, Genomics.

[14]  Experimental methods and applications of X-ray diffraction analysis , 2004 .

[15]  P. Brown,et al.  Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[16]  D. Stenger,et al.  Nucleic Acid Amplification Strategies for DNA Microarray-Based Pathogen Detection , 2004, Applied and Environmental Microbiology.

[17]  Evan E. Eichler,et al.  An assessment of the sequence gaps: Unfinished business in a finished human genome , 2004, Nature Reviews Genetics.

[18]  K. Liedl,et al.  Towards an Understanding of DNA Recognition by the Methyl-CpG Binding Domain 1 , 2005, Journal of biomolecular structure & dynamics.

[19]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[20]  A. Gnirke,et al.  Reduced representation bisulfite sequencing for comparative high-resolution DNA methylation analysis , 2005, Nucleic acids research.

[21]  Nicholas J. Turro,et al.  Four-color DNA sequencing by synthesis using cleavable fluorescent nucleotide reversible terminators , 2006, Proceedings of the National Academy of Sciences.

[22]  A. Tomkinson,et al.  DNA ligases: structure, reaction mechanism, and function. , 2006, Chemical reviews.

[23]  A. Tomkinson,et al.  DNA ligases: structure, reaction mechanism, and function. , 2006, Chemical reviews.

[24]  M. Fedurco,et al.  BTA, a novel reagent for DNA attachment on glass and efficient generation of solid-phase amplified DNA colonies , 2006, Nucleic acids research.

[25]  David S Dandy,et al.  Array feature size influences nucleic acid surface capture in DNA microarrays , 2007, Proceedings of the National Academy of Sciences.

[26]  G. Church,et al.  Polony Multiplex Analysis of Gene Expression (PMAGE) in Mouse Hypertrophic Cardiomyopathy , 2007, Science.

[27]  S. Mirkin Expandable DNA repeats and human disease , 2007, Nature.

[28]  S. Mccarroll,et al.  Copy-number variation and association studies of human disease , 2007, Nature Genetics.

[29]  P. Morin,et al.  Highly accurate SNP genotyping from historical and low‐quality samples , 2007 .

[30]  Z. Xuan,et al.  Genome-wide in situ exon capture for selective resequencing , 2007, Nature Genetics.

[31]  N. Carter Methods and strategies for analyzing copy number variation using DNA microarrays , 2007, Nature Genetics.

[32]  J. Buizer-Voskamp,et al.  Recurrent CNVs disrupt three candidate genes in schizophrenia patients. , 2008, American journal of human genetics.

[33]  Jingyue Ju,et al.  Four-color DNA sequencing with 3′-O-modified nucleotide reversible terminators and chemically cleavable fluorescent dideoxynucleotides , 2008, Proceedings of the National Academy of Sciences.

[34]  Juliane C. Dohm,et al.  Substantial biases in ultra-short read data sets from high-throughput DNA sequencing , 2008, Nucleic acids research.

[35]  H. D. Vanguilder,et al.  Twenty-five years of quantitative PCR for gene expression analysis. , 2008, BioTechniques.

[36]  D. Pinto,et al.  Structural variation of chromosomes in autism spectrum disorder. , 2008, American journal of human genetics.

[37]  Steven M. Johnson,et al.  A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning. , 2008, Genome research.

[38]  Antony V. Cox,et al.  Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing , 2008, Nature Genetics.

[39]  S. Quake,et al.  Single-Molecule DNA Sequencing of a Viral Genome , 2008, Science.

[40]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[41]  Timothy B. Stockwell,et al.  Evaluation of next generation sequencing platforms for population targeted sequencing studies , 2009, Genome Biology.

[42]  C. T. Farley,et al.  Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome , 2008 .

[43]  I. Pe’er,et al.  Caenorhabditis elegans mutant allele identification by whole-genome sequencing , 2008, Nature Methods.

[44]  Amy E. Hawkins,et al.  DNA sequencing of a cytogenetically normal acute myeloid leukemia genome , 2008, Nature.

[45]  Mark I. McCarthy,et al.  Concept, Design and Implementation of a Cardiovascular Gene-Centric 50 K SNP Array for Large-Scale Genomic Association Studies , 2008, PloS one.

[46]  S. Salzberg,et al.  Bioinformatics challenges of new sequencing technology. , 2008, Trends in genetics : TIG.

[47]  Yufeng Shen,et al.  Comparing Platforms for C. elegans Mutant Identification Using High-Throughput Whole-Genome Sequencing , 2008, PloS one.

[48]  Rafael A Irizarry,et al.  Comprehensive high-throughput arrays for relative methylation (CHARM). , 2008, Genome research.

[49]  Timothy E. Reddy,et al.  Distinct DNA methylation patterns characterize differentiated human embryonic stem cells and developing human fetal liver. , 2009, Genome research.

[50]  Reid F. Thompson,et al.  High-resolution genome-wide cytosine methylation profiling with simultaneous copy number analysis and optimization for limited cell numbers , 2009, Nucleic acids research.

[51]  H. Bayley,et al.  Continuous base identification for single-molecule nanopore DNA sequencing. , 2009, Nature nanotechnology.

[52]  Johnf . Thompson,et al.  Virtual Terminator nucleotides for next generation DNA sequencing , 2009, Nature Methods.

[53]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[54]  P. Park ChIP–seq: advantages and challenges of a maturing technology , 2009, Nature Reviews Genetics.

[55]  S. Turner,et al.  Real-Time DNA Sequencing from Single Polymerase Molecules , 2009, Science.

[56]  C. Perou,et al.  Mammary development meets cancer genomics , 2009, Nature Medicine.

[57]  Ramesh Ramakrishnan,et al.  Taking qPCR to a higher level: Analysis of CNV reveals the power of high throughput qPCR to enhance quantitative resolution. , 2010, Methods.

[58]  Tyson A. Clark,et al.  Direct detection of DNA methylation during single-molecule, real-time sequencing , 2010, Nature Methods.

[59]  Robert B. Hartlage,et al.  This PDF file includes: Materials and Methods , 2009 .

[60]  Peilin Jia,et al.  Common variants conferring risk of schizophrenia: A pathway analysis of GWAS data , 2010, Schizophrenia Research.

[61]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[62]  C. E. Pearson,et al.  Table S2: Trans-factors and trinucleotide repeat instability Trans-factor , 2010 .

[63]  P. Stankiewicz,et al.  Structural variation in the human genome and its role in disease. , 2010, Annual review of medicine.

[64]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[65]  D. Goldstein,et al.  Uncovering the roles of rare variants in common disease through whole-genome sequencing , 2010, Nature Reviews Genetics.

[66]  M. Schatz,et al.  Assembly of large genomes using second-generation sequencing. , 2010, Genome research.

[67]  E. Dolgin Personalized investigation , 2010, Nature Medicine.

[68]  Martin Kircher,et al.  High‐throughput DNA sequencing – concepts and limitations , 2010, BioEssays : news and reviews in molecular, cellular and developmental biology.

[69]  Elie Dolgin NEWS FEATURE:個別化遺伝学研究 , 2010 .

[70]  Larry J Kricka,et al.  Concordance study of 3 direct-to-consumer genetic-testing services. , 2011, Clinical chemistry.

[71]  Margaret C. Linak,et al.  Sequence-specific error profile of Illumina sequencers , 2011, Nucleic acids research.

[72]  Michael Krawczak,et al.  Technology-specific error signatures in the 1000 Genomes Project data , 2011, Human Genetics.

[73]  T. Glenn Field guide to next‐generation DNA sequencers , 2011, Molecular ecology resources.

[74]  Juliane C. Dohm,et al.  Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and Genome Analyzer systems , 2011, Genome Biology.

[75]  Bernard P. Puc,et al.  An integrated semiconductor device enabling non-optical genome sequencing , 2011, Nature.

[76]  H. Swerdlow,et al.  A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers , 2012, BMC Genomics.

[77]  Michael F. Walker,et al.  De novo mutations revealed by whole-exome sequencing are strongly associated with autism , 2012, Nature.

[78]  Shashikant Kulkarni,et al.  Assuring the quality of next-generation sequencing in clinical laboratory practice , 2012, Nature Biotechnology.

[79]  Puay Hoon Tan,et al.  Development of a next-generation sequencing method for BRCA mutation screening: a comparison between a high-throughput and a benchtop platform. , 2012, The Journal of molecular diagnostics : JMD.

[80]  Bradley P. Coe,et al.  Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations , 2012, Nature.

[81]  Mauricio O. Carneiro,et al.  Pacific biosciences sequencing technology for genotyping and variation discovery in human data , 2012, BMC Genomics.

[82]  Shamil R Sunyaev,et al.  Inferring causality and functional significance of human coding DNA variants. , 2012, Human molecular genetics.

[83]  Joshua F. McMichael,et al.  Whole Genome Analysis Informs Breast Cancer Response to Aromatase Inhibition , 2012, Nature.

[84]  P. Kwok,et al.  Genome mapping on nanochannel arrays for structural variation analysis and sequence assembly , 2012, Nature Biotechnology.

[85]  M. Schatz,et al.  Hybrid error correction and de novo assembly of single-molecule sequencing reads , 2012, Nature Biotechnology.

[86]  T. Dallman,et al.  Performance comparison of benchtop high-throughput sequencing platforms , 2012, Nature Biotechnology.

[87]  Donald Sharon,et al.  A single-molecule long-read survey of the human transcriptome , 2013, Nature Biotechnology.

[88]  M. C. Schatz,et al.  The DNA data deluge , 2013, IEEE Spectrum.

[89]  Sarah McCalmon,et al.  Sequencing the unsequenceable: Expanded CGG-repeat alleles of the fragile X gene , 2013, Genome research.

[90]  Pui-Yan Kwok,et al.  Rapid Genome Mapping in Nanochannel Arrays for Highly Complete and Accurate De Novo Sequence Assembly of the Complex Aegilops tauschii Genome , 2013, PloS one.

[91]  E. Mardis Next-generation sequencing platforms. , 2013, Annual review of analytical chemistry.

[92]  Ben Langmead,et al.  The DNA Data Deluge: Fast, efficient genome sequencing machines are spewing out more data than geneticists can analyze. , 2013, IEEE spectrum.

[93]  Shibing Deng,et al.  Multiplexed gene expression and fusion transcript analysis to detect ALK fusions in lung cancer. , 2013, The Journal of molecular diagnostics : JMD.

[94]  Roland Eils,et al.  Coverage Bias and Sensitivity of Variant Calling for Four Whole-genome Sequencing Technologies , 2013, PloS one.

[95]  Simona Soverini,et al.  Comparison of Next-Generation Sequencing Systems , 2013 .

[96]  Robert C. Green,et al.  Ethics and Genomic Incidental Findings , 2013, Science.

[97]  Sean Ferree,et al.  Analytical validation of the PAM50-based Prosigna Breast Cancer Prognostic Gene Signature Assay and nCounter Analysis System using formalin-fixed paraffin-embedded breast tumor specimens , 2014, BMC Cancer.

[98]  A. Yoder,et al.  The utility of PacBio circular consensus sequencing for characterizing complex gene families in non-model organisms , 2014, BMC Genomics.

[99]  K. Dewar,et al.  Sequencing of the Dutch elm disease fungus genome using the Roche/454 GS-FLX Titanium System in a comparison of multiple genomics core facilities. , 2013, Journal of biomolecular techniques : JBT.

[100]  Liming Liang,et al.  A cross-platform analysis of 14,177 expression quantitative trait loci derived from lymphoblastoid cell lines , 2013, Genome research.

[101]  Xavier Estivill,et al.  The complex SNP and CNV genetic architecture of the increased risk of congenital heart defects in Down syndrome , 2013, Genome research.

[102]  Howard Y. Chang,et al.  Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position , 2013, Nature Methods.

[103]  Aaron M. Newman,et al.  The genome sequence of the colonial chordate, Botryllus schlosseri , 2013, eLife.

[104]  M. Akeson,et al.  Nanopores Discriminate among Five C5-Cytosine Variants in DNA , 2014, Journal of the American Chemical Society.

[105]  Keith R. Jerome,et al.  Clinical Utility of Droplet Digital PCR for Human Cytomegalovirus , 2014, Journal of Clinical Microbiology.

[106]  N. Risch,et al.  Estimating genotype error rates from high-coverage next-generation sequence data , 2014, Genome research.

[107]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[108]  Sheng Li,et al.  Multi-platform assessment of transcriptome profiling using RNA-seq in the ABRF next-generation sequencing study , 2014, Nature Biotechnology.

[109]  Boris Yamrom,et al.  The contribution of de novo coding mutations to autism spectrum disorder , 2014, Nature.

[110]  E. Diamandis,et al.  Whole genome sequencing as a diagnostic test: challenges and opportunities. , 2014, Clinical chemistry.

[111]  Xun Xu,et al.  Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology , 2014, GigaScience.

[112]  G. Troncone,et al.  Ion Torrent next-generation sequencing for routine identification of clinically relevant mutations in colorectal cancer patients , 2014, Journal of Clinical Pathology.

[113]  A. Bittner,et al.  Comparison of RNA-Seq and Microarray in Transcriptome Profiling of Activated T Cells , 2014, PloS one.

[114]  M. Ahn,et al.  High-throughput profiling identifies clinically actionable mutations in salivary duct carcinoma , 2014, Journal of Translational Medicine.

[115]  David Hsu,et al.  Characterization of structural variants with single molecule and hybrid sequencing approaches , 2014, Bioinform..

[116]  A. N’Diaye,et al.  A novel low energy electron microscope for DNA sequencing and surface analysis. , 2014, Ultramicroscopy.

[117]  N. Neff,et al.  Reconstructing lineage hierarchies of the distal lung epithelium using single cell RNA-seq , 2014, Nature.

[118]  Dmitry Pushkarev,et al.  Whole-genome haplotyping using long reads and statistical methods , 2014, Nature Biotechnology.

[119]  Rajiv C. McCoy,et al.  Illumina TruSeq synthetic long-reads empower de novo assembly and resolve complex, highly repetitive transposable elements , 2014 .

[120]  J. Zook,et al.  Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls , 2013, Nature Biotechnology.

[121]  Matthew W. Snyder,et al.  Haplotype-resolved genome sequencing: experimental methods and applications , 2015, Nature Reviews Genetics.

[122]  David A. Eccles,et al.  MinION Analysis and Reference Consortium: Phase 1 data release and analysis , 2015, F1000Research.

[123]  Alan M. Kwong,et al.  Genome sequencing elucidates Sardinian genetic architecture and augments association analyses for lipid and blood inflammatory markers , 2015, Nature Genetics.

[124]  Tom R. Gaunt,et al.  The UK10K project identifies rare variants in health and disease , 2016 .

[125]  Caleb F. Davis,et al.  Assessing structural variation in a personal genome—towards a human reference diploid genome , 2015, BMC Genomics.

[126]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[127]  Benedict Paten,et al.  Improved data analysis for the MinION nanopore sequencer , 2015, Nature Methods.

[128]  Mark J. P. Chaisson,et al.  Resolving the complexity of the human genome using single-molecule sequencing , 2014, Nature.

[129]  Joshua Quick,et al.  Rapid draft sequencing and real-time nanopore sequencing in a hospital outbreak of Salmonella , 2015, Genome Biology.

[130]  Sara Goodwin,et al.  Oxford Nanopore sequencing, hybrid error correction, and de novo assembly of a eukaryotic genome , 2015, bioRxiv.

[131]  Russell E. Durrett,et al.  Assembly and diploid architecture of an individual human genome via single-molecule technologies , 2015, Nature Methods.

[132]  Bjarni V. Halldórsson,et al.  Large-scale whole-genome sequencing of the Icelandic population , 2015, Nature Genetics.

[133]  Gabor T. Marth,et al.  An integrated map of structural variation in 2,504 human genomes , 2015, Nature.

[134]  Joshua F. McMichael,et al.  Optimizing cancer genome sequencing and analysis. , 2015, Cell systems.

[135]  Evan E. Eichler,et al.  Genetic variation and the de novo assembly of human genomes , 2015, Nature Reviews Genetics.

[136]  Xiaoqing Yu,et al.  A trimming-and-retrieving alignment scheme for reduced representation bisulfite sequencing , 2015, Bioinform..

[137]  David A. Matthews,et al.  Real-time, portable genome sequencing for Ebola surveillance , 2016, Nature.