An integrated system for DNA sequencing by synthesis using novel nucleotide analogues.

The Human Genome Project has concluded, but its successful completion has increased, rather than decreased, the need for high-throughput DNA sequencing technologies. The possibility of clinically screening a full genome for an individual's mutations offers tremendous benefits, both for pursuing personalized medicine and for uncovering the genomic contributions to diseases. The Sanger sequencing method, although enormously productive for more than 30 years, requires an electrophoretic separation step that, unfortunately, remains a key technical obstacle for achieving economically acceptable full-genome results. Alternative sequencing approaches thus focus on innovations that can reduce costs. The DNA sequencing by synthesis (SBS) approach has shown great promise as a new sequencing platform, with particular progress reported recently. The general fluorescent SBS approach involves (i) incorporation of nucleotide analogs bearing fluorescent reporters, (ii) identification of the incorporated nucleotide by its fluorescent emissions, and (iii) cleavage of the fluorophore, along with the reinitiation of the polymerase reaction for continuing sequence determination. In this Account, we review the construction of a DNA-immobilized chip and the development of novel nucleotide reporters for the SBS sequencing platform. Click chemistry, with its high selectivity and coupling efficiency, was explored for surface immobilization of DNA. The first generation (G-1) modified nucleotides for SBS feature a small chemical moiety capping the 3'-OH and a fluorophore tethered to the base through a chemically cleavable linker; the design ensures that the nucleotide reporters are good substrates for the polymerase. The 3'-capping moiety and the fluorophore on the DNA extension products, generated by the incorporation of the G-1 modified nucleotides, are cleaved simultaneously to reinitiate the polymerase reaction. The sequence of a DNA template immobilized on a surface via click chemistry is unambiguously identified with this chip-SBS system. The second generation (G-2) SBS system was developed based on the concept that the closer the structures of the added nucleotide and the primer are to their natural counterparts, the more faithfully the polymerase would incorporate the nucleotide. In this approach, the polymerase reaction is performed with the combination of 3'-capped nucleotide reversible terminators (NRTs) and cleavable fluorescent dideoxynucleotides (ddNTPs). By sacrifice of a small amount of the primers permanently terminated by ddNTPs, the majority of the primers extended by the reversible terminators are reverted to the natural ones after each sequencing cycle. We have also developed the 3'-capped nucleotide reversible terminators to solve the problem of deciphering the homopolymeric regions of the template in conventional pyrosequencing. The 3'-capping moiety on the DNA extension product temporarily terminates the polymerase reaction, which allows only one nucleotide to be incorporated during each sequencing cycle. Thus, the number of nucleotides in the homopolymeric regions are unambiguously determined using the 3'-capped NRTs. It has been established that millions of DNA templates can be immobilized on a chip surface through a variety of approaches. Therefore, the integration of these high-density DNA chips with the molecular-level SBS approaches described in this Account is expected to generate a high-throughput and accurate DNA sequencing system with wide applications in biological research and health care.

[1]  Nicholas J. Turro,et al.  Four-color DNA sequencing by synthesis using cleavable fluorescent nucleotide reversible terminators , 2006, Proceedings of the National Academy of Sciences.

[2]  C. Heiner,et al.  New dye-labeled terminators for improved DNA sequencing patterns. , 1997, Nucleic acids research.

[3]  J. Lupski,et al.  The complete genome of an individual by massively parallel DNA sequencing , 2008, Nature.

[4]  G. Church,et al.  Polony Multiplex Analysis of Gene Expression (PMAGE) in Mouse Hypertrophic Cardiomyopathy , 2007, Science.

[5]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[6]  A. Waggoner,et al.  Directly labeled DNA probes using fluorescent nucleotides with different length linkers. , 1994, Nucleic acids research.

[7]  Johnf . Thompson,et al.  Virtual Terminator nucleotides for next generation DNA sequencing , 2009, Nature Methods.

[8]  S. Turner,et al.  Real-Time DNA Sequencing from Single Polymerase Molecules , 2009, Science.

[9]  M. Fedurco,et al.  BTA, a novel reagent for DNA attachment on glass and efficient generation of solid-phase amplified DNA colonies , 2006, Nucleic acids research.

[10]  Jay Shendure,et al.  Fluorescent in situ sequencing on polymerase colonies. , 2003, Analytical biochemistry.

[11]  M. G. Finn,et al.  Click Chemistry: Diverse Chemical Function from a Few Good Reactions , 2001 .

[12]  Andreas von Bubnoff,et al.  Next-Generation Sequencing: The Race Is On , 2008, Cell.

[13]  S. Quake,et al.  Single-Molecule DNA Sequencing of a Viral Genome , 2008, Science.

[14]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[15]  E. Mardis Next-generation DNA sequencing methods. , 2008, Annual review of genomics and human genetics.

[16]  Samuel H. Wilson,et al.  Structures of ternary complexes of rat DNA polymerase beta, a DNA template-primer, and ddCTP. , 1994, Science.

[17]  D. Branton,et al.  Characterization of individual polynucleotide molecules using a membrane channel. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[18]  J R Edwards,et al.  DNA sequencing using biotinylated dideoxynucleotides and mass spectrometry. , 2001, Nucleic acids research.

[19]  Tae Seok Seo,et al.  Photocleavable fluorescent nucleotides for DNA sequencing on a chip constructed by site-specific coupling chemistry. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[20]  S. Martin,et al.  DNA sequencing by delayed extraction-matrix-assisted laser desorption/ionization time of flight mass spectrometry. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[21]  C. T. Farley,et al.  Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome , 2008 .

[22]  Nicholas J Turro,et al.  3′-O-modified nucleotides as reversible terminators for pyrosequencing , 2007, Proceedings of the National Academy of Sciences.

[23]  M. Fedurco,et al.  A new class of cleavable fluorescent nucleotides: synthesis and optimization as reversible terminators for DNA sequencing by synthesis† , 2008, Nucleic acids research.

[24]  F. Collins,et al.  A vision for the future of genomics research , 2003, Nature.

[25]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[26]  Tony Smith,et al.  Whole genome variation analysis using single molecule sequencing , 2004 .

[27]  S. Quake,et al.  Sequence information can be obtained from single DNA molecules , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[29]  R. Drmanac,et al.  Accurate sequencing by hybridization for DNA diagnostics and individual genomics , 1998, Nature Biotechnology.

[30]  G. Turcatti,et al.  Solid phase DNA amplification: characterisation of primer attachment and amplification mechanisms. , 2000, Nucleic acids research.

[31]  F. Sanger,et al.  DNA sequencing with chain-terminating inhibitors. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Jingyue Ju,et al.  Four-color DNA sequencing with 3′-O-modified nucleotide reversible terminators and chemically cleavable fluorescent dideoxynucleotides , 2008, Proceedings of the National Academy of Sciences.

[33]  N. Perrimon,et al.  An endogenous small interfering RNA pathway in Drosophila , 2008, Nature.

[34]  Charles R. Cantor,et al.  Sequencing exons 5 to 8 of the p53 gene by MALDI-TOF mass spectrometry , 1998, Nature Biotechnology.

[35]  Kendall N Houk,et al.  Accounts of Chemical Research. , 2008, Accounts of chemical research.

[36]  Francisco M. De La Vega,et al.  Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding. , 2009, Genome research.

[37]  Tae Seok Seo,et al.  Four-color DNA sequencing by synthesis on a chip using photocleavable fluorescent nucleotides , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  M. Ronaghi,et al.  A Sequencing Method Based on Real-Time Pyrophosphate , 1998, Science.

[39]  T. Mikkelsen,et al.  Genome-wide maps of chromatin state in pluripotent and lineage-committed cells , 2007, Nature.

[40]  M. Ronaghi,et al.  Real-time DNA sequencing using detection of pyrophosphate release. , 1996, Analytical biochemistry.