Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry

DNA sequence information underpins genetic research, enabling discoveries of important biological or medical benefit. Sequencing projects have traditionally used long (400–800 base pair) reads, but the existence of reference sequences for the human and many other genomes makes it possible to develop new, fast approaches to re-sequencing, whereby shorter reads are compared to a reference to identify intraspecies genetic variation. Here we report an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Images of the surface are analysed to generate high-quality sequence. We demonstrate application of this approach to human genome sequencing on flow-sorted X chromosomes and then scale the approach to determine the genome sequence of a male Yoruba from Ibadan, Nigeria. We build an accurate consensus sequence from >30× average depth of paired 35-base reads. We characterize four million single-nucleotide polymorphisms and four hundred thousand structural variants, many of which were previously unknown. Our approach is effective for accurate, rapid and economical whole-genome re-sequencing and many other biomedical applications.

Nancy F. Hansen | Lisa J. Murray | Phillip J. Black | Rithy K. Roth | Dale H. Buermann | James C. Burrows | R. Durbin | Klaudia Walter | N. Carter | M. Hurles | K. F. Fajardo | D. Bentley | J. Mullikin | J. Rogers | G. Smith | S. Luo | I. Khrebtukova | Lu Zhang | G. Schroth | S. Tzonev | S. Balasubramanian | H. Swerdlow | J. Milton | Clive Brown | K. Hall | D. Evers | C. Barnes | Helen Bignell | J. M. Boutell | J. Bryant | Richard J. Carter | R. K. Cheetham | A. Cox | D. Ellis | Michael R. Flatbush | N. Gormley | S. Humphray | Leslie J. Irving | Mirian Karbelashvili | S. Kirk | Heng Li | Xiaohai Liu | K. Maisinger | B. Obradovic | T. Ost | Michael L. Parkinson | Mark Pratt | I. Rasolonjatovo | M. Reed | R. Rigatti | C. Rodighiero | M. Ross | A. Sabot | S. Sankar | A. Scally | Mark E. B. Smith | Vincent P. Smith | A. Spiridou | Peta E. Torrance | E. Vermaas | Xiaolin Wu | Mohammed D. Alam | Carole Anastasi | I. Aniebo | D. M. Bailey | I. Bancarz | Saibal Banerjee | Selena G. Barbour | P. Baybayan | Vincent A. Benoit | Kevin Benson | Claire Bevis | Asha Boodhun | J. S. Brennan | J. Bridgham | Rob C. Brown | A. Brown | Abass A. Bundu | Nestor Castillo | M. C. E. Catenazzi | Simon Chang | R. Cooley | Natasha R. Crake | Olubunmi O. Dada | Konstantinos D. Diakoumakos | Belen Dominguez-Fernandez | D. J. Earnshaw | Ugonna C. Egbujor | Dave Elmore | S. Etchin | Mark Ewan | M. Fedurco | Louise J. Fraser | W. S. Furey | Dave George | Kimberley J. Gietzen | Colin P. Goddard | G. Golda | Philip A. Granieri | David E. Green | D. Gustafson | N. Hansen | K. Harnish | C. Haudenschild | Narinder I. Heyer | M. Hims | Johnny T. Ho | A. Horgan | Katya Hoschler | Steve Hurwitz | D. V. Ivanov | Maria Q. Johnson | T. James | T. A. Jones | Gyoung-Dong Kang | Tzvetana H. Kerelska | A. Kersey | A. Kindwall | Z. Kingsbury | P. Kokko-Gonzales | Anil Kumar | M. Laurent | C. Lawley | Sarah E. Lee | X. Lee | A. Liao | J. Loch | Mitch Lok | Radhika M. Mammen | J. W. Martin | P. McCauley | P. McNitt | Parul D. Mehta | K. Moon | Joe W. Mullens | T. Newington | Z. Ning | B. Ng | Sonia M. Novo | Michael J. O'neill | M. Osborne | A. Osnowski | Omead Ostadan | L. Paraschos | L. Pickering | Andrew C. Pike | A. Pike | D. Chris Pinkard | Daniel P. Pliskin | Joe Podhasky | Victor J. Quijano | C. Raczy | Vicki H. Rae | S. Rawlings | Ana Chiva Rodriguez | Phyllida Roe | J. Rogers | M. C. Rogert Bacigalupo | Nikolai Romanov | A. Romieu | Natalie J. Rourke | Silke T. Ruediger | E. Rusman | Raquel M. Sanches-Kuiper | M. Schenker | J. Seoane | R. Shaw | Mitch K. Shiver | S. W. Short | N. Sizto | Johannes P. Sluis | Melanie A. Smith | J. S. Sohna | Eric Spence | K. Stevens | Neil W. Sutton | L. Szajkowski | C. Tregidgo | G. Turcatti | S. vandeVondele | Yuli Verhovsky | Selene M. Virk | S. Wakelin | Gregory C. Walcott | Jingwen Wang | G. Worsley | Juying Yan | L. Yau | Mike Zuerlein | N. J. McCooke | John S. West | F. Oaks | Peter L. Lundberg | D. Klenerman | Anthony J. Smith | P. Mehta | Terena James | C. Anastasi | Mark E. B. Smith | Xiaolin Wu | D. Earnshaw | Richard Shaw | Vincent Benoit | S. Short | M. Smith | M. A. Smith | Mark E. B. Smith | M. Smith | Mark E. B. Smith | Parul Mehta | Svilen Tzonev | G. P. Smith | Scott M. Kirk | Adrian Horgan | Eric H. Vermaas | Keith Moon | S. Vandevondele | Shujun J Luo | L. Murray | Jennifer A. Loch | Selena Barbour | Anastassia Spiridou | Matthew M. Hims | Colin L. Barnes | Gregory Walcott | Mitch Shiver

[1]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[2]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[3]  M. Cargill Characterization of single-nucleotide polymorphisms in coding regions of human genes , 1999, Nature Genetics.

[4]  M. Daly,et al.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms , 2001, Nature.

[5]  Philip Lijnzaad,et al.  The Ensembl genome database project , 2002, Nucleic Acids Res..

[6]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[7]  P. Tam The International HapMap Consortium. The International HapMap Project (Co-PI of Hong Kong Centre which responsible for 2.5% of genome) , 2003 .

[8]  J. Bonfield,et al.  Finishing the euchromatic sequence of the human genome , 2004, Nature.

[9]  J. Bonfield,et al.  Finishing the euchromatic sequence of the human genome , 2004, Nature.

[10]  International Human Genome Sequencing Consortium Finishing the euchromatic sequence of the human genome , 2004 .

[11]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[12]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[13]  J. Shendure,et al.  Materials and Methods Som Text Figs. S1 and S2 Tables S1 to S4 References Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome , 2022 .

[14]  E. Eichler,et al.  Fine-scale structural variation of the human genome , 2005, Nature Genetics.

[15]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[16]  D. Conrad,et al.  Global variation in copy number in the human genome , 2006, Nature.

[17]  M. Fedurco,et al.  BTA, a novel reagent for DNA attachment on glass and efficient generation of solid-phase amplified DNA colonies , 2006, Nucleic acids research.

[18]  Jay Shendure,et al.  Multiplex amplification of large sets of human exons , 2007, Nature Methods.

[19]  Timothy B. Stockwell,et al.  The Diploid Genome Sequence of an Individual Human , 2007, PLoS biology.

[20]  T. Mikkelsen,et al.  Genome-wide maps of chromatin state in pluripotent and lineage-committed cells , 2007, Nature.

[21]  Philip M. Kim,et al.  Paired-End Mapping Reveals Extensive Structural Variation in the Human Genome , 2007, Science.

[22]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[23]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[24]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[25]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[26]  Z. Xuan,et al.  Genome-wide in situ exon capture for selective resequencing , 2007, Nature Genetics.

[27]  J. Lupski,et al.  The complete genome of an individual by massively parallel DNA sequencing , 2008, Nature.

[28]  R. Lister,et al.  Highly Integrated Single-Base Resolution Maps of the Epigenome in Arabidopsis , 2008, Cell.

[29]  Antony V. Cox,et al.  Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing , 2008, Nature Genetics.

[30]  S. Quake,et al.  Single-Molecule DNA Sequencing of a Viral Genome , 2008, Science.

[31]  Gabor T. Marth,et al.  Whole-genome sequencing and variant discovery in C. elegans , 2008, Nature Methods.

[32]  Peiqian Zhao,et al.  Parallel confocal detection of single molecules in real time. , 2008, Optics letters.

[33]  Z. Weng,et al.  High-Resolution Mapping and Characterization of Open Chromatin across the Genome , 2008, Cell.

[34]  Joshua M. Korn,et al.  Mapping and sequencing of structural variation from eight human genomes , 2008, Nature.

[35]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[36]  J. Shendure The beginning of the end for microarrays? , 2008, Nature Methods.

[37]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.