Rapid genome sequencing with short universal tiling probes

The increasing availability of high-quality reference genomic sequences has created a demand for ways to survey the sequence differences present in individual genomes. Here we describe a DNA sequencing method based on hybridization of a universal panel of tiling probes. Millions of shotgun fragments are amplified in situ and subjected to sequential hybridization with short fluorescent probes. Long fragments of 200 bp facilitate unique placement even in large genomes. The sequencing chemistry is simple, enzyme-free and consumes only dilute solutions of the probes, resulting in reduced sequencing cost and substantially increased speed. A prototype instrument based on commonly available equipment was used to resequence the Bacteriophage λ and Escherichia coli genomes to better than 99.93% accuracy with a raw throughput of 320 Mbp/day, albeit with a significant number of small gaps attributed to losses in sample preparation.

[1]  O. White,et al.  Environmental Genome Shotgun Sequencing of the Sargasso Sea , 2004, Science.

[2]  R. Drmanac,et al.  Sequencing of megabase plus DNA by hybridization: theory of the method. , 1989, Genomics.

[3]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[4]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[5]  K. Khrapko,et al.  [Determination of the nucleotide sequence of DNA using hybridization with oligonucleotides. A new method]. , 1988, Doklady Akademii nauk SSSR.

[6]  Niall J. Haslam,et al.  An analysis of the feasibility of short read sequencing , 2005, Nucleic acids research.

[7]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[8]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[9]  Jennifer L. Ong,et al.  Directed evolution of polymerase function by compartmentalized self-replication , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[10]  E. D. Hyman A new method of sequencing DNA. , 1988, Analytical biochemistry.

[11]  Sequencing thoroughbreds , 2006, Nature Biotechnology.

[12]  Clive Brown,et al.  Toward the $1000 human genome , 2005 .

[13]  B. Canard,et al.  DNA polymerase fluorescent substrates with reversible 3'-tags. , 1994, Gene.

[14]  Andrew D Griffiths,et al.  Amplification of complex gene libraries by emulsion PCR , 2006, Nature Methods.

[15]  Clive Brown,et al.  Toward the 1,000 dollars human genome. , 2005, Pharmacogenomics.

[16]  R A Gibbs,et al.  Termination of DNA synthesis by novel 3'-modified-deoxyribonucleoside 5'-triphosphates. , 1994, Nucleic acids research.

[17]  L. M. Smith,et al.  High speed DNA sequencing by capillary electrophoresis. , 1990, Nucleic acids research.

[18]  J. Shendure,et al.  Advanced sequencing technologies: methods and goals , 2004, Nature Reviews Genetics.

[19]  Ron Shamir,et al.  A computational method for resequencing long DNA targets by universal oligonucleotide arrays , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Rithy K. Roth,et al.  Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays , 2000, Nature Biotechnology.

[21]  Gesine Reinert,et al.  Poisson Process Approximation for Sequence Repeats and Sequencing by Hybridization , 1996, J. Comput. Biol..

[22]  S. Quake,et al.  Sequence information can be obtained from single DNA molecules , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[23]  W. Bains,et al.  A novel method for nucleic acid sequence determination. , 1988, Journal of theoretical biology.

[24]  C. T. Farley,et al.  Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome , 2008 .

[25]  R. Drmanac,et al.  Accurate sequencing by hybridization for DNA diagnostics and individual genomics , 1998, Nature Biotechnology.

[26]  F. Sanger,et al.  DNA sequencing with chain-terminating inhibitors. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Richard A Mathies,et al.  Microfabricated bioprocessor for integrated nanoliter-scale Sanger DNA sequencing. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[28]  J. M. Prober,et al.  A system for rapid DNA sequencing with fluorescent chain-terminating dideoxynucleotides. , 1987, Science.

[29]  Radoje Drmanac,et al.  Sequencing by hybridization (SBH): advantages, achievements, and opportunities. , 2002, Advances in biochemical engineering/biotechnology.

[30]  W. Donachie,et al.  The cell cycle of Escherichia coli. , 1993, Annual review of microbiology.

[31]  Geoffrey B. Nilsen,et al.  Whole-Genome Patterns of Common DNA Variation in Three Human Populations , 2005, Science.

[32]  M. Ronaghi,et al.  Real-time DNA sequencing using detection of pyrophosphate release. , 1996, Analytical biochemistry.

[33]  Kunkel Jm,et al.  Spontaneous subclavain vein thrombosis: a successful combined approach of local thrombolytic therapy followed by first rib resection. , 1989 .

[34]  G. Church,et al.  In situ localized amplification and contact replication of many individual DNA molecules. , 1999, Nucleic acids research.

[35]  Poul Nielsen,et al.  LNA (Locked Nucleic Acids): Synthesis of the adenine, cytosine, guanine, 5-methylcytosine, thymine and uracil bicyclonucleoside monomers, oligomerisation, and unprecedented nucleic acid recognition , 1998 .

[36]  P. Lizardi,et al.  Mutation detection and single-molecule counting using isothermal rolling-circle amplification , 1998, Nature Genetics.

[37]  Mariza de Andrade,et al.  High-resolution whole-genome association study of Parkinson disease. , 2005, American journal of human genetics.

[38]  J. Ott,et al.  Complement Factor H Polymorphism in Age-Related Macular Degeneration , 2005, Science.

[39]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[40]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[41]  Ron Shamir,et al.  Large Scale Sequencing by Hybridization , 2002, J. Comput. Biol..