Preselection of shotgun clones by oligonucleotide fingerprinting: an efficient and high throughput strategy to reduce redundancy in large-scale sequencing projects.

Large-scale genomic sequencing projects generally rely on random sequencing of shotgun clones, followed by different gap closing strategies. To reduce the overall effort and cost of those projects and to accelerate the sequencing throughput, we have developed an efficient, high throughput oligonucleotide fingerprinting protocol to select optimal shotgun clone sets prior to sequencing. Both computer simulations and experimental results, obtained from five PAC-derived shotgun libraries spanning 535 kb of the 17p11.2 region of the human genome, demonstrate that at least a 2-fold reduction in the number of sequence reads required to sequence an individual genomic clone (cosmid, PAC, etc.) can be achieved. Treatment of clone contigs with significant clone overlaps will allow an even greater reduction.

[1]  H. Lehrach,et al.  A subcloning strategy for DNA sequence analysis. , 1980, Nucleic acids research.

[2]  A. Poustka,et al.  Molecular approaches to mammalian genetics. , 1986, Cold Spring Harbor symposia on quantitative biology.

[3]  D. Berg,et al.  Tn5supF, a 264-base-pair transposon derived from Tn5 for insertion mutagenesis and sequencing DNAs cloned in phage lambda. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[4]  R. Drmanac,et al.  Sequencing of megabase plus DNA by hybridization: theory of the method. , 1989, Genomics.

[5]  V A McKusick,et al.  HUGO news. The Human Genome Organisation: history, purposes, and membership. , 1989, Genomics.

[6]  H. Lehrach,et al.  Ordering of cosmid clones covering the herpes simplex virus type I (HSV-I) genome: a test case for fingerprinting by hybridisation. , 1990, Nucleic acids research.

[7]  R. F. Johnston,et al.  Autoradiography using storage phosphor technology , 1990, Electrophoresis.

[8]  R. Drmanac,et al.  Reliable hybridization of oligonucleotides as short as six nucleotides. , 1990, DNA and cell biology.

[9]  Lysov YuP,et al.  A method for DNA sequencing by hybridization with oligonucleotide matrix. , 1991, DNA sequence : the journal of DNA sequencing and mapping.

[10]  R. Drmanac,et al.  DNA sequencing by hybridization: 100 bases read by a non-gel-based method. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[11]  N. Kleckner,et al.  Uses of transposons with emphasis on Tn10. , 1991, Methods in enzymology.

[12]  M. Palazzolo,et al.  Transposon-facilitated DNA sequencing. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[13]  R. Drmanac,et al.  Sequencing by hybridization: Towards an automated sequencing of one million M13 clones arrayed on membranes , 1992, Electrophoresis.

[14]  AC Tose Cell , 1993, Cell.

[15]  L. Hood,et al.  DNA sequence determination by hybridization: a strategy for efficient large-scale sequencing. , 1993, Science.

[16]  Hans Lehrach,et al.  An automated approach to generating expressed sequence catalogues , 1993, Nature.

[17]  Hans Lehrach,et al.  High resolution cosmid and P1 maps spanning the 14 Mb genome of the fission yeast S. pombe , 1993, Cell.

[18]  S. Meier-Ewert,et al.  Application of robotic technology to automated sequence fingerprint analysis by oligonucleotide hybridisation. , 1994, Journal of biotechnology.

[19]  S. Devine,et al.  Efficient integration of artificial transposons into plasmid targets in vitro: a useful tool for DNA mapping, sequencing and genetic analysis. , 1994, Nucleic acids research.

[20]  H. Blocker,et al.  The 'shortmer' approach to nucleic acid sequence analysis. I: Computer simulation of sequencing projects to find economical primer sets , 1994, Comput. Appl. Biosci..

[21]  A. Mirzabekov,et al.  DNA sequencing by hybridization--a megasequencing method and a diagnostic tool? , 1994, Trends in biotechnology.

[22]  A. Milosavljevic,et al.  Clone clustering by hybridization. , 1995, Genomics.

[23]  S Meier-Ewert,et al.  Fine-mapping of shotgun template-libraries; an efficient strategy for the systematic sequencing of genomic DNA. , 1995, Nucleic acids research.

[24]  A. Smit,et al.  Ancestral, mammalian-wide subfamilies of LINE-1 repetitive sequences. , 1995, Journal of molecular biology.

[25]  R. Drmanac,et al.  Gene-representing cDNA clusters defined by hybridization of 57,419 clones from infant brain libraries with short oligonucleotide probes. , 1996, Genomics.

[26]  P. Little Genome analysis , 1996 .

[27]  A. Milosavljevic,et al.  Discovering distinct genes represented in 29,570 clones from infant brain cDNA libraries by applying sequencing by hybridization methodology. , 1996, Genome research.

[28]  A. A. Chernyi,et al.  Efficiency of sequencing by hybridization on oligonucleotide matrix supplemented by measurement of the distance between DNA segments. , 1996, DNA sequence : the journal of DNA sequencing and mapping.

[29]  J. Weber,et al.  Human whole-genome shotgun sequencing. , 1997, Genome research.

[30]  Hans Lehrach,et al.  Automated array technologies for gene expression profiling , 1997 .

[31]  P. Green,et al.  Against a whole-genome shotgun. , 1997, Genome research.

[32]  R. Quatrano Genomics , 1998, Plant Cell.

[33]  R Herwig,et al.  Comparative gene expression profiling by oligonucleotide fingerprinting. , 1998, Nucleic acids research.

[34]  J. Badge DNA sequencing. , 1998, Methods in molecular biology.

[35]  R. Drmanac,et al.  Accurate sequencing by hybridization for DNA diagnostics and individual genomics , 1998, Nature Biotechnology.

[36]  M. Adams,et al.  Shotgun Sequencing of the Human Genome , 1998, Science.