Unlocking hidden genomic sequence.

Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs.

[1]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[2]  E. Ohtsuka,et al.  Studies on nucleic acid interactions. I. Stabilities of mini-duplexes (dG2A4XA4G2-dC2T4YT4C2) and self-complementary d(GGGAAXYTTCCC) containing deoxyinosine and other mismatched bases. , 1986, Nucleic acids research.

[3]  F. Seela,et al.  Improvement of the dideoxy chain termination method of DNA sequencing by use of deoxy-7-deazaguanosine triphosphate in place of dGTP. , 1986, Nucleic acids research.

[4]  C. Richardson,et al.  DNA sequence analysis with a modified bacteriophage T7 DNA polymerase. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[5]  L. McConlogue,et al.  Structure-independent DNA amplification by PCR using 7-deaza-2'-deoxyguanosine. , 1988, Nucleic acids research.

[6]  G. Sarkar,et al.  Specific amplification with PCR of a refractory segment of genomic DNA. , 1988, Nucleic acids research.

[7]  L. Loeb,et al.  Direct selection of mutations in the human mitochondrial tRNAThr gene: reversion of an 'uncloneable' phenotype. , 1988, Mutation research.

[8]  S. Leung,et al.  Point mutational analysis of the human c-fos serum response factor binding site. , 1989, Nucleic acids research.

[9]  R. Rachubinski,et al.  Incorporation of 7-deaza dGTP during the amplification step in the polymerase chain reaction procedure improves subsequent DNA sequencing. , 1990, DNA sequence : the journal of DNA sequencing and mapping.

[10]  G. Hong,et al.  Sequencing with the large fragment of DNA polymerase I from Bacillus stearothermophilus. , 1991, DNA sequence : the journal of DNA sequencing and mapping.

[11]  H. Manor,et al.  Formation of DNA triplexes accounts for arrests of DNA synthesis at d(TC)n and d(GA)n tracts. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[12]  D. Tautz,et al.  Slippage synthesis of simple sequence DNA. , 1992, Nucleic acids research.

[13]  P. Marynen,et al.  Incorporation of dITP or 7-deaza dGTP during PCR improves sequencing of the product. , 1993, Nucleic acids research.

[14]  R. Pless,et al.  Elimination of band compression in sequencing gels by the use of N4-methyl-2'-deoxycytidine 5'-triphosphate. , 1993, Nucleic acids research.

[15]  R. Eritja,et al.  Ionization of bromouracil and fluorouracil stimulates base mispairing frequencies with guanine. , 1993, The Journal of biological chemistry.

[16]  W. D. de Vos,et al.  Efficient random mutagenesis method with adjustable mutation frequency by use of PCR and dITP. , 1993, Nucleic acids research.

[17]  K Varadaraj,et al.  Denaturants or cosolvents improve the specificity of PCR amplification of a G + C-rich DNA using genetically engineered DNA polymerases. , 1994, Gene.

[18]  D. M. Brown,et al.  5-Nitroindole as an universal base analogue. , 1994, Nucleic acids research.

[19]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[20]  M. Donlin,et al.  Mutants affecting nucleotide recognition by T7 DNA polymerase. , 1994, Biochemistry.

[21]  D. Seto,et al.  DMSO resolves certain compressions and signal dropouts in fluorescent dye labeled primer-based DNA sequencing reactions. , 1995, DNA sequence : the journal of DNA sequencing and mapping.

[22]  A. Jackson,et al.  In vitro expansion of GGC:GCC repeats: identification of the preferred strand of expansion. , 1996, Nucleic acids research.

[23]  D. M. Brown,et al.  An approach to random mutagenesis of DNA using mixtures of triphosphate derivatives of nucleoside analogues. , 1996, Journal of molecular biology.

[24]  R. Kuick,et al.  A methylated human 9-kb repetitive sequence on acrocentric chromosomes is homologous to a subtelomeric repeat in chimpanzees. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[25]  P. Gillevet,et al.  Sequencing homopolymer tracts and repetitive elements. , 1996, BioTechniques.

[26]  D. Bergstrom,et al.  Comparison of the base pairing properties of a series of nitroazole nucleobase analogs in the oligodeoxyribonucleotide sequence 5'-d(CGCXAATTYGCG)-3'. , 1997, Nucleic acids research.

[27]  D. M. Brown,et al.  Comparative mutagenicities of N6-methoxy-2,6-diaminopurine and N6-methoxyaminopurine 2'-deoxyribonucleosides and their 5'-triphosphates. , 1998, Nucleic acids research.

[28]  D. M. Brown,et al.  A thymine-like base analogue forms wobble pairs with adenine in a Z-DNA duplex. , 1998, Journal of molecular biology.

[29]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[30]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[31]  B. Weinshenker,et al.  DNA compression caused by an upstream point mutation. , 1998, BioTechniques.

[32]  J. R. Fresco,et al.  Identification by UV resonance Raman spectroscopy of an imino tautomer of 5-hydroxy-2'-deoxycytidine, a powerful base analog transition mutagen with a much higher unfavored tautomer frequency than that of the natural residue 2'-deoxycytidine. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[33]  S. Pääbo,et al.  Improved cycle sequencing of GC-rich templates by a combination of nucleotide analogs. , 2000, BioTechniques.

[34]  H R Garner,et al.  Mix of sequencing technologies for sequence closure: an example. , 2000, BioTechniques.

[35]  F. Seela,et al.  The N(8)-(2'-deoxyribofuranoside) of 8-aza-7-deazaadenine: a universal nucleoside forming specific hydrogen bonds with the four canonical DNA constituents. , 2000, Nucleic acids research.

[36]  B. Roe,et al.  Regions of genomic instability on 22q11 and 11q23 as the etiology for the recurrent constitutional t(11;22). , 2000, Human molecular genetics.

[37]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[38]  D. Loakes,et al.  Survey and summary: The applications of universal DNA base analogues. , 2001, Nucleic acids research.

[39]  S V Razin,et al.  Non-clonability correlates with genomic instability: a case study of a unique DNA region. , 2001, Journal of molecular biology.

[40]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[41]  Matthias Platzer,et al.  Sequence and analysis of chromosome 2 of Dictyostelium discoideum , 2002, Nature.

[42]  C. Schutt,et al.  Novel sulfoxides facilitate GC-rich template amplification. , 2002, BioTechniques.

[43]  Peter Adams,et al.  A simulated annealing algorithm for finding consensus sequences , 2002, Bioinform..

[44]  Jonathan E. Allen,et al.  Genome sequence of the human malaria parasite Plasmodium falciparum , 2002, Nature.

[45]  D. Harris,et al.  Can you bank on GenBank , 2003 .

[46]  A. T. Vasconcelos,et al.  PCR-assisted contig extension: stepwise strategy for bacterial genome closure. , 2003, BioTechniques.

[47]  Peter Adams,et al.  Inferring an Original Sequence from Erroneous Copies: Two Approaches , 2003 .

[48]  Peter Adams,et al.  Inferring an Original Sequence from Erroneous Copies : A Bayesian Approach , 2003, APBC.