Automated detection of point mutations using fluorescent sequence trace subtraction.

The final step in the detection of mutations is to determine the sequence of the suspected mutant and to compare it with that of the wild-type, and for this fluorescence-based sequencing instruments are widely used. We describe some simple algorithms forcomparing sequence traces which, as part of our sequence assembly and analysis package, are proving useful for the discovery of mutations and which may also help to identify misplaced readings in sequence assembly projects. The mutations can be detected automatically by a new program called TRACE_DIFF and new types of trace display in our program GAP4 greatly simplify visual checking of the assigned changes. To assess the accuracy of the automatic mutation detection algorithm we analysed 214 sequence readings from hypermutating DNA comprising a total of 108 497 bases. After the readings were assembled there were 1232 base differences, including 392 Ns and 166 alignment characters. Visual inspection of the traces established that of the 1232 differences, 353 were real mutations while the rest were due to base calling errors. The TRACE_DIFF algorithm automatically identified all but 36, with 28 false positives. Further information about the software can be obtained from http://www.mrc-lmb.cam.ac.uk/pubseq/

[1]  E. Rappaport,et al.  Systematic use of automated fluorescence‐based sequence analysis of amplified genomic DNA for rapid detection of point mutations , 1994, American journal of hematology.

[2]  M. Grompe,et al.  The rapid detection of unknown mutations in nucleic acids , 1993, Nature genetics.

[3]  R. Carrell,et al.  Hydrolink gels: a rapid and simple approach to the detection of DNA mutations in thromboembolic disease. , 1992, Journal of clinical pathology.

[4]  S. Braun,et al.  Molecular diagnosis of mucopolysaccharidosis type II (Hunter syndrome) by automated sequencing and computer-assisted interpretation: toward mutation mapping of the iduronate-2-sulfatase gene. , 1995, American journal of human genetics.

[5]  R. Quatrano Genomics , 1998, Plant Cell.

[6]  T. Sekiya,et al.  Detection of polymorphisms of human DNA by gel electrophoresis as single-strand conformation polymorphisms. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[7]  R Staden,et al.  The application of numerical estimates of base calling accuracy to DNA sequencing projects. , 1995, Nucleic acids research.

[8]  R. Myers,et al.  Detection and localization of single base changes by denaturing gradient gel electrophoresis. , 1987, Methods in enzymology.

[9]  C. Milstein,et al.  Somatic hypermutation of immunoglobulin kappa may depend on sequences 3′ of C kappa and occurs on passenger transgenes. , 1991, The EMBO journal.

[10]  C. Milstein,et al.  Analysis of somatic hypermutation in mouse Peyer's patches using immunoglobulin kappa light-chain transgenes. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[11]  D. Nickerson,et al.  PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing. , 1997, Nucleic acids research.

[12]  S. O’Brien,et al.  Detecting single base substitutions as heteroduplex polymorphisms. , 1992, Genomics.

[13]  R. D. Campbell,et al.  Reactivity of cytosine and thymine in single-base-pair mismatches with hydroxylamine and osmium tetroxide and its application to the study of mutations. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[14]  C. Milstein,et al.  The 5′ hypermutation boundary of x chains is independent of local and neighbouring sequences and related to the distance from the initiation of transcription , 1997, European journal of immunology.

[15]  D. Nickerson,et al.  Comparative analysis of human DNA variations by fluorescence-based sequencing of PCR products. , 1994, Genomics.

[16]  J. Bonfield,et al.  Experiment files and their application during large-scale sequencing projects. , 1996, DNA sequence : the journal of DNA sequencing and mapping.

[17]  R. Staden,et al.  A standard file format for data from DNA sequencing instruments. , 1992, DNA sequence : the journal of DNA sequencing and mapping.

[18]  J. Bonfield,et al.  A new DNA sequence assembly program. , 1995, Nucleic acids research.