An algorithm has been developed for determining nucleotide sequence from data produced by fluorescence-based automated DNA sequencing instruments that employ the four-color strategy. The algorithm uses object-oriented programming techniques for modularity and extensibility. It is adaptive: data sets from a wide variety of instruments and sequencing conditions can be processed with good results. Each base call is assigned a confidence value as an estimate of its accuracy. To identify peaks accurately, the algorithm iteratively draws on confidence determinations from several different modules, each of which examines a different feature of the data. Modules can be added or removed to improve performance or to adapt the system to a different task. In comparisons with commercial software, the algorithm performed well.
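The modular, confidence-based design described above can be illustrated with a minimal sketch. This is not the published implementation. The module names (`height_module`, `shape_module`), the product rule for combining confidences, the `threshold` parameter, and the toy trace are all assumptions made for illustration. The sketch makes a single pass and omits the iterative refinement the abstract mentions.

```python
from dataclasses import dataclass
from math import prod
from typing import Callable, Dict, List, Sequence

BASES = "ACGT"
Trace = Dict[str, Sequence[float]]          # one intensity series per dye channel
Module = Callable[[Trace, int, str], float]  # returns a confidence in [0, 1]


def height_module(trace: Trace, pos: int, base: str) -> float:
    """Hypothetical module: share of total signal at `pos` in this channel."""
    total = sum(trace[b][pos] for b in BASES)
    return trace[base][pos] / total if total > 0 else 0.0


def shape_module(trace: Trace, pos: int, base: str) -> float:
    """Hypothetical module: 1.0 if `pos` is a local maximum in this channel."""
    s = trace[base]
    if pos == 0 or pos == len(s) - 1:
        return 0.0
    return 1.0 if s[pos - 1] < s[pos] >= s[pos + 1] else 0.0


@dataclass
class BaseCall:
    position: int
    base: str
    confidence: float


def call_bases(trace: Trace, modules: List[Module],
               threshold: float = 0.5) -> List[BaseCall]:
    """Score every (position, base) pair with all modules and keep confident calls.

    Confidences are combined by product (an assumption), so any single
    module can veto a candidate by returning 0.
    """
    calls = []
    for pos in range(len(trace["A"])):
        best = max(
            (BaseCall(pos, b, prod(m(trace, pos, b) for m in modules))
             for b in BASES),
            key=lambda c: c.confidence,
        )
        if best.confidence >= threshold:
            calls.append(best)
    return calls


# Toy four-channel trace with two peaks (made-up numbers).
trace = {
    "A": [0, 10, 0, 0, 1, 0, 0],
    "C": [0, 0, 0, 0, 8, 0, 0],
    "G": [0, 0, 0, 0, 0, 0, 0],
    "T": [0, 1, 0, 0, 0, 0, 0],
}
calls = call_bases(trace, [height_module, shape_module])
sequence = "".join(c.base for c in calls)
```

Because each module is just a function with a common signature, adding or removing a feature check means editing the list passed to `call_bases`, which mirrors the extensibility the abstract describes.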