A case study in genome-level fragment assembly

MOTIVATION We use the fact of two teams independently sequencing the one megabase genome of Borrelia burgdorferi as an opportunity to study the accuracy of genome-level assembly. RESULTS We compare the results of three different assembly programs (PHRAP, TIGR Assembler, and STROLL) on the DNA fragments used in both the Brookhaven and TIGR sequencing projects. We also describe the algorithms and data structures used in our assembly program STROLL, which was used in the Brookhaven Borrelia project.

[1]  F. Studier,et al.  Ligation of hexamers on hexamer templates to produce primers for cycle sequencing or the polymerase chain reaction. , 1995, Analytical biochemistry.

[2]  Eugene W. Myers,et al.  ReAligner: A Program for Refining DNA Sequence Multi-Alignments , 1997, J. Comput. Biol..

[3]  X. Huang,et al.  An improved sequence assembly program. , 1996, Genomics.

[4]  R. Staden,et al.  A sequence assembly and editing program for efficient management of large projects. , 1991, Nucleic acids research.

[5]  F. Studier,et al.  DNA sequencing by primer walking with strings of contiguous hexamers. , 1992, Science.

[6]  Eugene W. Myers,et al.  Suffix arrays: a new method for on-line string searches , 1993, SODA '90.

[7]  Owen White,et al.  TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing Projects , 1995 .

[8]  S. Salzberg,et al.  Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi , 1997, Nature.

[9]  Eugene W. Myers,et al.  An Interface for a Fragment Assembly Kernel , 1996 .

[10]  Eugene W. Myers,et al.  ReAligner: a program for refining DNA sequence multi-alignments , 1997, RECOMB '97.

[11]  Steven Skiena,et al.  Trie-Based Data Structures for Sequence Assembly , 1997, CPM.

[12]  F. Studier,et al.  A strategy for high-volume sequencing of cosmid DNAs: random and directed priming with a library of oligonucleotides. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Michael S. Waterman,et al.  Introduction to computational biology , 1995 .