A Tool for Analyzing Mate Pairs in Assemblies (TAMPA)

The current generation of genome assembly programs uses the distance and orientation relationships between paired-end reads of clones (mate pairs) to order and orient contigs. Mate pair data can also be used to evaluate and compare assemblies after the fact. Earlier work employed a simple heuristic to detect assembly problems: scanning across an assembly to locate peak concentrations of unsatisfied mate pairs. TAMPA is a novel, computational geometry-based approach that detects assembly breakpoints by exploiting the constraints that mate pairs impose on each other. The method can be used to improve assemblies and, where two assemblies disagree in sequence, to determine which one is correct. Results from several human genome assemblies are presented.
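As a rough illustration of the earlier scanning heuristic mentioned above (not TAMPA's geometric method), the following Python sketch marks a mate pair as unsatisfied when its reads are misoriented or its observed separation falls outside the library's expected range, then counts unsatisfied pairs per fixed-size window. The `MatePair` fields, the three-standard-deviation tolerance, the window size, and the function names are all assumptions chosen for the example, not details taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MatePair:
    # Hypothetical record layout: coordinates are positions in the assembly.
    left: int             # position of the leftmost read
    right: int            # position of the rightmost read
    left_forward: bool    # True if the leftmost read lies on the forward strand
    right_forward: bool   # True if the rightmost read lies on the forward strand
    mean: float           # expected clone length for this library
    stddev: float         # standard deviation of the clone length


def is_satisfied(mp, k=3.0):
    """A pair is satisfied if its reads face each other and the observed
    separation is within k standard deviations of the library mean
    (k = 3 is an assumed tolerance)."""
    oriented = mp.left_forward and not mp.right_forward
    distance = mp.right - mp.left
    return oriented and abs(distance - mp.mean) <= k * mp.stddev


def unsatisfied_depth(pairs, length, window=1000):
    """Count, for each window of the assembly, the unsatisfied pairs spanning it."""
    n_windows = (length + window - 1) // window
    depth = [0] * n_windows
    for mp in pairs:
        if is_satisfied(mp):
            continue
        lo = max(0, min(mp.left, mp.right)) // window
        hi = min(length - 1, max(mp.left, mp.right)) // window
        for w in range(lo, hi + 1):
            depth[w] += 1
    return depth


def peak_windows(depth, threshold):
    """Return indices of windows whose unsatisfied-pair count reaches the threshold."""
    return [i for i, d in enumerate(depth) if d >= threshold]


pairs = [
    MatePair(100, 2100, True, False, 2000, 200),   # satisfied
    MatePair(3000, 9000, True, False, 2000, 200),  # stretched: too far apart
    MatePair(4000, 6000, True, True, 2000, 200),   # misoriented
]
depth = unsatisfied_depth(pairs, length=10000, window=1000)
suspect = peak_windows(depth, threshold=2)
```

A scan of this kind only flags regions where unsatisfied pairs pile up; it does not use the way pairs constrain one another, which is the information the abstract says TAMPA exploits.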
