Generating a Genome Assembly with PCAP

This unit describes how to use the Parallel Contig Assembly Program (PCAP) to assemble the data produced by a whole‐genome shotgun sequencing project. We present a basic protocol for using PCAP on a multiprocessor computer in a 300‐Mb genome assembly project. A support protocol to prepare input files for PCAP is also described. Another basic protocol for using PCAP on a distributed cluster of computers in a 3‐Gb genome assembly project is presented, in addition to suggestions for understanding results from PCAP.

[1]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[2]  J. Mullikin,et al.  The phusion assembler. , 2003, Genome research.

[3]  G. Weinstock,et al.  The Atlas genome assembly system. , 2004, Genome research.

[4]  Eugene W. Myers,et al.  A whole-genome assembly of Drosophila. , 2000, Science.

[5]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[6]  S. B. Needleman,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 1989 .

[7]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[8]  Paramvir S. Dehal,et al.  Whole-Genome Shotgun Assembly and Analysis of the Genome of Fugu rubripes , 2002, Science.

[9]  E. Mauceli,et al.  Whole-genome sequence assembly for mammalian genomes: Arachne 2. , 2003, Genome research.

[10]  X. Huang,et al.  CAP3: A DNA sequence assembly program. , 1999, Genome research.

[11]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[12]  L. Hillier,et al.  PCAP: a whole-genome assembly program. , 2003, Genome research.

[13]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.