Sequence-based physical mapping of complex genomes by whole genome profiling.

We present whole genome profiling (WGP), a novel next-generation sequencing-based physical mapping technology for construction of bacterial artificial chromosome (BAC) contigs of complex genomes, using Arabidopsis thaliana as an example. WGP leverages short read sequences derived from restriction fragments of two-dimensionally pooled BAC clones to generate sequence tags. These sequence tags are assigned to individual BAC clones, followed by assembly of BAC contigs based on shared regions containing identical sequence tags. Following in silico analysis of WGP sequence tags and simulation of a map of Arabidopsis chromosome 4 and maize, a WGP map of Arabidopsis thaliana ecotype Columbia was constructed de novo using a six-genome equivalent BAC library. Validation of the WGP map using the Columbia reference sequence confirmed that 350 BAC contigs (98%) were assembled correctly, spanning 97% of the 102-Mb calculated genome coverage. We demonstrate that WGP maps can also be generated for more complex plant genomes and will serve as excellent scaffolds to anchor genetic linkage maps and integrate whole genome sequence data.

[1]  J. Lupski,et al.  The complete genome of an individual by massively parallel DNA sequencing , 2008, Nature.

[2]  B. Birren,et al.  Cloning and stable maintenance of 300-kilobase-pair fragments of human DNA in Escherichia coli using an F-factor-based vector. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[3]  J. Bakker,et al.  Genetic and physical mapping of homologues of the virus resistance gene Rx1 and the cyst nematode resistance gene Gpa2 in potato , 2003, Theoretical and Applied Genetics.

[4]  J E Mullet,et al.  A high-throughput AFLP-based method for constructing integrated genetic and physical maps: progress toward a sorghum genome map. , 2000, Genome research.

[5]  P. Nimmakayala,et al.  Construction and characterization of a soybean bacterial artificial chromosome library and use of multiple complementary libraries for genome physical mapping , 2004, Theoretical and Applied Genetics.

[6]  H. Malke Genetic and Physical Mapping. : Kay E. Davies and Shirley M. Tilghman (Editors). Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1990, ISBN 0-87969-358-4, viii + 189 pp., $40. , 1993 .

[7]  Aleksandar Milosavljevic,et al.  Pooled genomic indexing of rhesus macaque. , 2005, Genome research.

[8]  Carolyn Thomas,et al.  High-throughput fingerprinting of bacterial artificial chromosomes using the snapshot labeling kit and sizing of restriction fragments by capillary electrophoresis. , 2003, Genomics.

[9]  Gabor T. Marth,et al.  Whole-genome sequencing and variant discovery in C. elegans , 2008, Nature Methods.

[10]  James R. Knight,et al.  De Novo Next Generation Sequencing of Plant Genomes , 2009, Rice.

[11]  D. Baulcombe,et al.  Homologues of a single resistance-gene cluster in potato confer resistance to distinct pathogens: a virus and a nematode. , 2000, The Plant journal : for cell and molecular biology.

[12]  Peter A. Meric,et al.  Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse , 2009, PLoS biology.

[13]  Steven G. Schroeder,et al.  Physical and Genetic Structure of the Maize Genome Reflects Its Complex Evolutionary History , 2007, PLoS genetics.

[14]  Takuji Sasaki,et al.  The map-based sequence of the rice genome , 2005, Nature.

[15]  Carol Soderlund,et al.  FPC: a system for building contigs from restriction fingerprinted clones , 1997, Comput. Appl. Biosci..

[16]  Galina Fuks,et al.  Whole-Genome Validation of High-Information-Content Fingerprinting1 , 2005, Plant Physiology.

[17]  Niall J. Haslam,et al.  An analysis of the feasibility of short read sequencing , 2005, Nucleic acids research.

[18]  D. Bentley,et al.  Genome mapping by fluorescent fingerprinting. , 1997, Genome research.

[19]  David C. Schwartz,et al.  A Single Molecule Scaffold for the Maize Genome , 2009, PLoS genetics.

[20]  R. Gibbs,et al.  A clone-array pooled shotgun strategy for sequencing large genomes. , 2001, Genome research.

[21]  K. Chin,et al.  End-sequence profiling: Sequence-based analysis of aberrant genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[22]  D. Schwartz,et al.  Ordered restriction maps of Saccharomyces cerevisiae chromosomes constructed by optical mapping. , 1993, Science.

[23]  T. Borm Construction and use of a physical map of potato , 2008 .

[24]  G. Gort,et al.  Significance Tests and Weighted Values for AFLP Similarities, Based on Arabidopsis in Silico AFLP Fragment Length Distributions , 2004, Genetics.

[25]  Carol Soderlund,et al.  Integrating sequence with FPC fingerprint maps , 2009, Nucleic acids research.

[26]  P. Vos,et al.  AFLP: a new technique for DNA fingerprinting. , 1995, Nucleic acids research.

[27]  R. Wilson,et al.  High throughput fingerprint analysis of large-insert clones. , 1997, Genome research.

[28]  P. Etter,et al.  Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers , 2008, PloS one.

[29]  W. Sinz,et al.  An integrated physical and genetic map of the nematode Pristionchus pacificus , 2003, Molecular Genetics and Genomics.

[30]  Mark J. P. Chaisson,et al.  De novo fragment assembly with short mate-paired reads: Does the read length matter? , 2009, Genome research.

[31]  Stephen J O'Brien,et al.  Every genome sequence needs a good map. , 2009, Genome research.

[32]  D. Marshall,et al.  Computational and experimental characterization of physically clustered simple sequence repeats in plants. , 2000, Genetics.

[33]  Joshua M. Korn,et al.  Mapping and sequencing of structural variation from eight human genomes , 2008, Nature.

[34]  J. Sambrook,et al.  Molecular Cloning: A Laboratory Manual , 2001 .