ODS_BOOTSTRAP: assessing the statistical reliability of physical maps by bootstrap resampling

In the program ODS_BOOTSTRAP we provide a methodology for quickly ordering clones in a genomic library into a physical map and for applying a statistical tool known as the bootstrap to assess the statistical reliability of a clonal ordering. Each clone is assigned a binary fingerprint by one of a variety of experimental approaches to physical mapping. For example, the binary fingerprints might be generated by hybridizing a panel of m probes to a library of n clones. The resulting n x m binary data matrix, X, is input to ODS_BOOTSTRAP, which utilizes the similarity in binary fingerprints of clones to construct a physical map. Under this particular implementation of bootstrap resampling, the m probes (or columns of the data matrix) are sampled randomly with replacement in the computer to generate a new n x m data matrix, X*, from which a second physical map is constructed. The resampling process is repeated 100 or more times to generate 100 or more X* matrices. The resulting 100 or more physical maps are compared with the original physical map based on the original data matrix X by counting how often links in the original physical map reappear. Three confidence statistics are introduced for each link in a physical map. The statistic C1 is defined as the percentage of time two neighboring clones on the original map reappear as neighbors under resampling. The statistic C2 is defined as the percentage of time that two neighboring clones i and j on the original map reappear as neighbors or that a clone with an identical binary fingerprint to clone i reappears as a neighbor to clone j. The statistic C3 is defined as the percentage of time that two neighboring clones on the original map reappear in the same contig under resampling.

[1]  A. Zharkikh,et al.  Statistical properties of bootstrap estimation of phylogenetic variability from nucleotide sequences. I. Four taxa with a molecular clock. , 1992, Molecular biology and evolution.

[2]  V Sgaramella,et al.  An ordered collection of Bacillus subtilis DNA segments cloned in yeast artificial chromosomes. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[3]  S T Cole,et al.  Use of an ordered cosmid library to deduce the genomic organization of Mycobacterium leprae , 1993, Molecular microbiology.

[4]  Hans Lehrach,et al.  High resolution cosmid and P1 maps spanning the 14 Mb genome of the fission yeast S. pombe , 1993, Cell.

[5]  L. Hood,et al.  A common language for physical mapping of the human genome. , 1989, Science.

[6]  A. Cuticchia,et al.  Chromosome-specific recombinant DNA libraries from the fungus Aspergillus nidulans. , 1991, Nucleic acids research.

[7]  A. Cuticchia,et al.  The use of simulated annealing in chromosome reconstruction experiments based on binary scoring. , 1992, Genetics.

[8]  Carol Soderlund,et al.  GRAM and genfragII: solving and testing the single-digest, partially ordered restriction map problem , 1994, Comput. Appl. Biosci..

[9]  Joseph Felsenstein,et al.  Is there something wrong with the bootstrap on phylogenies? A reply to Hillis and Bull , 1993 .

[10]  J. Felsenstein CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP , 1985, Evolution; international journal of organic evolution.

[11]  S. Hedges The number of replications needed for accurate estimation of the bootstrap P value in phylogenetic studies. , 1992, Molecular biology and evolution.

[12]  A. Coulson,et al.  Toward a physical map of the genome of the nematode Caenorhabditis elegans. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[13]  D. Page,et al.  The human Y chromosome: overlapping DNA clones spanning the euchromatic region. , 1992, Science.

[14]  S. Brenner,et al.  Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genome , 1993, Nature.

[15]  E. Lander,et al.  Genomic mapping by anchoring random clones: a mathematical analysis. , 1991, Genomics.

[16]  Michael J Sanderson,et al.  CONFIDENCE LIMITS ON PHYLOGENIES: THE BOOTSTRAP REVISITED , 1989, Cladistics : the international journal of the Willi Hennig Society.

[17]  G. Wahl,et al.  Cosmid vectors for rapid genomic walking, restriction mapping, and gene transfer. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[18]  M. Olson,et al.  Chromosomal region of the cystic fibrosis gene in yeast artificial chromosomes: a model for human genome mapping. , 1990, Science.

[19]  J. Griffith,et al.  On the consistency of a physical mapping method to reconstruct a chromosome in vitro. , 1996, Genetics.

[20]  Y. Fu,et al.  On the design of genome mapping experiments using short synthetic oligonucleotides. , 1992, Biometrics.

[21]  J Griffith,et al.  A fast random cost algorithm for physical mapping. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[22]  K. Isono,et al.  The physical map of the whole E. coli chromosome: Application of a new strategy for rapid analysis and sorting of a large genomic library , 1987, Cell.

[23]  D. Hartl,et al.  Optimized strategies for sequence-tagged-site selection in genome mapping. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[24]  W. Press,et al.  Numerical Recipes: The Art of Scientific Computing , 1987 .

[25]  M. Ashburner,et al.  Toward cloning and mapping the genome of Drosophila. , 1991, Science.

[26]  B. Dujon,et al.  The complete DNA sequence of yeast chromosome III , 1992, Nature.

[27]  M. Waterman,et al.  The accuracy of DNA sequences: estimating sequence quality. , 1992, Genomics.

[28]  Jonathan Arnold,et al.  ODS: ordering DNA sequences - a physical mapping algorithm based on simulated annealing , 1993, Comput. Appl. Biosci..

[29]  Peisen Zhang,et al.  An algorithm based on graph theory for the assembly of contigs in physical mapping of DNA , 1994, Comput. Appl. Biosci..

[30]  J. Weissenbach,et al.  A first-generation physical map of the human genome , 1993, Nature.

[31]  T. Marr,et al.  A 13 kb resolution cosmid map of the 14 Mb fission yeast genome by nonrandom sequence-tagged site mapping , 1993, Cell.

[32]  T G Marr,et al.  Genome mapping by nonrandom anchoring: a discrete theoretical analysis. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[33]  J. Bull,et al.  An Empirical Test of Bootstrapping as a Method for Assessing Confidence in Phylogenetic Analysis , 1993 .

[34]  Stephanie Forrest,et al.  Genetic Algorithms for DNA Sequence Assembly , 1993, ISMB.