A Quantile Method for Sizing Optical Maps

Optical mapping is an integrated system for the analysis of single DNA molecules. It constructs restriction maps (noted as "optical map" ) from individual DNA molecules presented on surfaces after they are imaged by fluorescence microscopy. Because restriction digestion and fluorochrome staining are performed after molecules are mounted, resulting restriction fragments retain their order. Maps of fragment sizes and order are constructed by image processing techniques employing integrated fluorescence intensity measurements. Such analysis, in place of molecular length measurements, obviates need for uniformly elongated molecules, but requires samples containing small fluorescent reference molecules for accurate sizing. Although robust in practice, elimination of internal reference molecules would reduce errors and extend single molecule analysis to other platforms. In this paper, we introduce a new approach that does not use reference molecules for direct estimation of restriction fragment sizes, by the exploitation of the quantiles associated with their expected distribution. We show that this approach is comparable to the current reference-based method as evaluated by map alignment techniques in terms of the rate of placement of optical maps to published sequence.

[1]  Craig A. Stewart,et al.  Introduction to computational biology , 2005 .

[2]  O. White,et al.  Whole-genome shotgun optical mapping of Deinococcus radiodurans. , 1999, Science.

[3]  Yi Yang,et al.  Alignment of Optical Maps , 2005, RECOMB.

[4]  Juan J de Pablo,et al.  A microfluidic system for large DNA molecule arrays. , 2004, Analytical chemistry.

[5]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[6]  Werner A. Stahel,et al.  Robust Statistics: The Approach Based on Influence Functions , 1987 .

[7]  R. Redner,et al.  Mixture densities, maximum likelihood, and the EM algorithm , 1984 .

[8]  Michael S. Waterman,et al.  Algorithms for restriction map comparisons , 1984, Nucleic Acids Res..

[9]  Nicholas H. Putnam,et al.  The Genome of the Diatom Thalassiosira Pseudonana: Ecology, Evolution, and Metabolism , 2004, Science.

[10]  M S Waterman,et al.  The distribution of restriction enzyme sites in Escherichia coli. , 1990, Nucleic acids research.

[11]  G. Grimmett,et al.  Probability and random processes , 2002 .

[12]  A. Cohen Truncated and Censored Samples: Theory and Applications , 1991 .

[13]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[14]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[15]  David C. Schwartz,et al.  Whole-Genome Shotgun Optical Mapping of Rhodospirillum rubrum , 2004, Applied and Environmental Microbiology.

[16]  John Law,et al.  Robust Statistics—The Approach Based on Influence Functions , 1986 .