Positional sequencing by hybridization

Sequencing by hybridization (SBH) is a promising alternative to the classical DNA sequencing approaches. However, the resolving power of SBH is rather low: with 64kb sequencing chips, unknown DNA fragments only as long as 200 bp can be reconstructed in a single SBH experiment. To improve the resolving power of SBH, positional SBH (PSBH) has recently been suggested; this allows (with additional experimental work) approximate positions of every l-tuple in a target DNA fragment to be measured. We study the positional Eulerian path problem motivated by PSBH. The input to the positional eulerian path problem is an Eulerian graph G(V, E) in which every edge has an associated range of integers and the problem is to find an Eulerian path e1,...,e/E/ in G such that the range of ei contains i. We show that the positional Eulerian path problem is NP-complete even when the maximum out-degree (in-degree) of any vertex in the graph is 2. On a positive note we present polynomial algorithms to solve a special case of PSBH (bounded PSBH), where the range of the allowed positions for any edge is bounded by a constant (it corresponds to accurate experimental measurements of positions in PSBH). Moreover, if the positions of every l-tuple in an unknown DNA fragment of length n are measured with O(log n) error, then our algorithm runs in polynomial time. We also present an estimate of the resolving power of PSBH for a more realistic case when positions are measured with theta (n) error.

[1]  C R Cantor,et al.  Enhanced DNA sequencing by hybridization. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Pavel A. Pevzner,et al.  Towards DNA Sequencing Chips , 1994, MFCS.

[3]  A D Mirzabekov,et al.  [DNA sequencing by hybridization with oligonucleotides immobilized in a gel. Chemical ligation as a method of expanding the prospects for the method]. , 1994, Molekuliarnaia biologiia.

[4]  Ján Plesník,et al.  The NP-Completeness of the Hamiltonian Cycle Problem in Planar Digraphs with Degree Bound Two , 1979, Inf. Process. Lett..

[5]  R. Drmanac,et al.  Sequencing of megabase plus DNA by hybridization: theory of the method. , 1989, Genomics.

[6]  F. Kramer,et al.  Sequencing of pools of nucleic acids on oligonucleotide arrays. , 1993, Bio Systems.

[7]  W. Bains,et al.  A novel method for nucleic acid sequence determination. , 1988, Journal of theoretical biology.

[8]  Steven Skiena,et al.  Reconstructing strings from substrings in rounds , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[9]  Frank Harary,et al.  Graph Theory , 2016 .

[10]  OLEG RAZGULYAEV,et al.  Sequencing Potential of Nested Strand Hybridization , 1995, J. Comput. Biol..

[11]  A. A. Chernyi,et al.  DNA sequencing by hybridization to oligonucleotide matrix. Calculation of continuous stacking hybridization efficiency. , 1994, Journal of biomolecular structure & dynamics.

[12]  P. Pevzner 1-Tuple DNA sequencing: computer analysis. , 1989, Journal of biomolecular structure & dynamics.

[13]  P. Pevzner,et al.  Improved chips for sequencing by hybridization. , 1991, Journal of biomolecular structure & dynamics.

[14]  Mikhail S. Gelfand,et al.  Reconstruction of a String from Substring Precedence Data , 1995, J. Comput. Biol..

[15]  K. Khrapko,et al.  An oligonucleotide hybridization approach to DNA sequencing , 1989, FEBS letters.

[16]  Fred Russell Kramer,et al.  Oligonucleotide Arrays: New Concepts and Possibilities , 1994, Bio/Technology.

[17]  Lysov YuP,et al.  A method for DNA sequencing by hybridization with oligonucleotide matrix. , 1991, DNA sequence : the journal of DNA sequencing and mapping.