Sequencing by Hybridization with the Generic 6-mer Oligonucleotide Microarray: An Advanced Scheme for Data Processing

Abstract DNA sequencing by hybridization was carried out with a microarray of all 46 = 4,096 hexadeoxyribonucleotides (the generic microchip). The oligonucleotides immobilized in 100 × 100 × 20µm polyacrylamide gel pads of the generic microchip were hybridized with fluorescently labeled ssDNA, providing perfect and mismatched duplexes. Melting curves were measured in parallel for all microchip duplexes with a fluorescence microscope equipped with CCD camera. This allowed us to discriminate the perfect duplexes formed by the oligonucleotides, which are complementary to the target DNA. The DNA sequence was reconstructed by overlapping the complementary oligonucleotide probes. We developed a data processing scheme to heighten the discrimination of perfect duplexes from mismatched ones. The procedure was united with a reconstruction of the DNA sequence. The scheme includes the proper definition of a discriminant signal, preprocessing, and the variational principle for the sequence indicator function. The effectiveness of the procedure was confirmed by sequencing, proofreading, and nucleotide polymorphism (mutation) analysis of 13 DNA fragments from 31 to 70 nucleotides long.

[1]  R. Drmanac,et al.  An algorithm for the DNA sequence generation from k-tuple word contents of the minimal number of random fragments. , 1991, Journal of biomolecular structure & dynamics.

[2]  D. Sen,et al.  Guanine quartet structures. , 1992, Methods in enzymology.

[3]  Eugene L. Lawler,et al.  Traveling Salesman Problem , 2016 .

[4]  Lysov YuP,et al.  A method for DNA sequencing by hybridization with oligonucleotide matrix. , 1991, DNA sequence : the journal of DNA sequencing and mapping.

[5]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[6]  John Eccleston,et al.  Statistics and Computing , 2006 .

[7]  A. Mirzabekov,et al.  Massive parallel analysis of DNA-Hoechst 33258 binding specificity with a generic oligodeoxyribonucleotide microchip. , 1999, Nucleic acids research.

[8]  J. R. Williamson,et al.  G-quartet structures in telomeric DNA. , 1994, Annual review of biophysics and biomolecular structure.

[9]  M. D. Morris,et al.  Optical Melting of 128 Octamer DNA Duplexes , 1995, The Journal of Biological Chemistry.

[10]  K. Khrapko,et al.  An oligonucleotide hybridization approach to DNA sequencing , 1989, FEBS letters.

[11]  W. Bains,et al.  A novel method for nucleic acid sequence determination. , 1988, Journal of theoretical biology.

[12]  Shyang Chang,et al.  A shortest path algorithm for a nonrotating object among obstacles of arbitrary shapes , 1993, IEEE Trans. Syst. Man Cybern..

[13]  William H. Press,et al.  Numerical recipes , 1990 .

[14]  Anil K. Jain,et al.  Neural networks and pattern recognition , 1994 .

[15]  A. Mirzabekov,et al.  Chemical methods of DNA and RNA fluorescent labeling. , 1996, Nucleic acids research.

[16]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[17]  D. Stern,et al.  Thermodynamics of Duplex Formation and Mismatch Discrimination on Photolithographically Synthesized Oligonucleotide Arrays , 1997 .

[18]  A. A. Chernyi,et al.  DNA sequencing by hybridization to oligonucleotide matrix. Calculation of continuous stacking hybridization efficiency. , 1994, Journal of biomolecular structure & dynamics.

[19]  A. Mirzabekov,et al.  DNA analysis and diagnostics on oligonucleotide microchips. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[20]  John W. Sammon,et al.  An Optimal Set of Discriminant Vectors , 1975, IEEE Transactions on Computers.

[21]  A. A. Chernyi,et al.  Efficiency of sequencing by hybridization on oligonucleotide matrix supplemented by measurement of the distance between DNA segments. , 1996, DNA sequence : the journal of DNA sequencing and mapping.

[22]  R. Drmanac,et al.  Sequencing of megabase plus DNA by hybridization: theory of the method. , 1989, Genomics.

[23]  P. Pevzner 1-Tuple DNA sequencing: computer analysis. , 1989, Journal of biomolecular structure & dynamics.

[24]  A. Mirzabekov,et al.  Manual manufacturing of oligonucleotide, DNA, and protein microchips. , 1997, Analytical biochemistry.

[25]  N L Harris,et al.  Splice junctions, branch point sites, and exons: sequence statistics, identification, and applications to genome project. , 1990, Methods in enzymology.

[26]  R. Lipshutz,et al.  Likelihood DNA sequencing by hybridization. , 1993, Journal of biomolecular structure & dynamics.

[27]  E. Southern,et al.  Analyzing and comparing nucleic acid sequences by hybridization to arrays of oligonucleotides: evaluation using experimental models. , 1992, Genomics.

[28]  Omid Omidvar,et al.  Neural Networks and Pattern Recognition , 1997 .

[29]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[30]  K. Gunderson,et al.  Mutation detection by ligation to complete n-mer DNA arrays. , 1998, Genome research.

[31]  A. Mirzabekov,et al.  Parallel thermodynamic analysis of duplexes on oligodeoxyribonucleotide microchips. , 1998, Nucleic acids research.

[32]  N. L. Johnson,et al.  Linear Statistical Inference and Its Applications , 1966 .

[33]  Jürgen Schürmann,et al.  Pattern classification , 2008 .