Accurate method for fast design of diagnostic oligonucleotide probe sets for DNA microarrays

We present a method for the automatic generation of oligonucleotide probe sets for DNA microarrays. This approach is well suited particularly for specificity evaluation of designed probes in large data sets. Algorithms for probe preselection, hybridization prediction and probe selection are presented. Combinatorial techniques are introduced for the selection of probe sets of high differentiation capability even from sequence databases of homologous conserved genes. These techniques include the automatic generation of group specific probes and the design of excluding probes. A basic prototype was implemented including a shared memory parallelization and distribution. The principal applicability of our method to a database of very conserved sequence data was shown and the run-time performance estimated.

[1]  F. Brockman,et al.  Direct Detection of 16S rRNA in Soil Extracts by Using Oligonucleotide Microarrays , 2001, Applied and Environmental Microbiology.

[2]  M. Eisen,et al.  Gene expression informatics —it's all in your mine , 1999, Nature Genetics.

[3]  Kevin Truong,et al.  Identification and characterization of subfamily-specific signatures in a large protein superfamily by a hidden Markov model approach , 2002, BMC Bioinformatics.

[4]  Antoine de Saizieu,et al.  Bacterial transcript imaging by hybridization of total RNA to oligonucleotide arrays , 1998, Nature Biotechnology.

[5]  Diethard Tautz,et al.  An algorithm and program for finding sequence specific oligo-nucleotide probes for species identification , 2002, BMC Bioinformatics.

[6]  G. L. French,et al.  Rapid Diagnosis of Bacteremia by Universal Amplification of 23S Ribosomal DNA Followed by Hybridization to an Oligonucleotide Array , 2000, Journal of Clinical Microbiology.

[7]  Michael Wagner,et al.  probeBase: an online resource for rRNA-targeted oligonucleotide probes , 2003, Nucleic Acids Res..

[8]  Kai Hwang,et al.  Advanced computer architecture - parallelism, scalability, programmability , 1992 .

[9]  K. Schleifer,et al.  ARB: a software environment for sequence data. , 2004, Nucleic acids research.

[10]  K. Schleifer,et al.  Oligonucleotide Microarray for 16S rRNA Gene-Based Detection of All Recognized Lineages of Sulfate-Reducing Prokaryotes in the Environment , 2002, Applied and Environmental Microbiology.

[11]  J. Fry,et al.  PRIMROSE: a computer program for generating and estimating the phylogenetic range of 16S rRNA oligonucleotide probes and primers in conjunction with the RDP-II database. , 2002, Nucleic acids research.

[12]  Jean-Marie Rouillard,et al.  OligoArray: genome-scale oligonucleotide design for microarrays , 2002, Bioinform..

[13]  Daniel B. Oerther,et al.  The oligonucleotide probe database , 1996, Applied and environmental microbiology.

[14]  Alexander Schliep,et al.  Selecting signature oligonucleotides to identify organisms using DNA arrays , 2002, Bioinform..

[15]  J. SantaLucia,et al.  A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Robert Giegerich,et al.  From Ukkonen to McCreight and Weiner: A Unifying View of Linear-Time Suffix Tree Construction , 1997, Algorithmica.

[17]  Sven Rahmann,et al.  Rapid large-scale oligonucleotide selection for microarrays , 2002, Proceedings. IEEE Computer Society Bioinformatics Conference.

[18]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[19]  D. Bowtell,et al.  Options available — from start to finish — for obtaining expression data by microarray , 1999, Nature Genetics.

[20]  Michael Zuker,et al.  Algorithms and Thermodynamics for RNA Secondary Structure Prediction: A Practical Guide , 1999 .

[21]  Marek Chrobak,et al.  Probe selection algorithms with applications in the analysis of microbial communities , 2001, ISMB.