Approximation algorithms of non‐unique probes selection for biological target identification

Non‐unique probes are used to identify the targets, i.e., viruses, present in a given sample. Since the number of selected non‐unique probes is equal to the number of hybridization experiments, it is important to find a minimum set of non‐unique probes, which is NP‐complete. Using d‐disjunct matrix, we present two (1+(d+1)logn)‐approximation algorithms to identify at most d targets. Based on our selected non‐unique probes, we also present the decoding algorithms with linear time complexity. In addition, our solutions are fault tolerant. The proposed algorithms can identify at most d targets in the presence of experimental errors.

[1]  Vijay V. Vazirani,et al.  Approximation Algorithms , 2001, Springer Berlin Heidelberg.

[2]  Sven Rahmann Rapid Large-Scale Oligonucleotide Selection for Microarrays , 2002, WABI.

[3]  Sven Rahmann Fast and sensitive probe selection for DNA chips using jumps in matching statistics , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[4]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[5]  Alexander Schliep,et al.  Group testing with DNA chips: generating designs and decoding experiments , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.