DNA codes consisting of DNA sequences are necessary for DNA computing. The minimum distance parameter of such codes is a measure of how dissimilar the codewords are, and thus is indirectly a measure of the likelihood of undetectedable or uncorrectable errors occurring during hybridization. To compute distance, an abstract metric, for example, longest common subsequence, must be used to model the actual bonding energies of DNA strands. In this paper we continue the development [1,2,3] of similarity functions for q-ary n-sequences The theoretical lower bound on the maximal possible size of codes, built on the space endowed with this metric, is obtained. that can be used (for q = 4) to model a thermodynamic similarity on DNA sequences. We introduce the concept of a stem similarity function and discuss DNA codes [2] based on the stem similarity. We suggest an optimal construction [2] and obtain random coding bounds on the maximum size and rate for such codes.
[1]
Arkadii G. D'yachkov,et al.
A Weighted Insertion-Deletion Stacked Pair Thermodynamic Metric for DNA Codes
,
2004,
DNA.
[2]
Péter L. Erdös,et al.
Exordium for DNA Codes
,
2003,
J. Comb. Optim..
[3]
Arkadii G. D'yachkov,et al.
On DNA Codes
,
2005,
Probl. Inf. Transm..
[4]
Vladimir I. Levenshtein,et al.
Efficient Reconstruction of Sequences from Their Subsequences or Supersequences
,
2001,
J. Comb. Theory A.
[5]
Vladimir I. Levenshtein,et al.
Binary codes capable of correcting deletions, insertions, and reversals
,
1965
.