The syntactic monoid of hairpin-free languages

The study of hairpin-free words was initiated in the context of DNA computing. DNA strands, which can be modeled as finite strings over the alphabet {A, G, C, T}, are used in DNA computing to encode information. Because A is complementary to T and G to C, complementary DNA single strands can bind to each other or to themselves in either intended or unintended ways. One structure that is usually undesirable for biocomputation, since it makes the affected DNA strand unavailable for future interactions, is the hairpin: if some subsequences of a DNA single strand are complementary to each other, the strand will bind to itself, forming a hairpin-like structure. This paper continues the theoretical study of hairpin-free languages. We study algebraic properties of hairpin-free words and hairpins. We also give a complete characterization of the syntactic monoid of the language consisting of all hairpin-free words over a given alphabet and illustrate it with an example using the DNA alphabet.
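The hairpin condition described above can be illustrated with a small sketch. The abstract gives no formal definition, so the following rests on assumptions borrowed from the common formalization in the hairpin literature: a word w contains a hairpin with stem length at least k if w = x v y θ(v) z with |v| ≥ k, where θ is taken here to be the Watson-Crick reverse complement. The names `reverse_complement` and `has_hairpin` and the parameter `k` are hypothetical and not taken from the paper.

```python
# Watson-Crick complement pairs over the DNA alphabet {A, G, C, T}.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(word: str) -> str:
    """Reverse the word and complement each base (assumed involution theta)."""
    return "".join(COMPLEMENT[base] for base in reversed(word))


def has_hairpin(word: str, k: int) -> bool:
    """Return True if word = x v y theta(v) z for some factor v with len(v) >= k.

    Assumes k >= 1. It is enough to test stems of length exactly k:
    if a longer stem v pairs with theta(v), then the length-k prefix of v
    pairs with the length-k suffix of theta(v), which occurs even later.
    """
    n = len(word)
    for i in range(n - 2 * k + 1):
        stem = word[i:i + k]
        # Look for the reverse complement strictly after the stem (no overlap).
        if reverse_complement(stem) in word[i + k:]:
            return True
    return False


def is_hairpin_free(word: str, k: int) -> bool:
    """A word is hairpin-free (for stem length k) if it contains no such factorization."""
    return not has_hairpin(word, k)
```

For instance, under these assumptions `has_hairpin("AAGCCCCGCTT", 4)` holds because the stem AAGC is followed later by its reverse complement GCTT, whereas a word consisting only of A's is hairpin-free for every k ≥ 1. Note that this sketch allows an empty loop y, which matches the purely combinatorial definition assumed here but not necessarily the physical constraints on real DNA.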
