New Abelian Square-Free DT0L-Languages over 4 Letters

In 1906 Axel Thue [34] started the systematic study of structures in words. Consequently, he studied basic objects of theoretical computer science long before the invention of the computer or DNA. In 1961 Paul Erdos [13] raised the question whether abelian squares can be avoided in infinitely long words. In 1992, we presented in [19], see also [20–23], an abelian square-free (a-2-free) endomorphism g85 on the four letter alphabet S4 = {a, b, c, d}. The size of g85 , i.e. |g85 (abcd)|, is equal to 4×85. Until now, all known methods for constructing arbitrarily long a-2-free words on S4 have been based on the structure of this g85 ; see Arturo Carpi [4–6]. In this paper, we report of a completely new endomorphism g98 of S4 , the iteration of which produces an infinite abelian square-free word. The size of g98 is 4×98, and the image words for letters are constructed, in part, differently from the case of g85 . For g85 they were directly obtained by permutating letters cyclically. The endomorphism g98 is not an a-2-free endomorphism itself, since it does not preserve the a-2-freeness of all words of length 7. However, g98 can be used together with g85 to produce a-2-free DT0L-languages of unlimited size. Here DT0L-languages mean deterministic context-independent Lindenmayer languages produced by using compositions of endomorphisms – so called tables; see [32, p.188]. Indeed, by using Carpi's algorithm [4] for prefixes of g85 (S) and g98 (S), and a modified version of this algorithm, one can check the following fact: for any a-2-free words w1 and w2 , where w1 does not contain a certain subword pattern, g98 (w1 ) and g85 (w2 ) are always a-2-free and avoid all undesirable patterns that would, in the case of g98 , lead to an abelian-square in the next iteration step. It is anticipated that later on this new result will lead to a sharper bound for the base number (~1.000021) found in [5], where Carpi showed that the number of a-2-free words over 4 letters of length n is ¥ 1.000021n . We explain extensive computer aided searches that have been carried out over 11 years to find new ways of constructing a-2-free words over 4 letters. In this process, we have encountered some unintuitive non-linear phenomena which, however, are in accordance with the complex behaviour of simple systems studied by Stephen Wolfram in his long-awaited book [35]. For example, two same looking initial values for prefixes and suffixes of image words for letters may yield 1000-fold running times when searching through all possibilities for proper infixes! We also analyse the structure of g85 in terms of subwords, and discuss some open problems.

[1]  Giuseppe Pirillo,et al.  On uniformly repetitive semigroups , 1994 .

[2]  P. Pleasants Non-repetitive sequences , 1970, Mathematical Proceedings of the Cambridge Philosophical Society.

[3]  Arto Salomaa Jewels of formal language theory , 1981 .

[4]  STACS 84 , 1984, Lecture Notes in Computer Science.

[5]  Stephen Wolfram,et al.  A New Kind of Science , 2003, Artificial Life.

[6]  James D. Currie,et al.  Open problems in pattern avoidance , 1993 .

[7]  Arturo Carpi,et al.  On the Number of Abelian Square-free Words on Four Letters , 1998, Discret. Appl. Math..

[8]  Arturo Carpi On Abelian Squares and Substitutions , 1999, Theor. Comput. Sci..

[9]  F. Michel Dekking,et al.  Strongly Non-Repetitive Sequences and Progression-Free Sets , 1979, J. Comb. Theory, Ser. A.

[10]  Sergey V. Avgustinovich,et al.  Words Avoiding Abelian Inclusions , 2001, J. Autom. Lang. Comb..

[11]  A. B. Cook Some unsolved problems. , 1952, Hospital management.

[12]  Veikko Keränen On the k-Freeness of Morphisms on Free Monoids , 1987, STACS.

[13]  Burnside Groups of Odd Exponent and Irreducible Systems of Group Identities , 1973 .

[14]  Arturo Carpi,et al.  On Abelian Power-Free Morphisms , 1993, Int. J. Algebra Comput..

[15]  Tero Harju,et al.  Combinatorics on Words , 2004 .

[16]  Michel Leconte,et al.  K-th power free codes , 1984, Automata on Infinite Words.

[17]  Michel Leconte,et al.  A Characterization of Power-Free Morphisms , 1985, Theor. Comput. Sci..

[18]  Veikko Keränen,et al.  Abelian Squares are Avoidable on 4 Letters , 1992, ICALP.

[19]  Grzegorz Rozenberg,et al.  The mathematical theory of L systems , 1980 .