New Bounds and Extended Relations Between Prefix Arrays, Border Arrays, Undirected Graphs, and Indeterminate Strings

We extend earlier works on the relation of prefix arrays of indeterminate strings to undirected graphs and border arrays. If integer array y is the prefix array for indeterminate string w, then we say w satisfies y. We use a graph theoretic approach to construct a string on a minimum alphabet size which satisfies a given prefix array. We relate the problem of finding a minimum alphabet size to finding edge clique covers of a particular graph, allowing us to bound the minimum alphabet size by n+n$n+\sqrt {n}$ for indeterminate strings, where n is the size of the prefix array. When we restrict ourselves to prefix arrays for partial words, we bound the minimum alphabet size by 2n$\left \lceil \sqrt {2n} \right \rceil $. Moreover, we show that this bound is tight up to a constant multiple by using Sidon sets. We also study the relationship between prefix arrays and border arrays. We give necessary and sufficient conditions for a border array and prefix array to be satisfied by the same indeterminate string. We show that the slowly-increasing property completely characterizes border arrays for indeterminate strings, whence there are exactly Cn distinct border arrays of size n for indeterminate strings (here Cn is the nth Catalan number). We give an algorithm to enumerate all prefix arrays for partial words of a given size, n. Our algorithm has a time complexity of n3 times the output size, that is, the number of valid prefix arrays for partial words of length n. We also bound the number of prefix arrays for partial words of a given size using Stirling numbers of the second kind.

[1]  Maxime Crochemore,et al.  Reverse Engineering Prefix Tables , 2009, STACS.

[2]  Karl R. Abrahamson Generalized String Matching , 1987, SIAM J. Comput..

[3]  Shu Wang Indeterminate strings , 2009 .

[4]  Julien Clément,et al.  26th International Symposium on Theoretical Aspects of Computer Science (STACS 2009) , 2009 .

[5]  Noga Alon,et al.  Covering graphs by the minimum number of equivalence relations , 1986, Comb..

[6]  P. Erdös,et al.  The Representation of a Graph by Set Intersections , 1966, Canadian Journal of Mathematics.

[7]  R. Stanley Enumerative Combinatorics: Volume 1 , 2011 .

[8]  William F. Smyth,et al.  Counting Distinct Strings , 1999, Algorithmica.

[9]  P. Ebdos,et al.  ON A PROBLEM OF SIDON IN ADDITIVE NUMBER THEORY, AND ON SOME RELATED PROBLEMS , 2002 .

[10]  P. Erdös On a Problem of Sidon in Additive Number Theory and on Some Related Problems Addendum , 1944 .

[11]  Francine Blanchet-Sadri Algorithmic Combinatorics on Partial Words , 2012, Int. J. Found. Comput. Sci..

[12]  Maxime Crochemore,et al.  Algorithms on strings , 2007 .

[13]  Michael G. Main,et al.  An O(n log n) Algorithm for Finding All Repetitions in a String , 1984, J. Algorithms.

[14]  Jonathan L. Gross,et al.  Handbook of graph theory , 2007, Discrete mathematics and its applications.

[15]  M. Fischer,et al.  STRING-MATCHING AND OTHER PRODUCTS , 1974 .

[16]  Francine Blanchet-Sadri Algorithmic Combinatorics on Partial Words (Discrete Mathematics and Its Applications) , 2007 .

[17]  Shu Wang,et al.  Indeterminate strings, prefix arrays & undirected graphs , 2014, Theor. Comput. Sci..

[18]  M. Crochemore,et al.  Algorithms on Strings: Tools , 2007 .

[19]  William F. Smyth,et al.  Computing Patterns in Strings , 2003 .

[20]  Shu Wang,et al.  New Perspectives on the Prefix Array , 2008, SPIRE.

[21]  Francine Blanchet-Sadri,et al.  New Bounds and Extended Relations Between Prefix Arrays, Border Arrays, Undirected Graphs, and Indeterminate Strings , 2014, STACS.