A Variant of Pattern Matching for Multiwords

Multiwords are words in which a single symbol can be replaced by a nonempty set of symbols. A pattern w is certain in a multiword M if it occurs in every word that can be obtained by selecting one single symbol among the symbols provided in each position of M . Motivated by a problem on incomplete databases, we investigate a variant of the pattern matching problem which is to decide whether a pattern w is certain in a multiword M . We study the language CERTAIN(w) of multiwords in which w is certain. We show that this regular language is aperiodic for three large families of words and we study the size of its minimal automaton.

[1]  Shu Wang,et al.  Fast pattern-matching on indeterminate strings , 2008, J. Discrete Algorithms.

[2]  Jean Berstel,et al.  Partial Words and a Theorem of Fine and Wilf , 1999, Theor. Comput. Sci..

[3]  Maxime Crochemore,et al.  Algorithms on strings , 2007 .

[4]  Alfred V. Aho,et al.  The Design and Analysis of Computer Algorithms , 1974 .

[5]  Tero Harju,et al.  Combinatorics on Words , 2004 .

[6]  Jef Wijsen,et al.  On First-Order Query Rewriting for Incomplete Database Histories , 2009, 2009 16th International Symposium on Temporal Representation and Reasoning.

[7]  Robert S. Boyer,et al.  A fast string searching algorithm , 1977, CACM.

[8]  M. Fischer,et al.  STRING-MATCHING AND OTHER PRODUCTS , 1974 .

[9]  Tero Harju,et al.  Relational codes of words , 2007, Theor. Comput. Sci..

[10]  Raymond E. Miller,et al.  Varieties of Formal Languages , 1986 .

[11]  Donald E. Knuth,et al.  Fast Pattern Matching in Strings , 1977, SIAM J. Comput..

[12]  H. Wilf,et al.  Uniqueness theorems for periodic functions , 1965 .

[13]  Marcel Paul Schützenberger,et al.  On Finite Monoids Having Only Trivial Subgroups , 1965, Inf. Control..

[14]  Gregory Kucherov,et al.  Subset Seed Automaton , 2007, CIAA.

[15]  R. McNaughton,et al.  Counter-Free Automata , 1971 .

[16]  Alfred V. Aho,et al.  Efficient string matching , 1975, Commun. ACM.

[17]  Costas S. Iliopoulos,et al.  Pattern Matching in Degenerate DNA/RNA Sequences , 2007, WALCOM.

[18]  Wojciech Rytter,et al.  Text Algorithms , 1994 .

[19]  Francine Blanchet-Sadri Algorithmic Combinatorics on Partial Words (Discrete Mathematics and Its Applications) , 2007 .