Separability by Short Subsequences and Subwords

The separability problem for regular languages asks, given two regular languages I and E, whether there exists a language S that separates the two, that is, includes I but contains nothing from E. Typically, S comes from a simple, less expressive class of languages than I and E. In general, a simple separator $S$ can be seen as an approximation of I or as an explanation of how I and E are different. In a database context, separators can be used for explaining the result of regular path queries or for finding explanations for the difference between paths in a graph database, that is, how paths from given nodes u_1 to v_1 are different from those from u_2 to v_2. We study the complexity of separability of regular languages by combinations of subsequences or subwords of a given length k. The rationale is that the parameter k can be used to influence the size and simplicity of the separator. The emphasis of our study is on tracing the tractability of the problem.

[1]  Tomás Masopust,et al.  On Upper and Lower Bounds on the Length of Alternating Towers , 2014, MFCS.

[2]  Frank Wolter,et al.  Query Inseparability for Description Logic Knowledge Bases , 2014, KR.

[3]  Robert McNaughton,et al.  Algebraic decision procedures for local testability , 1974, Mathematical systems theory.

[4]  Robert E. Tarjan,et al.  Three Partition Refinement Algorithms , 1987, SIAM J. Comput..

[5]  Albert R. Meyer,et al.  Word problems requiring exponential time(Preliminary Report) , 1973, STOC.

[6]  R. McNaughton,et al.  Counter-Free Automata , 1971 .

[7]  Thomas Place,et al.  Separating Regular Languages by Piecewise Testable and Unambiguous Languages , 2013, MFCS.

[8]  Dan Suciu,et al.  A formal approach to finding explanations for database queries , 2014, SIGMOD Conference.

[9]  Thomas A. Henzinger,et al.  Abstractions from proofs , 2004, SIGP.

[10]  William Craig,et al.  Three uses of the Herbrand-Gentzen theorem in relating model theory and proof theory , 1957, Journal of Symbolic Logic.

[11]  Kenneth L. McMillan,et al.  Applications of Craig Interpolants in Model Checking , 2005, TACAS.

[12]  Wim Martens,et al.  Efficient Separability of Regular Languages by Subsequences and Suffixes , 2013, ICALP.

[13]  Lorijn van Rooijen A combinatorial approach to the separation problem for regular languages. (Une approche combinatoire du problème de séparation pour les langages réguliers) , 2014 .

[14]  Wang Chiew Tan Provenance in Databases: Past, Current, and Future , 2007, IEEE Data Eng. Bull..

[15]  Michaël Thomazo,et al.  On $k$-piecewise testability (preliminary report) , 2014, ArXiv.

[16]  Anthony Widjaja Lin,et al.  Parikh Images of Grammars: Complexity and Applications , 2010, 2010 25th Annual IEEE Symposium on Logic in Computer Science.

[17]  Peter T. Wood,et al.  Containment for XPath Fragments under DTD Constraints , 2003, ICDT.

[18]  Imre Simon,et al.  Piecewise testable events , 1975, Automata Theory and Formal Languages.

[19]  Thomas Place,et al.  Separating Regular Languages by Locally Testable and Locally Threshold Testable Languages , 2013, FSTTCS.

[20]  Peter Buneman,et al.  Provenance in databases , 2009, SIGMOD '07.

[21]  Carsten Lutz,et al.  Foundations for Uniform Interpolation and Forgetting in Expressive Description Logics , 2011, IJCAI.

[22]  Janusz A. Brzozowski,et al.  Characterizations of locally testable events , 1973, Discret. Math..

[23]  Yechezkel Zalcstein,et al.  Locally Testable Languages , 1972, J. Comput. Syst. Sci..

[24]  Jacques Stern,et al.  Complexity of Some Problems from the Theory of Automata , 1985, Inf. Control..