Deciding Determinism of Regular Languages

We investigate the problem to decide for DFAs, NFAs and regular expressions whether they describe deterministic regular languages. For DFAs with limited alphabet size we show the NL-completeness (nondeterministic logarithmic space-completeness) of the problem. Brüggemann-Klein and Wood (Inf. Comput. 142(2), 182–206, (1998)) gave an algorithm deciding for a minimal DFA whether it recognizes a deterministic regular language. Their algorithm requires polynomial time. Based on this algorithm we construct an algorithm for those DFAs which are not necessarily minimal, but limited in the size of the alphabet. We first focus on the case that the DFAs are minimal and extend to general DFAs later on. The new algorithm is substantially modified in contrast to the original one and uses structural properties of minimal DFAs and of special subautomata of the DFAs, called orbit automata. This algorithm runs in nondeterministic logarithmic space. After that we show the NL-hardness of the problem. For NFAs and regular expressions with arbitrary alphabets the problem is shown to be in PSPACE by Czerwiński et al. (2013) (and, in fact, to be PSPACE-complete). Their approach resulted in an algorithm working with space 𝒪(x4)$\mathcal {O}(x^{4})$. However, our approach for the NL-algorithm can be modified to get an algorithm for NFAs and regular expressions using just quadratic space.

[1]  Yo-Sub Han,et al.  Generalized One-Unambiguity , 2011, Developments in Language Theory.

[2]  Pekka Kilpeläinen,et al.  Checking determinism of XML Schema content models in optimal time , 2011, Inf. Syst..

[3]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[4]  Ping Lu,et al.  Assisting the Design of XML Schema: Diagnosing Nondeterministic Content Models , 2011, APWeb.

[5]  V. Glushkov THE ABSTRACT THEORY OF AUTOMATA , 1961 .

[6]  Salil P. Vadhan,et al.  Computational Complexity , 2005, Encyclopedia of Cryptography and Security.

[7]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[8]  Nivat G. Päun,et al.  Handbook of Formal Languages , 2013, Springer Berlin Heidelberg.

[9]  Ping Lu,et al.  Checking Determinism of Regular Expressions with Counting , 2012, Developments in Language Theory.

[10]  Helena Ahonen Disambiguation of SGML Content Models , 1996, PODP.

[11]  John E. Hopcroft,et al.  An n log n algorithm for minimizing states in a finite automaton , 1971 .

[12]  Neil Immerman Nondeterministic Space is Closed Under Complementation , 1988, SIAM J. Comput..

[13]  Marc Gyssens,et al.  Regular Expressions with Counting: Weak versus Strong Determinism , 2009, SIAM J. Comput..

[14]  Sebastian Maneth,et al.  Deterministic regular expressions in linear time , 2012, PODS.

[15]  Dag Hovland,et al.  The Membership Problem for Regular Expressions with Unordered Concatenation and Numerical Constraints , 2012, LATA.

[16]  Frank Neven,et al.  Simplifying XML schema: effortless handling of nondeterministic regular expressions , 2009, SIGMOD Conference.

[17]  Stefanie Scherzinger,et al.  Attribute grammars for scalable query processing on XML streams , 2005, The VLDB Journal.

[18]  Albert R. Meyer,et al.  The Equivalence Problem for Regular Expressions with Squaring Requires Exponential Space , 1972, SWAT.

[19]  Neil D. Jones,et al.  Space-Bounded Reducibility among Combinatorial Problems , 1975, J. Comput. Syst. Sci..

[20]  Wim Martens,et al.  Descriptional Complexity of Deterministic Regular Expressions , 2012, MFCS.

[21]  Anne Brüggemann-Klein,et al.  Regular Expressions into Finite Automata , 1992, Theor. Comput. Sci..

[22]  Grzegorz Rozenberg,et al.  Handbook of formal languages, vol. 1: word, language, grammar , 1997 .

[23]  Derick Wood,et al.  One-Unambiguous Regular Languages , 1998, Inf. Comput..