Regular Expressions with Numerical Constraints and Automata with Counters

Regular expressions with numerical constraints are an extension of regular expressions, allowing to bound numerically the number of times that a subexpression should be matched. Expressions in this extension describe the same languages as the usual regular expressions, but are exponentially more succinct. We define a class of finite automata with counters and a deterministic subclass of these. Deterministic finite automata with counters can recognize words in linear time. Furthermore, we describe a subclass of the regular expressions with numerical constraints, a polynomial-time test for this subclass, and a polynomial-time construction of deterministic finite automata with counters from expressions in the subclass.

[1]  Silvano Dal-Zilio,et al.  XML Schema, Tree Logic and Sheaves Automata , 2003, RTA.

[2]  Jan Willem Klop,et al.  Term Rewriting Systems: From Church-Rosser to Knuth-Bendix and Beyond , 1990, ICALP.

[3]  Ethan V. Munson Symposium on document engineering , 2002, SIGF.

[4]  Ville Laurikari,et al.  NFAs with tagged transitions, their conversion to deterministic automata and application to regular expressions , 2000, Proceedings Seventh International Symposium on String Processing and Information Retrieval. SPIRE 2000.

[5]  V. Glushkov THE ABSTRACT THEORY OF AUTOMATA , 1961 .

[6]  Pekka Kilpeläinen,et al.  Towards efficient implementation of XML schema content models , 2004, DocEng '04.

[7]  Derick Wood,et al.  One-Unambiguous Regular Languages , 1998, Inf. Comput..

[8]  Pekka Kilpeläinen,et al.  Regular Expressions with Numerical Occurrence Indicators - preliminary results , 2003, SPLST.

[9]  Anne Brüggemann-Klein,et al.  Regular Expressions into Finite Automata , 1992, Theor. Comput. Sci..

[10]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[11]  Helmut Seidl,et al.  Exact XML Type Checking in Polynomial Time , 2007, ICDT.

[12]  守屋 悦朗,et al.  J.E.Hopcroft, J.D. Ullman 著, "Introduction to Automata Theory, Languages, and Computation", Addison-Wesley, A5変形版, X+418, \6,670, 1979 , 1980 .

[13]  Pekka Kilpeläinen,et al.  One-unambiguity of regular expressions with numeric occurrence indicators , 2007, Inf. Comput..

[14]  Dario Colazzo,et al.  Linear time membership in a class of regular expressions with interleaving and counting , 2008, CIKM '08.

[15]  S C Kleene,et al.  Representation of Events in Nerve Nets and Finite Automata , 1951 .

[16]  Frank Neven,et al.  Optimizing Schema Languages for XML: Numerical Constraints and Interleaving , 2009, SIAM J. Comput..