On Minimizing Regular Expressions Without Kleene Star

Finite languages lie at the heart of literally every regular expression. Therefore, we investigate the approximation complexity of minimizing regular expressions without Kleene star, or, equivalently, regular expressions describing finite languages. On the side of approximation hardness, given such an expression of size s, we prove that it is impossible to approximate the minimum size required by an equivalent regular expression within a factor of O ( s (log s)2+δ ) if the running time is bounded by a quasipolynomial function depending on δ, for every δ > 0, unless the exponential time hypothesis (ETH) fails. For approximation ratio O(s1−δ), we prove an exponential time lower bound depending on δ, assuming ETH. The lower bounds apply for alphabets of constant size. On the algorithmic side, we show that the problem can be approximated in polynomial time within O( s log log s log s ), with s being the size of the given regular expression. For constant alphabet size, the bound improves to O( s log s ). Finally, we devise a familiy of superpolynomial approximation algorithms that attain the performance ratios of the lower bounds, while their running times are only slightly above those excluded by the ETH.

[1]  Markus Holzer,et al.  Language Operations with Regular Expressions of Polynomial Size , 2008, DCFS.

[2]  Parinya Chalermsook,et al.  Nearly Tight Approximability Results for Minimum Biclique Cover and Partition , 2014, ESA.

[3]  Dániel Marx,et al.  Lower bounds based on the Exponential Time Hypothesis , 2011, Bull. EATCS.

[4]  Lorenzo Clemente,et al.  Efficient reduction of nondeterministic automata with application to language inclusion testing , 2017, Log. Methods Comput. Sci..

[5]  V. V. Williams ON SOME FINE-GRAINED QUESTIONS IN ALGORITHMS AND COMPLEXITY , 2019, Proceedings of the International Congress of Mathematicians (ICM 2018).

[6]  Jeffrey Shallit,et al.  Optimal Regular Expressions for Permutations , 2019, ICALP.

[7]  Allan Grønlund Jørgensen,et al.  A Dichotomy for Regular Expression Membership Testing , 2016, 2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS).

[8]  Markus Holzer,et al.  Tight Bounds on the Descriptional Complexity of Regular Expressions , 2009, Developments in Language Theory.

[9]  Georg Schnitger,et al.  Minimizing nfa's and regular expressions , 2007, J. Comput. Syst. Sci..

[10]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[11]  Amir Abboud,et al.  If the Current Clique Algorithms are Optimal, So is Valiant's Parser , 2015, 2015 IEEE 56th Annual Symposium on Foundations of Computer Science.

[12]  Harry B. Hunt,et al.  On the time and tape complexity of languages I , 1973, STOC.

[13]  Jeffrey Shallit,et al.  Enumerating Regular Expressions and Their Languages , 2004, CIAA.

[14]  Jan Van den Bussche,et al.  Naive Infinite Enumeration of Context-free Languages in Incremental Polynomial Time , 2015, J. Univers. Comput. Sci..

[15]  Markus Holzer,et al.  Computational Complexity of NFA Minimization for Finite and Unary Languages , 2007, LATA.

[16]  Markus Holzer,et al.  On Minimal Grammar Problems for Finite Languages , 2018, DLT.

[17]  Mateus de Oliveira Oliveira,et al.  On the Fine Grained Complexity of Finite Automata Non-emptiness of Intersection , 2020, DLT.

[18]  Galina Jirásková,et al.  A Survey on Fooling Sets as Effective Tools for Lower Bounds on Nondeterministic Complexity , 2018, Adventures Between Lower Bounds and Higher Altitudes.

[19]  Henning Fernau,et al.  Problems on Finite Automata and the Exponential Time Hypothesis , 2016, CIAA.

[20]  Michael Wehar,et al.  Hardness Results for Intersection Non-Emptiness , 2014, ICALP.

[21]  Russell Impagliazzo,et al.  Which problems have strongly exponential complexity? , 1998, Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No.98CB36280).

[22]  Frantisek Mráz,et al.  Two-Dimensional Pattern Matching Against Basic Picture Languages , 2019, CIAA.

[23]  Jeffrey Shallit,et al.  Regular Expressions: New Results and Open Problems , 2004, J. Autom. Lang. Comb..