Finding All Minimum-Size DFA Consistent with Given Examples: SAT-Based Approach

Deterministic finite automaton (DFA) is a fundamental concept in the theory of computation. The NP-hard DFA identification problem can be efficiently solved by translation to the Boolean satisfiability problem (SAT). Previously we developed a technique to reduce the problem search space by enforcing DFA states to be enumerated in breadth-first search (BFS) order. We proposed symmetry breaking predicates, which can be added to Boolean formulae representing various automata identification problems. In this paper we continue the study of SAT-based approaches. First, we propose new predicates based on depth-first search order. Second, we present three methods to identify all non-isomorphic automata of the minimum size instead of just one—the Open image in new window P-complete problem which has not been solved before. Third, we revisited our implementation of the BFS-based approach and conducted new evaluation experiments. It occurs that BFS-based approach outperforms all other exact algorithms for DFA identification and can be effectively applied for finding all solutions of the problem.

[1]  Marcelo F. Frias,et al.  TACO: Efficient SAT-Based Bounded Verification Using Symmetry Breaking and Tight Bounds , 2013, IEEE Transactions on Software Engineering.

[2]  James M. Crawford,et al.  Symmetry-Breaking Predicates for Search Problems , 1996, KR.

[3]  Pierre Dupont,et al.  Regular Grammatical Inference from Positive and Negative Samples by Genetic Search: the GIG Method , 1994, ICGI.

[4]  Barak A. Pearlmutter,et al.  Results of the Abbadingo One DFA Learning Competition and a New Evidence-Driven State Merging Algorithm , 1998, ICGI.

[5]  Simon M. Lucas,et al.  Learning DFA: evolution versus evidence driven state merging , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[6]  Igor Buzhinsky,et al.  Exact finite-state machine identification from scenarios and temporal properties , 2016, International Journal on Software Tools for Technology Transfer.

[7]  Andrzej Zbrzezny,et al.  A New Translation from ECTL* to SAT , 2012, Fundam. Informaticae.

[8]  Colin de la Higuera,et al.  A bibliographical study of grammatical inference , 2005, Pattern Recognit..

[9]  E. Mark Gold,et al.  Complexity of Automaton Identification from Given Data , 1978, Inf. Control..

[10]  Niklas Sörensson,et al.  An Extensible SAT-solver , 2003, SAT.

[11]  Neil Walkinshaw,et al.  STAMINA: a competition to encourage the development and assessment of software model inference techniques , 2012, Empirical Software Engineering.

[12]  James J. Lu,et al.  Solving SQL Constraints by Incremental Translation to SAT , 2008, IEA/AIE.

[13]  Vladimir Ulyantsev,et al.  BFS-Based Symmetry Breaking Predicates for DFA Identification , 2015, LATA.

[14]  Marijn J. H. Heule,et al.  Exact DFA Identification Using SAT Solvers , 2010, ICGI.

[15]  Arlindo L. Oliveira,et al.  Inference of regular languages using state merging algorithms with search , 2005, Pattern Recognit..

[16]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[17]  Vladimir Ulyantsev,et al.  Extended Finite-State Machine Induction Using SAT-Solver , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.