Active Coevolutionary Learning of Deterministic Finite Automata

This paper describes an active learning approach to the problem of grammatical inference, specifically the inference of deterministic finite automata (DFAs). We refer to the algorithm as the estimation-exploration algorithm (EEA). This approach differs from previous passive and active learning approaches to grammatical inference in that training data is actively proposed by the algorithm, rather than passively receiving training data from some external teacher. Here we show that this algorithm outperforms one version of the most powerful set of algorithms for grammatical inference, evidence driven state merging (EDSM), on randomly-generated DFAs. The performance increase is due to the fact that the EDSM algorithm only works well for DFAs with specific balances (percentage of positive labelings), while the EEA is more consistent over a wider range of balances. Based on this finding we propose a more general method for generating DFAs to be used in the development of future grammatical inference algorithms.

[1]  Boris A. Trakhtenbrot,et al.  Finite automata : behavior and synthesis , 1973 .

[2]  John W. Carr,et al.  A Solution of the Syntactical Induction-Inference Problem for Regular Languages , 1978, Comput. Lang..

[3]  Dana Angluin,et al.  A Note on the Number of Queries Needed to Identify Regular Languages , 1981, Inf. Control..

[4]  Dana Angluin,et al.  Learning Regular Sets from Queries and Counterexamples , 1987, Inf. Comput..

[5]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[6]  Jerome A. Feldman,et al.  Learning automata from ordered examples , 1991, COLT '88.

[7]  Leonard Pitt,et al.  Inductive Inference, DFAs, and Computational Complexity , 1989, AII.

[8]  J. Oncina,et al.  INFERRING REGULAR LANGUAGES IN POLYNOMIAL UPDATED TIME , 1992 .

[9]  H. Sebastian Seung,et al.  Query by committee , 1992, COLT '92.

[10]  Vasant G Honavar,et al.  Efficient Learning of Regular Languages Using Teacher-Supplied Positive Samples and Learner-Generated Queries , 1993 .

[11]  P. García,et al.  A new regular language learning algorithm from lexicographically ordered complete samples , 1993 .

[12]  Enrique Vidal,et al.  What Is the Search Space of the Regular Inference? , 1994, ICGI.

[13]  Francesco Bergadano,et al.  Inductive Logic Programming: From Machine Learning to Software Engineering , 1995 .

[14]  Pierre Dupont,et al.  Incremental regular inference , 1996, ICGI.

[15]  Rajesh Parekh,et al.  An incremental interactive algorithm for grammar inference , 1996 .

[16]  Samir W. Mahfoud Niching methods for genetic algorithms , 1996 .

[17]  Scott Brave,et al.  Evolving deterministic finite automata using cellular encoding , 1996 .

[18]  Vidroha Debroy,et al.  Genetic Programming , 1998, Lecture Notes in Computer Science.

[19]  Barak A. Pearlmutter,et al.  Results of the Abbadingo One DFA Learning Competition and a New Evidence-Driven State Merging Algorithm , 1998, ICGI.

[20]  Stefan C. Kremer,et al.  Inducing Grammars from Sparse Data Sets: A Survey of Algorithms and Results , 2003, J. Mach. Learn. Res..

[21]  Hod Lipson,et al.  Automated robot function recovery after unanticipated failure or environmental change using a minimum of hardware trials , 2004, Proceedings. 2004 NASA/DoD Conference on Evolvable Hardware, 2004..

[22]  Ran El-Yaniv,et al.  Online Choice of Active Learning Algorithms , 2003, J. Mach. Learn. Res..

[23]  Hod Lipson,et al.  Automating Genetic Network Inference with Minimal Physical Experimentation Using Coevolution , 2004, GECCO.

[24]  Dana Angluin Queries revisited , 2004, Theor. Comput. Sci..

[25]  Simon M. Lucas,et al.  Learning deterministic finite automata with a smart state labeling evolutionary algorithm , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Hod Lipson,et al.  Nonlinear system identification using coevolution of models and tests , 2005, IEEE Transactions on Evolutionary Computation.

[27]  Bengt Jonsson,et al.  Insights to Angluin's Learning , 2005, SVV@ICLP.