Extracting and Learning an Unknown Grammar with Recurrent Neural Networks

Simple second-order recurrent networks are shown to readily learn small known regular grammars when trained with positive and negative string examples. We show that similar methods are appropriate for learning unknown grammars from examples of their strings. The training algorithm is an incremental real-time recurrent learning (RTRL) method that computes the complete gradient and updates the weights at the end of each string. After or during training, a dynamic clustering algorithm extracts the production rules that the neural network has learned. The methods are illustrated by extracting rules from unknown deterministic regular grammars. In many cases the extracted grammar outperforms the neural net from which it was extracted in correctly classifying unseen strings.
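
For concreteness, the following is a minimal sketch of the second-order forward pass described above, assuming one-hot input symbols and a designated response unit whose activation encodes accept/reject; the weight shapes, identifier names, and the 0.5 accept threshold are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class SecondOrderRNN:
    """Second-order recurrent network: S_i(t+1) = g(sum_jk W_ijk S_j(t) I_k(t))."""

    def __init__(self, n_states, n_symbols, seed=0):
        rng = np.random.default_rng(seed)
        # W[i, j, k] couples current state unit j with input symbol k
        # to produce the next value of state unit i (second-order weights).
        self.W = rng.uniform(-0.5, 0.5, size=(n_states, n_states, n_symbols))
        self.n_states = n_states
        self.n_symbols = n_symbols

    def run(self, string):
        """Process a string of symbol indices; return the final state vector."""
        s = np.zeros(self.n_states)
        s[0] = 1.0  # assumed initial state: response unit on, all others off
        for sym in string:
            x = np.zeros(self.n_symbols)
            x[sym] = 1.0  # one-hot encoding of the current input symbol
            # Bilinear update: contract the weight tensor with state and input.
            s = sigmoid(np.einsum('ijk,j,k->i', self.W, s, x))
        return s

    def accepts(self, string, threshold=0.5):
        # Convention assumed here: state unit 0 is the response (accept) unit.
        return self.run(string)[0] > threshold

# Example: classify a binary string with random (untrained) weights.
net = SecondOrderRNN(n_states=4, n_symbols=2)
print(net.accepts([0, 1, 1, 0]))
```

In this sketch the bilinear state update is what makes the network "second-order": each weight gates a (state, input) pair rather than a single unit, which maps naturally onto the state transitions of a finite automaton and is what the clustering step would later quantize into production rules.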