Constrained Second-Order Recurrent Networks for Finite-State Automata Induction

This paper presents an improved training algorithm for second-order dynamical recurrent networks applied to the problem of finite-state automata (FSA) induction. Second-order networks allow a natural encoding of finite-state automata in which each second-order connection weight corresponds to a single FSA transition. In practice, however, when trained using gradient descent, these networks almost never assume this type of encoding, and sophisticated algorithms must be used to extract the encoded automata. This paper suggests a simple modification to the standard error function for second-order dynamical recurrent networks that encourages these networks to assume natural FSA encodings when trained using gradient descent. This obviates the need for cluster-based extraction techniques and provides a simple method for guaranteeing the stability of the network on arbitrarily long sequences. Initial results also suggest that fewer training strings must be presented to achieve convergence under the modified error function.
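
For reference, a common second-order recurrent formulation (e.g., that of Giles et al.; the notation below is illustrative rather than taken from this paper) updates the state units multiplicatively from the current state and the current one-hot input symbol:

\[
S_j^{(t+1)} \;=\; g\!\left(\sum_{i,k} W_{jik}\, S_i^{(t)}\, I_k^{(t)}\right),
\qquad g(x) = \frac{1}{1 + e^{-x}},
\]

where \(S^{(t)}\) is the vector of state-unit activations, \(I^{(t)}\) is the one-hot encoding of the input symbol at time \(t\), and \(g\) is the usual sigmoid. When both the state and the input vectors are nearly one-hot, the single weight \(W_{jik}\) acts as the FSA transition \(\delta(q_i, a_k) = q_j\), which is the "natural encoding" the modified error function is meant to encourage.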