Recurrent neural networks, hidden Markov models and stochastic grammars

The advantages of using a linear recurrent network to encode and recognize sequential data are discussed. The hidden Markov model (HMM) is shown to be a special case of such linear recurrent second-order neural networks. The Baum-Welch reestimation formula, which has proved very useful in training HMMs, can also be used to train a linear recurrent network. As an example, a network successfully learned the stochastic Reber grammar from only a few hundred sample strings in about 14 iterations. The relative merits and limitations of the Baum-Welch optimal-ascent algorithm in comparison with the error-correction, gradient-descent learning algorithm are discussed.
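The abstract does not include code. As an illustrative sketch only, the following NumPy snippet samples strings from the stochastic Reber grammar (each grammar state choosing between its two outgoing arcs with equal probability) and fits a standard discrete-output HMM by Baum-Welch reestimation. The grammar transition table, the state/symbol counts, the function names (reber_string, baum_welch), and the training setup (300 strings, 14 iterations) are assumptions made here for illustration, not the authors' linear-recurrent-network implementation.

import numpy as np

# Stochastic Reber grammar: each state offers two equally likely moves,
# written as (emitted symbol, next grammar state); state 5 terminates.
REBER = {
    0: [('T', 1), ('P', 2)],
    1: [('S', 1), ('X', 3)],
    2: [('T', 2), ('V', 4)],
    3: [('X', 2), ('S', 5)],
    4: [('P', 3), ('V', 5)],
}
SYMBOLS = ['B', 'T', 'P', 'S', 'X', 'V', 'E']
SYM2ID = {s: i for i, s in enumerate(SYMBOLS)}

def reber_string(rng):
    """Sample one string from the stochastic Reber grammar."""
    out, state = ['B'], 0
    while state != 5:
        sym, state = REBER[state][rng.integers(2)]
        out.append(sym)
    out.append('E')
    return out

def baum_welch(seqs, n_states=7, n_iter=14, seed=0):
    """Plain Baum-Welch reestimation for a discrete-output HMM."""
    rng = np.random.default_rng(seed)
    M = len(SYMBOLS)
    # Random row-stochastic initial parameters.
    pi = rng.random(n_states); pi /= pi.sum()
    A = rng.random((n_states, n_states)); A /= A.sum(axis=1, keepdims=True)
    B = rng.random((n_states, M)); B /= B.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        pi_acc = np.zeros(n_states)
        A_num = np.zeros((n_states, n_states)); A_den = np.zeros(n_states)
        B_num = np.zeros((n_states, M))
        for obs in seqs:
            T = len(obs)
            # Scaled forward pass.
            alpha = np.zeros((T, n_states)); c = np.zeros(T)
            alpha[0] = pi * B[:, obs[0]]
            c[0] = alpha[0].sum(); alpha[0] /= c[0]
            for t in range(1, T):
                alpha[t] = (alpha[t - 1] @ A) * B[:, obs[t]]
                c[t] = alpha[t].sum(); alpha[t] /= c[t]
            # Scaled backward pass.
            beta = np.zeros((T, n_states)); beta[-1] = 1.0
            for t in range(T - 2, -1, -1):
                beta[t] = (A @ (B[:, obs[t + 1]] * beta[t + 1])) / c[t + 1]
            # State and transition posteriors, accumulated over sequences.
            gamma = alpha * beta
            gamma /= gamma.sum(axis=1, keepdims=True)
            pi_acc += gamma[0]
            for t in range(T - 1):
                A_num += alpha[t][:, None] * A * B[:, obs[t + 1]] * beta[t + 1] / c[t + 1]
            A_den += gamma[:-1].sum(axis=0)
            for t in range(T):
                B_num[:, obs[t]] += gamma[t]
        # Reestimation step (the Baum-Welch update).
        pi = pi_acc / len(seqs)
        A = A_num / A_den[:, None]
        B = B_num / B_num.sum(axis=1, keepdims=True)
    return pi, A, B

rng = np.random.default_rng(0)
train = [[SYM2ID[s] for s in reber_string(rng)] for _ in range(300)]
pi, A, B = baum_welch(train)

After training, the learned emission matrix B typically becomes sparse, with each hidden state emitting one or two grammar symbols, which is one way to check that the model has captured the grammar's structure.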