论文信息 - A time-delay neural network architecture for isolated word recognition

A time-delay neural network architecture for isolated word recognition

Abstract A translation-invariant back-propagation network is described that performs better than a sophisticated continuous acoustic parameter hidden Markov model on a noisy, 100-speaker confusable vocabulary isolated word recognition task. The network's replicated architecture permits it to extract precise information from unaligned training patterns selected by a naive segmentation rule.

Geoffrey E. Hinton | Alexander H. Waibel | Kevin J. Lang | A. Waibel

[1] P. Werbos,et al. Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[2] Tomaso Poggio,et al. Cooperative computation of stereo disparity , 1988 .

[3] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[4] Geoffrey E. Hinton,et al. Experiments on Learning by Back Propagation. , 1986 .

[5] Peter F. Brown,et al. The acoustic-modeling problem in automatic speech recognition , 1987 .

[6] Geoffrey E. Hinton. Learning Translation Invariant Recognition in Massively Parallel Networks , 1987, PARLE.

[7] Geoffrey E. Hinton. Connectionist Learning Procedures , 1989, Artif. Intell..

[8] Geoffrey E. Hinton,et al. Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[9] Michael I. Jordan. Attractor dynamics and parallelism in a connectionist sequential machine , 1990 .

[10] James K. Baker,et al. Stochastic modeling for automatic speech understanding , 1990 .

[11] J J Hopfield,et al. Neural computation by concentrating information in time. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[12] E. McDermott,et al. Phoneme recognition using Kohonen's LVQ , 1988 .

[13] Andrew J. Viterbi,et al. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[14] Alexander H. Waibel,et al. Modular Construction of Time-Delay Neural Networks for Speech Recognition , 1989, Neural Computation.

[15] Kunihiko Fukushima,et al. Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Visual Pattern Recognition , 1982 .

[16] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.

[17] Geoffrey E. Hinton. Using fast weights to deblur old memories , 1987 .

[18] Lalit R. Bahl,et al. A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.