Experiments on neural net recognition of spoken and written text

The problems are discussed of the recognition of handprinted and spoken digits and the handprinted and spoken English alphabet. Four such experiments were conducted and the results were compared to a conventional nearest-neighbor classifier trained on the same data. Results indicate that neural networks and nearest-neighbor classifiers perform at near the same level of accuracy. For each task, a critical number of neurons can be determined experimentally which yields highest recognition accuracy with least hardware. This number can also measure the classification efficiency of the input feature encoder. Several techniques for optimizing the performance of layered networks are discussed. A constant level added to the input signal biases patterns into the range where the learning rate is highest. Eliminating near-zero weights after learning results in little loss of accuracy. Finally, a novel handwriting encoder is described. >

[1]  L. Rabiner,et al.  An algorithm for determining the endpoints of isolated utterances , 1974, The Bell System Technical Journal.

[2]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[3]  Lawrence R. Rabiner,et al.  An algorithm for determining the endpoints of isolated utterances , 1975, Bell Syst. Tech. J..

[4]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[5]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[7]  Biing-Hwang Juang,et al.  On the use of bandpass liftering in speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Y. Tohkura,et al.  A weighted cepstral distance measure for speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .

[10]  David J. Burr Speech Recognition Experiments with Perceptrons , 1987, NIPS.

[11]  Alex Waibel,et al.  Phoneme recognition: neural networks vs. hidden Markov models vs. hidden Markov models , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[12]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.