The VOCAL Speech Understanding System

This paper describes the VOCAL (Voice Operated CALculator) speech understanding system. VOCAL is a software package that lets its user program a computer to perform numerical calculations by speaking to it in English-like sentences. To accomplish this, VOCAL uses processes for acoustic, grammatical, and semantic analysis. These individual procedures, which are relatively simple, are embedded in a control structure that uses the information from each component to arrive at a meaningful interpretation of spoken sentences. One unique feature of VOCAL, which is essential to the development of speech understanding systems, is that it is complete and self-contained. Coded in standard FORTRAN, it is compact enough to run on many minicomputers and can be used in a real-time, on-line environment on slightly more powerful machines. Testing has shown that, despite a correct word identification rate of less than 60%, the VOCAL system usually interprets even very long sentences correctly.
