Recent advances in automatic speech recognition

Abstract The paper reviews the work done in speech recognition and understanding, mostly in the years 1976–1977. The attention is focussed on problems of system organization, use and representation of syntax and semantics, control strategies, lexical classification, extraction and emission of hypotheses about acoustic, phonetic and phonemic features.

[1]  William H. Paxton Experiments in Speech Understanding System Control , 1976 .

[2]  K. Steiglitz On the simultaneous estimation of poles and zeros in speech analysis , 1977 .

[3]  D. Klatt,et al.  Word verification in a speech understanding system , 1976 .

[4]  Donald E. Walker,et al.  Research on speech understanding and related areas at SRI , 1977 .

[5]  David R. Hill,et al.  Man-Machine Interaction Using Speech , 1971, Adv. Comput..

[6]  Victor R. Lesser,et al.  Parallelism in Artificial Intelligence Problem Solving: A Case Study of Hearsay II , 1977, IEEE Transactions on Computers.

[7]  Pietro Laface,et al.  A syntactic procedure for the recognition of glottal pulses in continuous speech , 1977, Pattern Recognit..

[8]  D.R. Reddy,et al.  Speech recognition by machine: A review , 1976, Proceedings of the IEEE.

[9]  Alan V. Oppenheim,et al.  Speech analysis homomorphic prediction , 1977 .

[10]  F. Jelinek,et al.  Continuous speech recognition by statistical methods , 1976, Proceedings of the IEEE.

[11]  Alistair D. C. Holden,et al.  A new method for accurate analysis of voiced speech , 1976, ICASSP.

[12]  P. Mermelstein Automatic segmentation of speech into syllabic units. , 1975, The Journal of the Acoustical Society of America.

[13]  Bishnu S. Atal,et al.  LPC prediction error--Analysis of its variation with the position of the analysis frame , 1977 .

[14]  S. Rivoira,et al.  An isolated-word recognizer based on grammar-controlled classification processes , 1978, Pattern Recognit..

[15]  Paul Mermelstein,et al.  On detecting nasals in continuous speech , 1975 .

[16]  A. Smith,et al.  Word hypothesization in the hearsay II speech system , 1976, ICASSP.

[17]  Sankar K. Pal,et al.  Effect of fuzzification on the plosive cognition system , 1978 .

[18]  T.B. Martin,et al.  Practical applications of voice input to machines , 1976, Proceedings of the IEEE.

[19]  R. DeMori,et al.  Syntactic Recognition of Speech Patterns , 1977 .

[20]  Victor R. Lesser,et al.  Focus of attention in a distributed-logic speech understanding system , 1976, ICASSP.

[21]  N. Dixon,et al.  A description of a parametrically controlled modular structure for speech processing , 1975 .

[22]  Pietro Torasso,et al.  Automatic recognition of liquids and nasals in continuous speech , 1977 .

[23]  Jared J. Wolf Speech Recognition and Understanding , 1980 .

[24]  Mark S. Fox,et al.  Maximal Consistent Interpretations of Errorful Data in Hierarchically Modeled Domains , 1977, IJCAI.

[25]  Renato De Mori,et al.  A Special-Purpose Computer for Digital Signal Processing , 1975, IEEE Transactions on Computers.

[26]  Lee D. Erman,et al.  Overview of the hearsay speech understanding research , 1976, SGAR.

[27]  Pietro Torasso,et al.  Lexical classification in a speech understanding system using fuzzy relations , 1976, ICASSP.

[28]  Masaki Kohda,et al.  Spoken word recognition using the restricted number of learning samples , 1976, ICASSP.

[29]  F. Poza,et al.  Acoustic phonetic research in speech understanding , 1975 .

[30]  R. Viswanathan,et al.  Objective speech quality evaluation of narrowband LPC vocoders , 1978, ICASSP.

[31]  Alan V. Oppenheim,et al.  Signal analysis by homomorphic prediction , 1976 .

[32]  A. Holden,et al.  A computer programming system using continuous speech input , 1976 .

[33]  Harvey F. Silverman,et al.  A general language-operated decision implementation system (GLODIS): Its application to continuous-speech segmentation , 1976 .

[34]  J.B. Allen,et al.  A unified approach to short-time Fourier analysis and synthesis , 1977, Proceedings of the IEEE.

[35]  Charles C. Tappert A Markov Model Acoustic Phonetic Component for Automatic Speech Recognition , 1977, Int. J. Man Mach. Stud..

[36]  Stephen E. Levinson,et al.  Computing relative redundancy to measure grammatical constraint in speech recognition tasks , 1978, ICASSP.

[37]  M. B. Herscher,et al.  Source data entry using voice input , 1976, ICASSP.

[38]  M. Bates,et al.  The use of syntax in a speech understanding system , 1975 .

[39]  George M. White,et al.  Speech Recognition: A Tutorial Overview , 1976, Computer.

[40]  Shuichi Itahashi,et al.  Direct linear prediction for fundamental frequency analysis , 1976, ICASSP.

[41]  Victor Zue,et al.  Dictionary expansion via phonological rules for a speech understanding system , 1976, ICASSP.

[42]  G. White,et al.  Speech recognition experiments with linear predication, bandpass filtering, and dynamic programming , 1976 .

[43]  Phillips B. Scott VICI - A speaker independent word recognition system , 1976, ICASSP.

[44]  Gary G. Hendrix,et al.  Expanding the Utility of Semantic Networks Through Partitioning , 1975, IJCAI.

[45]  Sei-ichi Nakagawa,et al.  A Machine Understanding System for Spoken Japanese Sentences , 1977 .

[46]  L. Saitta,et al.  Semantic support for a speech understanding system, based on fuzzy relations , 1976 .

[47]  J. Makhoul,et al.  A mixed‐source model for speech compression and synthesis , 1978 .

[48]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[49]  Lawrence R. Rabiner,et al.  Application of an LPC distance measure to the voiced-unvoiced-silence detection problem , 1977 .

[50]  Barbara J. Grosz,et al.  The Representation and Use of Focus in a System for Understanding Dialogs , 1977, IJCAI.

[51]  J. Forgie,et al.  An overview of the Lincoln Laboratory speech recognition system , 1974 .

[52]  J. Cheung,et al.  Computer recognition of linguistic stress patterns in connected speech , 1977 .

[53]  George M. White Dynamic programming, the viterbi algorithm, and low cost speech recognition , 1978, ICASSP.

[54]  Bruce T. Lowerre,et al.  The HARPY speech recognition system , 1976 .

[55]  Harvey F. Silverman,et al.  The 1976 modular acoustic processor(MAP) , 1977 .

[56]  John Makhoul,et al.  Adaptive lattice methods for linear prediction , 1978, ICASSP.

[57]  Henry Gilbert Goldberg,et al.  Segmentation and labeling of speech: a comparative performance evaluation. , 1976 .

[58]  Richard Fikes,et al.  A Network-Based Knowledge Representation and Its Natural Deduction System , 1977, IJCAI.

[59]  Jack Mostow,et al.  Syntax and semantics in a distributed speech understanding system , 1976, ICASSP.

[60]  Pietro Torasso,et al.  Performance analysis of syntactic-recognizer of isolated words , 1978, ICASSP.

[61]  Sankar K. Pal,et al.  Fuzzy sets and decisionmaking approaches in vowel and speaker recognition , 1977 .

[62]  Gary G. Hendrix,et al.  Procedures for Integrating Knowledge in a Speech Understanding System , 1977, IJCAI.

[63]  Seiichi Nakagawa,et al.  A Classification Method of Spoken Words in Continuous Speech for Many Speakers , 1976 .

[64]  J. Makhoul Stable and efficient lattice methods for linear prediction , 1977 .

[65]  Donald E. Walker Speech Understanding Research , 1976 .

[66]  Jared Wolf HWIM, a natural language speech understander , 1977, 1977 IEEE Conference on Decision and Control including the 16th Symposium on Adaptive Processes and A Special Symposium on Fuzzy Set Theory and Applications.

[67]  S. Itahashi,et al.  Discrete-word recognition utilizing a word dictionary and phonological rules , 1973 .

[68]  Lalit R. Bahl,et al.  Design of a linguistic statistical decoder for the recognition of continuous speech , 1975, IEEE Trans. Inf. Theory.

[69]  Lee Daniel Erman,et al.  An environment and system for machine understanding of connected speech. , 1974 .

[70]  Pietro Laface,et al.  Automatic detection and description of syllabic features in continuous speech , 1976 .

[71]  W. Hess,et al.  A pitch-synchronous digital feature extraction system for phonemic recognition of speech , 1976 .

[72]  Lawrence R. Rabiner,et al.  On creating reference templates for speaker independent recognition of isolated words , 1978 .

[73]  Aaron E. Rosenberg,et al.  Evaluation of a word recognition system using syntax analysis , 1977 .

[74]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[75]  William Hamilton Paxton,et al.  A framework for speech understanding , 1977 .

[76]  Barbara J. Grosz Establishing Context in Task-Oriented Dialogs , 1975 .

[77]  Franklin S. Cooper,et al.  Speech Understanding Systems , 1976, Artificial Intelligence.

[78]  G. Mercier,et al.  Automatic Segmentation of Speech into Syllabic and Phonemic Units: Application to French Words and Utterances , 1975 .

[79]  R. Bakis Continuous speech recognition via centisecond acoustic states , 1976 .

[80]  Allen Newell,et al.  Speech understanding systems : Final report of a study group , 1973 .

[81]  Kiyohiro Shikano,et al.  Speech recognition in the question-answering system operated by conversational speech , 1976, ICASSP.

[82]  Günther Ruske,et al.  An approach to speech recognition using syllabic decision units , 1978, ICASSP.

[83]  Lalit R. Bahl,et al.  Automatic recognition of continuously spoken sentences from a finite state grammer , 1978, ICASSP.

[84]  James K. Baker,et al.  Stochastic modeling as a means of automatic speech recognition. , 1975 .

[85]  Victor Lesser,et al.  Organization of the Hearsay II speech understanding system , 1975 .

[86]  Janet MacIver Baker A New Time-Domain Analysis of Human Speech and Other Complex Waveforms. , 1975 .

[87]  B. Nash-Webber,et al.  Semantic support for a speech understanding system , 1975 .

[88]  V. Zue,et al.  The role of phonological rules in speech understanding research , 1975 .

[89]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[90]  Michel Lecours,et al.  Vocal tract area function measurements: Two time-domain methods , 1976, ICASSP.

[91]  R. Schwartz,et al.  Advanced acoustic techniques in automatic speech understanding , 1977 .

[92]  Jean-Marie Pierrel,et al.  Organization and operation of a connected speech understanding system at lexical, syntactic and semantic levels , 1976, ICASSP.