Harpy, production systems and human cognition

Harpy is a speech understanding system that attained (in 1976) a set of highly demanding specifications laid down in 1971 for the recognition of continuous speech (902 accuracy, 1000 words, several speakers, but restricted syntax and semantics). Harpy is an achievement in artificial intelligence, without regard to psychology. Its success, however, makes it worthwhile to ask for its implications for the psychology of speech perception. This paper explores that issue by performing a sufficiency analysis, ie, by constructing a psychological model of speech perception that is faithful to Harpy and then inquiring whether it is acceptable given what we know about human processing capabilities. The strategy for accomplishing this is to select a specific model of basic human cognitive architecture, a production system architecture called HPSA77 that is under investigation independently as a model of cognition; and then to map Harpy into this structure in a way that maintains performance plausibility. The paper (1) presents the production system architecture; (2) presents Harpy; (3) performs the mapping; (4) detours into a consideration of intensity encoding in production systems to solve a problem in the mapping; (5) does the sufficiency analysis; (6) examines what the model says about some human speech phenomena; (7) attempts to state what has been achieved. It seems to the author that a viable and interesting theory of human speech perception has been generated by this exercise, though it has several major difficulties (as noted).

[1]  G. A. Miller,et al.  Verbal context and the recall of meaningful material. , 1950, The American journal of psychology.

[2]  L. R. Peterson,et al.  Short-term retention of individual verbal items. , 1959, Journal of experimental psychology.

[3]  H. Rubenstein,et al.  Intelligibility of Known and Unknown Message Sets , 1959 .

[4]  Kenneth N. Stevens,et al.  Speech recognition: A model and a program for research , 1962, IRE Trans. Inf. Theory.

[5]  U NEISSER,et al.  Searching for Ten Targets Simultaneously , 1963, Perceptual and motor skills.

[6]  G. A. Miller,et al.  Some perceptual consequences of linguistic rules , 1963 .

[7]  J. Fodor,et al.  The psychological reality of linguistic segments , 1965 .

[8]  W. J. McGill Neural counting mechanisms and energy detection in audition , 1967 .

[9]  Broadbent De Word-frequency effect and response bias. , 1967 .

[10]  A M Liberman,et al.  Perception of the speech code. , 1967, Psychological review.

[11]  Saul Sternberg,et al.  The discovery of processing stages: Extensions of Donders' method , 1969 .

[12]  R. M. Warren Perceptual Restoration of Missing Speech Sounds , 1970, Science.

[13]  T. Bever,et al.  The nonperceptual reality of the phoneme. , 1970 .

[14]  J. Frederiksen Statistical decision model for auditory word recognition. , 1971, Psychological review.

[15]  D M Green,et al.  A neural timing theory for response times and the psychophysics of intensity. , 1972, Psychological review.

[16]  Allen Newell,et al.  Speech understanding systems : Final report of a study group , 1973 .

[17]  Allen Newell,et al.  Human Problem Solving. , 1973 .

[18]  David McNeill,et al.  The Perceptual Reality of Phonemes, Syllables, Words, and Sentences. , 1973 .

[19]  Allen Newell,et al.  Production Systems: Models of Control Structures , 1973 .

[20]  T. Landauer,et al.  Structural differences between common and rare words: Failure of equivalence assumptions for theories of word recognition , 1973 .

[21]  H A Simon,et al.  How Big Is a Chunk? , 1974, Science.

[22]  G. W. Baylor,et al.  An information processing theory of aspects of the development of weight seriation in children , 1974 .

[23]  Donald A. Waterman,et al.  Adaptive Production Systems , 1975, IJCAI.

[24]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[25]  Georgette Silva,et al.  On automatic speech-understanding systems , 1975 .

[26]  J. G. Wallace,et al.  Cognitive development: An information-processing view , 1976 .

[27]  F. Jelinek,et al.  Continuous speech recognition by statistical methods , 1976, Proceedings of the IEEE.

[28]  V. Gregg Word frequency, recognition and recall. , 1976 .

[29]  R. M. Warren chapter 11 – Auditory Illusions and Perceptual Processes , 1976 .

[30]  Michael David Rychener,et al.  Production systems as a programming language for artificial intelligence applications. , 1976 .

[31]  Henry Gilbert Goldberg,et al.  Segmentation and labeling of speech: a comparative performance evaluation. , 1976 .

[32]  Bruce T. Lowerre,et al.  The HARPY speech recognition system , 1976 .

[33]  R. Shiffrin,et al.  Controlled and automatic human information processing: I , 1977 .

[34]  Walter Schneider,et al.  Controlled and Automatic Human Information Processing: 1. Detection, Search, and Attention. , 1977 .

[35]  Sei-ichi Nakagawa,et al.  A Machine Understanding System for Spoken Japanese Sentences , 1977 .

[36]  R. G. Crowder Principles of learning and memory , 1977 .

[37]  Franklin S. Cooper,et al.  Speech Understanding Systems , 1976, Artificial Intelligence.

[38]  Walter Schneider,et al.  Controlled and automatic human information processing: II. Perceptual learning, automatic attending and a general theory. , 1977 .

[39]  Allen Newell,et al.  An instructable production system: basic design issues , 1977, SGAR.

[40]  J. McDermott,et al.  Production system conflict resolution strategies , 1977, SGAR.

[41]  John P. McDermott,et al.  OPS, A Domain-Independent Production System Language , 1977, IJCAI.

[42]  C. Forgy,et al.  PRODUCTION SYSTEM CONFLICT RESOLUTION STRATEGIES1 , 1978 .

[43]  John R. Anderson,et al.  A Theory of the Acquisition of Cognitive Skills. , 1978 .

[44]  Allen Newell,et al.  AN INSTRUCTABLE PRODUCTION SYSTEM: BASIC DESIGN ISSUES1 , 1978 .

[45]  Steven Michael Rubin,et al.  The argos image understanding system. , 1978 .

[46]  D. M. Green,et al.  Speed-accuracy Trade off in Auditory Detection1 , 2022 .