An Approach to syntactic recognition without phonemics

Linguistic and perceptual arguments suggest that, in speech recognition systems, syntactic hypotheses should be formed before phonemic segments are identified. Prosodic features can provide some cues to constituent structure. In a variety of texts and excerpts from conversations, spoken by several talkers, a decrease in voice fundamental frequency (F 0 ) usually occurred at the end of each major syntactic constituent, and an increase in F 0 occurred near the beginning of the following constituent. A computer program based on this regularity correctly detected over 80 percent of all syntactically predicted boundaries. Some boundaries between minor constituents were also detected by the fall-rise patterns in F 0 . False boundary detections resulted from F 0 variations at boundaries between vowels and consonants, but most such false alarms could be eliminated by setting a minimum percent variation in F 0 for a boundary detection. Sentence boundaries were accompanied by large F 0 increases and substantial pauses. The categories of constituents affect boundary detection results, with noun phrase-verbal sequences showing particularly infrequent detection. Prosodic cues to stress patterns and stress-to-syntax rules may be used to detect other aspects of syntactic structure. Syntactic structure hypotheses might then be used to guide phonetic recognition procedures within constituents.

[1]  I. Mattingly Synthesis by Rule of Prosodic Features , 1966 .

[2]  Wayne A. Lea,et al.  Use of Syntactic Segmentation and Stressed Syllable Location in Phonemic Recognition. , 1973 .

[3]  Lee S. Hultzén,et al.  Information Points in Intonation , 1959 .

[4]  N. F. Johnson The psychological reality of phrase-structure rules , 1965 .

[5]  H C Barik,et al.  On Defining Juncture Pauses: A Note On Boomer's "Hesitation and Grammatical Encoding" , 1968, Language and speech.

[6]  D. S. Boomer Hesitation and Grammatical Encoding , 1965, Language and speech.

[7]  T. P. Barnwell,et al.  An algorithm for segment durations in a reading machine context , 1971 .

[8]  M. Lewis,et al.  Infant Speech: A STUDY OF THE BEGINNINGS OF LANGUAGE , 1938 .

[9]  D. Bolinger Contrastive Accent and Contrastive Stress , 1961 .

[10]  C. Baltaxe,et al.  Principles of phonology , 1969 .

[11]  Thomas G. Bever,et al.  The underlying structures of sentences are the primary units of immediate speech processing , 1969 .

[12]  George A. Miller,et al.  Decision units in the perception of speech , 1962, IRE Trans. Inf. Theory.

[13]  Werner F. Leopold,et al.  PATTERNING IN CHILDREN'S LANGUAGE LEARNING , 1953 .

[14]  Kenneth N. Stevens,et al.  Speech recognition: A model and a program for research , 1962, IRE Trans. Inf. Theory.

[15]  Mark F Medress,et al.  Acoustic Correlates of Word Stress , 1972 .

[16]  G. E. Peterson,et al.  Automatic Speech Recognition Procedures , 1961 .

[17]  Jay Earley,et al.  An efficient context-free parsing algorithm , 1970, Commun. ACM.

[18]  Noam Chomsky,et al.  The Sound Pattern of English , 1968 .

[19]  George A. Miller,et al.  Introduction to the Formal Analysis of Natural Languages , 1968 .