Acoustic-Phonetic Knowledge Representation: Implications from Spectrogram Reading Experiments

This paper summarizes several spectrogram reading experiments designed mainly to determine how much phonetic information the speech signal contains. The task was to identify the phonetic content of an utterance solely from visual examination of its spectrogram. The results generally support the notion that the speech signal contains a great deal of phonetic information, which can be extracted by properly applying phonetic rules. From these results, it is argued that phonetic recognition in speech recognition systems can be improved substantially, and that improved phonetic recognition will lead to speech recognition systems of greatly increased complexity and sophistication.
