Abstract Speech is a natural error-correcting code. The speech signal is full of rich sources of contextual redundancy at many levels of representation including allophonic variation, phonotactics, syllable structure, stress domains, morphology, syntax, semantics and pragmatics. The psycholinguistic literature has tended to concentrate heavily on high level constraints such as semantics and pragmatics and has generally overlooked the usefulness of lower level constraints such as allophonic variation. It has even been said that allophonic variation is a source of confusion or a kind of statistical noise that makes speech recognition that much harder than it already is. In contrast, I argue that aspiration, stop release, flapping, palatalization and other cues that vary systematically with syllabic context can be used to parse syllables and stress domains. These constituents can then constrain the lexical matching process, so that much less search will be required in order to retrieve the correct lexical entry. In this way, syllable structure and stress domains will be proposed as an intermediate level of representation between the phonetic description and the lexicon. My argument is primarily a computational one and will include a discussion of a prototype phonetic parser which has been implemented using simple well- understood parsing mechanisms. No experimental results will be presented.
[1]
W. Christie.
Some cues for syllable juncture perception in English.
,
1974,
The Journal of the Acoustical Society of America.
[2]
D. Klatt,et al.
Word verification in a speech understanding system
,
1976
.
[3]
Jay Earley,et al.
An efficient context-free parsing algorithm
,
1970,
Commun. ACM.
[4]
V. Zue,et al.
The role of phonological rules in speech understanding research
,
1975
.
[5]
Iise Lehiste,et al.
Readings in Acoustic Phonetics
,
1968
.
[6]
Osamu Fujimura,et al.
Syllables as concatenative phonetic units
,
1982
.
[7]
John Fitch,et al.
Course notes
,
1975,
SIGS.
[8]
Kenneth Ward Church.
Phrase-structure parsing: a method for taking advantage of allophonic constraints
,
1983
.
[9]
Martin Kay,et al.
The MIND System
,
1970
.
[10]
Noam Chomsky,et al.
The Sound Pattern of English
,
1968
.
[11]
Jr. Allen Richard Smith,et al.
Word hypothesization for large-vocabulary speech understanding systems.
,
1978
.
[12]
D. Fry.
Duration and Intensity as Physical Correlates of Linguistic Stress
,
1954
.
[13]
A. Smith,et al.
Word hypothesization in the hearsay II speech system
,
1976,
ICASSP.