In Qualified Defense of VOT

The voice onset time (VOT) measure has been said to provide the single most nearly adequate physical basis for separating homorganic stop categories across a variety of languages, granted that other features may also be involved. The finding that transition duration affects the perceived voicing of synthesized initial stops in one specific language, English, led Stevens and Klatt (1974) to hypothesize that a detector responsive to rapid formant-frequency shifts after voice onset explains the child's acquisition of the contrast better than does some mechanism that responds to VOT directly. If such a detector is part of our biological equipment, then it seems remarkably underutilized in language, for the hypothesis asserts that what is basic to voicing perception is whether or not a laryngeal signal is present during the interval in which the stop-vowel transition occurs. In effect, the “archetypical” voiceless stop is aspirated. Not only do many languages lack voiceless aspirates, but even in English aspiration is severely restricted. Of course the VOT measure has its limitations: it is inapplicable to prepausal stops. However, there are much more serious difficulties with the posited detector, since even for the English initial stops there is evidence that the presence of a voiced first-formant transition is not required for the perception of /bdg/, nor does the absence of such a transition necessarily yield /ptk/, provided that appropriate VOT values are supplied.

[1] L. Lisker, et al. Letter: Is it VOT or a first-formant transition detector?, 1975, The Journal of the Acoustical Society of America.

[2] L. Lisker, et al. Some Effects of Context on Voice Onset Time in English Stops, 1967, Language and Speech.

[3] A. M. Liberman, et al. On pushing the voice-onset-time (VOT) boundary about, 1975, Language and Speech.

[4] H. Winitz, et al. Variations in VOT for English initial stops, 1975.

[5] M. Haggard, et al. Pitch as a voicing cue, 1970, The Journal of the Acoustical Society of America.

[6] L. Lisker, et al. A Cross-Language Study of Voicing in Initial Stops: Acoustical Measurements, 1964.

[7] G. E. Peterson, et al. Duration of Syllable Nuclei in English, 1960.

[8] M. Haggard, et al. Perceptual processing of multiple cues and contexts: effects of following vowel upon stop consonant voicing, 1974.

[9] A. Liberman, et al. Some Cues for the Distinction Between Voiced and Voiceless Stops in Initial Position, 1957.

[10] G. E. Peterson, et al. Transitions, Glides, and Diphthongs, 1961.

[11] Q. Summerfield, et al. On the dissociation of spectral and temporal cues to the voicing distinction in initial stop consonants, 1977, The Journal of the Acoustical Society of America.

[12] K. Stevens, et al. Role of formant transitions in the voiced-voiceless distinction for stops, 1974, The Journal of the Acoustical Society of America.

[13] A. Liberman, et al. Some Cues for the Distinction Between Voiced and Voiceless Stops in Initial Position, 1958.

[14] A. Abramson, et al. Laryngeal Timing in Consonant Distinctions, 1977, Phonetica: International Journal of Phonetic Science.

[15] Alvin M. Liberman, et al. On Pushing the Voice-Onset-Time (VOT) Boundary About, 1977, Language and Speech.