Voice Onset Time (VOT) at 50: Theoretical and practical issues in measuring voicing distinctions

Just over fifty years ago, Lisker and Abramson proposed voice onset time (VOT), a straightforward measure of the acoustic differences among stop consonants of different voicing categories. Since then, hundreds of studies have used this measure. Here, we review the original definition of VOT, propose some extensions to it, and discuss some problematic cases. We propose a set of terms for the most important aspects of VOT and a set of Praat labels that could provide some consistency for future cross-study analyses. Although other aspects of the realization of voicing distinctions (F0, amplitude, duration of voicelessness) could be added, we reject them as adding too much complexity to what has turned out to be one of the most frequently used metrics in phonetics and phonology.
