Effects of speaking rate on the vowel length distinction in Japanese

Abstract The issue of acoustic invariance was explored with the question of whether an absolute or relative duration value can be found to reliably classify the Japanese phonemic short and long vowels produced across different speaking rates. The stimuli were Japanese disyllabic non-words (Experiment 1) and real words (Experiment 2). Four native speakers produced these words in a carrier sentence at slow, normal, and fast rates. The duration of accented and unaccented short vowels in CVCV contexts and contrasting long vowels in CVVCV and CVCVV contexts, as well as word duration, was measured. Rate changes affected the duration of long vowels more than short vowels in such a way that the duration difference between short and long vowels was greater for slow than normal, and for normal than fast speech. In contrast with the absolute durations, the ratios of long-to-short vowels and three-mora (CVVCV or CVCVV) to two-mora (CVCV) words were less affected by rate changes. Furthermore, the proportion of the vowel to the total word duration was found to distinguish the two vowel length categories across three rates with high accuracy. The results support the view that “relational” acoustic invariance exists that remains stable across speakers and rates.

[1]  Haruo Kubozono The organization of Japanese prosody , 1987 .

[2]  Sheila E. Blumstein,et al.  Effects of Speaking Rate on the Singleton/Geminate Consonant Contrast in Italian , 1999, Phonetica.

[3]  Sheila E. Blumstein,et al.  Effects of speaking rate on the vowel length distinction in Korean , 1991 .

[4]  Douglas H. Whalen,et al.  Vowel and consonant judgments are not independent when cued by the same information , 1989 .

[5]  Mieko S. Han,et al.  Acoustic manifestations of mora timing in Japanese , 1994 .

[6]  R. Port,et al.  Evidence for mora timing in Japanese. , 1987, The Journal of the Acoustical Society of America.

[7]  Yuko Kondo,et al.  Production of schwa by Japanese speakers of English : a crosslinguistic study of coarticulatory strategies , 1995 .

[8]  G. Fant,et al.  Auditory analysis and perception of speech , 1975 .

[9]  Kenneth de Jong,et al.  Comparing stress, lexical focus, and segmental focus: patterns of variation in Arabic vowel duration , 2002, J. Phonetics.

[10]  C. Fowler Segmentation of coarticulated speech in perception , 1984, Perception & psychophysics.

[11]  A. Liberman,et al.  The motor theory of speech perception revised , 1985, Cognition.

[12]  Keisuke Kinoshita,et al.  Duration and F0 as perceptual cues to Japanese vowel quantity , 2002, INTERSPEECH.

[13]  M. L. Katyanee Svastikula,et al.  A PERCEPTUAL AND ACOUSTIC STUDY OF THE EFFECTS OF SPEECH RATE ON DISTINCTIVE VOWEL LENGTH IN THAI , 1986 .

[14]  H. Sussman,et al.  An investigation of locus equations as a source of relational invariance for stop place categorization , 1991 .

[15]  D. Whalen,et al.  Perception of the English /s/–/∫/ distinction relies on fricative noises and transitions, not on brief spectral slices , 1991 .

[16]  J Pind Speech segment durations and quantity in Icelandic. , 1999, The Journal of the Acoustical Society of America.

[17]  J. Pind,et al.  Speaking rate, voice-onset time, and quantity: The search for higher-order invariants for two Icelandic speech cues , 1995, Perception & psychophysics.

[18]  Timothy J. Vance,et al.  An introduction to Japanese phonology , 1987 .

[19]  Haruo Kubozono,et al.  Temporal neutralization in Japanese , 2002 .

[20]  C A Fowler,et al.  Invariants, specifiers, cues: An investigation of locus equations as information for place of articulation , 1994, Perception & psychophysics.

[21]  A. Liberman,et al.  On the relation of speech to language , 2000, Trends in Cognitive Sciences.

[22]  A. Reeves,et al.  Speaking Rate and Segments: A Look at the Relation between Speech Production and Speech Perception for the Voicing Contrast , 1986 .

[23]  Robert F. Port,et al.  Temporal Compensation and Universal Phonetics , 1980 .

[24]  S. Blumstein,et al.  A reconsideration of acoustic invariance for place of articulation in diffuse stop consonants: evidence from a cross-language study. , 1981, The Journal of the Acoustical Society of America.

[25]  R. Port,et al.  Consonant/vowel ratio as a cue for voicing in English , 1982, Perception & psychophysics.

[26]  R. Port,et al.  The influence of speaking tempo on the duration of stressed vowel and medial stop in English trochee words , 1976 .

[27]  John Kingston,et al.  Papers in Laboratory Phonology: Index of names , 1990 .

[28]  V. Boucher Timing relations in speech and the identification of voice-onset times: A stable perceptual boundary for voicing categories across speaking rates , 2002, Perception & psychophysics.

[29]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[30]  Y. Homma,et al.  Durational relationship between Japanese stops and vowels , 1981 .

[31]  H. Fujisaki,et al.  Auditory Perception of Duration of Speech and Non-Speech Stimuli , 1975 .

[32]  C. Fowler An event approach to the study of speech perception from a direct realist perspective , 1986 .

[33]  C A Fowler,et al.  Perceptual parsing of acoustic consequences of velum lowering from information for vowels , 2000, Perception & psychophysics.

[34]  G. E. Peterson,et al.  Duration of Syllable Nuclei in English , 1960 .