In defense of lab speech

Abstract

Lab speech has often been described as unnatural, overly clear, overplanned, monotonous, lacking in rich prosody, and devoid of communicative functions, interactions, and emotions. Along with this view has come a growing preference for examining spontaneous speech directly in order to understand it, especially with regard to its prosody. In this paper I argue that few of the stereotyped characteristics associated with lab speech are warranted. Rather, the quality of lab speech is a matter of experimental design, not of fundamental limitation. More importantly, because it allows systematic experimental control, lab speech is indispensable in our quest to understand the underlying mechanisms of human language. In contrast, although spontaneous speech is rich in various patterns, and so is useful for many purposes, the difficulty of recognizing and controlling the contributing factors makes it less likely than lab speech to lead to true insights about the nature of human speech.
