Car-talk: Location-specific speech production and perception

Abstract: Some locations are probabilistically associated with certain types of speech. For example, most speech encountered in a car has Lombard-like characteristics because it was produced in the context of car noise. We test the hypothesis that this association between cars and Lombard speech triggers Lombard-like speaking and listening behaviour when a person is physically present in a car, even in the absence of noise. Production and perception tasks were conducted, in noise and in quiet, in both a lab and a parked car. The results show that speech produced in a quiet car resembles speech produced in the context of car noise. Additionally, we find tentative evidence that listeners in a quiet car adjust their vowel boundaries in a manner suggesting that they interpret the speech as though it were Lombard speech.
