Hyperarticulation in Lombard speech: Global coordination of the jaw, lips and the tongue.

Over the last century, researchers have collected a considerable amount of data reflecting the properties of Lombard speech, i.e., speech in a noisy environment. The documented phenomena predominately report effects on the speech signal produced in ambient noise. In comparison, relatively little is known about the underlying articulatory patterns of Lombard speech, in particular for lingual articulation. Here the authors present an analysis of articulatory recordings of speech material in babble noise of different intensity levels and in hypoarticulated speech and report quantitative differences in relative expansion of movement of different articulatory subsystems (the jaw, the lips and the tongue) as well as in relative expansion of utterance duration. The trajectory modifications for one articulator can be relatively reliably predicted by those for another one, but subsystems differ in a degree of continuity in trajectory expansion elicited across different noise levels. Regression analysis of articulatory modifications against durational expansion shows further qualitative differences between the subsystems, namely, the jaw and the tongue. The findings are discussed in terms of possible influences of a combination of prosodic, segmental, and physiological factors. In addition, the Lombard effect is put forward as a viable methodology for eliciting global articulatory variation in a controlled manner.

[1]  T. Hothorn,et al.  Simultaneous Inference in General Parametric Models , 2008, Biometrical journal. Biometrische Zeitschrift.

[2]  Maeva Garnier,et al.  May speech modifications in noise contribute to enhance audio-visible cues to segment perception? , 2008, AVSP.

[3]  Shinichi Nakagawa,et al.  A general and simple method for obtaining R2 from generalized linear mixed‐effects models , 2013 .

[4]  J C Junqua,et al.  The Lombard reflex and its role on human listeners and automatic speech recognizers. , 1993, The Journal of the Acoustical Society of America.

[5]  B. Lindblom,et al.  Acoustical consequences of lip, tongue, jaw, and larynx movement. , 1970, The Journal of the Acoustical Society of America.

[6]  Jeesun Kim,et al.  Lombard speech: Auditory (A), Visual (V) and AV effects , 2006 .

[7]  D. Dubois,et al.  Influence of sound immersion and communicative interaction on the Lombard effect. , 2010, Journal of speech, language, and hearing research : JSLHR.

[8]  Jeesun Kim,et al.  A visual concomitant of the Lombard reflex , 2005, AVSP.

[9]  Martin Cooke,et al.  Speech production modifications produced by competing talkers, babble, and stationary noise. , 2008, The Journal of the Acoustical Society of America.

[10]  Anja Geumann,et al.  Invariance and variability in articulation and acoustics of natural perturbed speech , 2001 .

[11]  John H. L. Hansen,et al.  Analysis and compensation of stressed and noisy speech with application to robust automatic recognition , 1988 .

[12]  K. D. Jong The supraglottal articulation of prominence in English: Linguistic stress as localized hyperarticulation , 1995 .

[13]  Lucie Ménard,et al.  Effect of being seen on the production of visible speech cues. A pilot study on Lombard speech , 2012, INTERSPEECH.

[14]  D. Bates,et al.  fitting linear mixed effects models using lme 4 arxiv , 2014 .

[15]  Mary J Lindstrom,et al.  Tongues and lips without jaws: a comparison of methods for decoupling speech movements. , 2002, Journal of speech, language, and hearing research : JSLHR.

[16]  Z S Bond,et al.  Acoustic-phonetic characteristics of speech produced in noise and while wearing an oxygen mask. , 1989, The Journal of the Acoustical Society of America.

[17]  J. Abbs,et al.  Labial-Mandibular Coordination in the Production of Speech: Implications for the Operation of Motor Equivalence , 1976, Phonetica.

[18]  Bharath Chandrasekaran,et al.  Effects of increasing sound pressure level on lip and jaw movement parameters and consistency in young adults. , 2006, Journal of speech, language, and hearing research : JSLHR.

[19]  Milos Cernak,et al.  Effective Triphone Mapping for Acoustic Modeling in Speech Recognition , 2011, INTERSPEECH.

[20]  R. H. Bernacki,et al.  Effects of noise on speech production: acoustic and perceptual analyses. , 1988, The Journal of the Acoustical Society of America.

[21]  C. Kroos,et al.  Are there compensatory effects in natural speech , 1999 .

[22]  Kristin L. Greilick,et al.  Acoustic and articulatory features of diphthong production: a speech clarity study. , 2010, Journal of speech, language, and hearing research : JSLHR.

[23]  M. Beckman,et al.  Manner and place conflicts in the articulation of accent in Australian English , 2001 .

[24]  Pascal van Lieshout,et al.  A comparison of methods for decoupling tongue and lower lip from jaw movements in 3D articulography. , 2013 .

[25]  Michael I. Jordan,et al.  Trading relations between tongue-body raising and lip rounding in production of the vowel /u/: a pilot "motor equivalence" study. , 1993, The Journal of the Acoustical Society of America.

[26]  Björn Lindblom,et al.  Explaining Phonetic Variation: A Sketch of the H&H Theory , 1990 .

[27]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[28]  Paavo Alku,et al.  Effect of noise type and level on focus related fundamental frequency changes , 2012, INTERSPEECH.

[29]  John H. L. Hansen,et al.  Analysis and Compensation of Lombard Speech Across Noise Type and Levels With Application to In-Set/Out-of-Set Speaker Recognition , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[30]  Martin Cooke,et al.  The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise , 2009, Speech Commun..

[31]  Jean-Claude Junqua,et al.  The influence of acoustics on speech production: A noise-induced stress phenomenon known as the Lombard reflex , 1996, Speech Commun..

[32]  John H. L. Hansen,et al.  Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition , 1996, Speech Commun..

[33]  Pascal van Lieshout,et al.  A comparison of methods for decoupling tongue and lower lip from jaw movements in 3D articulography. , 2013, Journal of speech, language, and hearing research : JSLHR.

[34]  Phil Hoole,et al.  Five-dimensional articulography , 2009 .

[35]  John H. L. Hansen,et al.  Classification of speech under stress using target driven features , 1996, Speech Commun..

[36]  R. Schulman,et al.  Articulatory dynamics of loud and normal speech. , 1989, The Journal of the Acoustical Society of America.

[37]  S. Tasko,et al.  Variations in articulatory movement with changes in speech task. , 2004, Journal of speech, language, and hearing research : JSLHR.

[38]  Marion Dohen,et al.  An acoustic and articulatory study of Lombard speech: global effects on the utterance , 2006, INTERSPEECH.

[39]  Martti Vainio,et al.  Hyperarticulation in Lombard speech: A preliminary study , 2014 .