Characterization of laryngealization as irregular vocal fold vibration and interaction with prosodic prominence

This paper introduces an original variant of recurrence analysis to quantify the degree of regularity of vocal fold vibration as captured by electroglottography during phonation. The proposed technique is applied to the analysis of laryngealized phonation as this phonation type typically shows irregular vibration cycles. The reliability of this approach is validated with synthetic vocal fold vibration signals, demonstrating that it permits measuring the regularity of vocal fold vibration, unaffected by changes in fundamental frequency. The method is also applied to real electroglottographic signals recorded at the onset of vowel-initial nonsense words produced in a speeded repetition task by five female German speakers. Results show that the degree of laryngealization during the production of word-initial vowels is modulated by the presence of stress (with stressed vowels being less laryngealized). Due to its robustness to changes of F0, the proposed technique proves to be a suitable tool for studying vocal fold regularity in concatenated speech. Its applications are not limited to the study of glottalization, since the degree of regularity of vocal fold vibration has paralinguistic functions and is a clinically relevant measure of voice pathologies.

[1]  Sven Grawunder,et al.  Tongue-larynx interactions in the production of word initial laryngealization over different prosodic contexts: A repeated speech experiment , 2014 .

[2]  Norbert Marwan,et al.  Selection of recurrence threshold for signal detection , 2008 .

[3]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[4]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[5]  M. Rothenberg A new inverse-filtering technique for deriving the glottal air flow waveform during voicing. , 1970, The Journal of the Acoustical Society of America.

[6]  Anders Löfqvist,et al.  Vocal tract aerodynamics in /aCa/ utterances: Measurements , 1995, Speech Commun..

[7]  E. Bradley,et al.  Recurrence plots of experimental data: To embed or not to embed? , 1998, Chaos.

[8]  Marc Garellek,et al.  Voice quality strengthening and glottalization , 2014, J. Phonetics.

[9]  Matthew Gordon,et al.  The Phonetics and Phonology of Non-modal Vowels: A Cross-Linguistic Perspective , 1998 .

[10]  Janet Slifka,et al.  Towards models of phonation , 2001, J. Phonetics.

[11]  Christine Mooshammer,et al.  Acoustic and laryngographic measures of the laryngeal reflexes of linguistic prominence and vocal effort in German. , 2010, The Journal of the Acoustical Society of America.

[12]  Peter Ladefoged,et al.  Phonation types: a cross-linguistic overview , 2001, J. Phonetics.

[13]  F. Takens Detecting strange attractors in turbulence , 1981 .

[14]  H. Kantz,et al.  Recurrence plot analysis of nonstationary data: the understanding of curved patterns. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Stefanie Shattuck-Hufnagel,et al.  Variation in the realization of glottalization in normal speakers , 2001, J. Phonetics.

[16]  Douglas M. Bates,et al.  Nonlinear Mixed-Effects Models: Basic Concepts and Motivating Examples , 2000 .

[17]  Hong-Shik Choi,et al.  Voice analysis of patients with diverse types of Reinke's edema and clinical use of electroglottographic measurements , 2006, Acta oto-laryngologica.

[18]  J. Laver The phonetic description of voice quality , 1980 .

[19]  David A. Berry,et al.  Mechanisms of modal and nonmodal phonation , 2001, J. Phonetics.

[20]  Scott Moisik,et al.  The 'Whole Larynx' Approach to Laryngeal Features , 2011, ICPhS.

[21]  Ingo R. Titze,et al.  Principles of voice production , 1994 .

[22]  Abeer Alwan,et al.  An improved correction formula for the estimation of harmonic magnitudes and its application to open quotient estimation , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[23]  Jeffrey S. Morris,et al.  Wavelet‐based functional mixed models , 2006, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[24]  Taehong Cho,et al.  Acoustic and aerodynamic correlates of Korean stops and fricatives , 2002, J. Phonetics.

[25]  H. Abarbanel,et al.  Determining embedding dimension for phase-space reconstruction using a geometrical construction. , 1992, Physical review. A, Atomic, molecular, and optical physics.

[26]  C L Webber,et al.  Dynamical assessment of physiological systems and states using recurrence plot strategies. , 1994, Journal of applied physiology.

[27]  Bryan Gick,et al.  The quantal larynx revisited , 2013 .

[28]  Jorge C Lucero,et al.  Comparison of measures of variability of speech movement trajectories using synthetic records. , 2005, Journal of speech, language, and hearing research : JSLHR.

[29]  Norbert Marwan,et al.  Line structures in recurrence plots , 2005 .

[30]  Susanne Fuchs,et al.  Application of concepts from cross-recurrence analysis in speech production: an overview and comparison with other nonlinear methods. , 2014, Journal of speech, language, and hearing research : JSLHR.

[31]  M P Karnell,et al.  Laryngeal perturbation analysis: minimum length of analysis window. , 1991, Journal of speech and hearing research.

[32]  Robert Sedgewick,et al.  Algorithms in C , 1990 .

[33]  J. Esling,et al.  Modeling the biomechanical influence of epilaryngeal stricture on the vocal folds: a low-dimensional model of vocal-ventricular fold coupling. , 2014, Journal of speech, language, and hearing research : JSLHR.

[34]  J A Kelso,et al.  The production and perception of syllable structure. , 1991, Journal of speech and hearing research.

[35]  M Rothenberg,et al.  Measurement of airflow in speech. , 1977, Journal of speech and hearing research.

[36]  Marzena Zygis,et al.  Glottal Marking of Vowel-initial Words in German , 2011, ICPhS.

[37]  Caroline G. Henton,et al.  Creak as a sociophonetic marker , 1986 .

[38]  Leonardo Lancia,et al.  Measuring laryngealization in running speech: interaction with contrastive tones in yalálag zapotec , 2013, INTERSPEECH.

[39]  T. Y. Kong,et al.  Topological Algorithms for Digital Image Processing , 1996 .

[40]  P. Ladefoged,et al.  The sounds of the world's languages , 1996 .

[41]  Max A. Little,et al.  Exploiting Nonlinear Recurrence and Fractal Scaling Properties for Voice Disorder Detection , 2007, Biomedical engineering online.

[42]  Leonardo Lancia,et al.  Application of Concepts From Cross-Recurrence Analysis in Speech Production: An Overview and Comparison With Other Nonlinear Methods , 2013 .

[43]  M. Beckman,et al.  The Interplay Between Prosodic Structure and Coarticulation , 1993, Language and speech.

[44]  M. Rothenberg,et al.  Monitoring vocal fold abduction through vocal fold contact area. , 1988, Journal of speech and hearing research.

[45]  Julie Roberts AS OLD BECOMES NEW: GLOTTALIZATION IN VERMONT , 2006 .

[46]  Janet Pierrehumbert,et al.  Gesture, Segment, Prosody: Lenition of |h| and glottal stop , 1992 .

[47]  Kyoko Nagao,et al.  Phase transitions in a repetitive speech task as gestural recomposition , 2001 .

[48]  Yi Xu,et al.  Maximum speed of pitch change and how it may relate to speech. , 2002, The Journal of the Acoustical Society of America.

[49]  Janet Slifka,et al.  Some physiological correlates to regular and irregular phonation at the end of an utterance. , 2006, Journal of voice : official journal of the Voice Foundation.

[50]  Jody Kreiman,et al.  Toward a taxonomy of nonmodal phonation , 2001, J. Phonetics.

[51]  R. Colton,et al.  Problems and pitfalls of electroglottography , 1990 .

[52]  R C Scherer,et al.  Required number of tokens to determine representative voice perturbation values. , 1995, Journal of speech and hearing research.

[53]  Peter Ladefoged,et al.  The Features of the Larynx. , 1973 .

[54]  Norbert Marwan,et al.  How to Avoid Potential Pitfalls in Recurrence Plot Based Data Analysis , 2010, Int. J. Bifurc. Chaos.

[55]  Ailbhe Ní Chasaide,et al.  The role of voice quality in communicating emotion, mood and attitude , 2003, Speech Commun..

[56]  Thierry Legou,et al.  Acoustic correlates of masculinity in voices of Lebanese women with Reinke's edema , 2014 .

[57]  J C Lucero,et al.  Time normalization of voice signals using functional data analysis. , 2000, The Journal of the Acoustical Society of America.

[58]  Daniel Silverman,et al.  Laryngeal complexity in Otomanguean vowels , 1997, Phonology.

[59]  M. Thiel,et al.  Cross recurrence plot based synchronization of time series , 2002, physics/0201062.

[60]  John H. Esling,et al.  The valves of the throat and their functioning in tone, vocal register and stress: laryngoscopic case studies , 2006, Phonology.

[61]  Klaus J. Kohler,et al.  Glottal Stops and Glottalization in German , 1994 .

[62]  Jürgen Kurths,et al.  Recurrence plots for the analysis of complex systems , 2009 .

[63]  U Eysholdt,et al.  Spatio-temporal analysis of irregular vocal fold oscillations: biphonation due to desynchronization of spatial modes. , 2001, The Journal of the Acoustical Society of America.

[64]  Barbara Blankenship,et al.  The timing of nonmodal phonation in vowels , 2002, J. Phonetics.

[65]  Cécile Fougeron,et al.  Articulatory properties of initial segments in several prosodic constituents in French , 2001, J. Phonetics.

[66]  Boston Uni,et al.  Glottalization of word-initial vowels as a function of prosodic structure , 1996 .

[67]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..