Aerodynamics and Lumped-Masses Combined with Delay Lines for Modeling Vertical and Anterior-Posterior Phase Differences in Pathological Vocal Fold Vibration

We discuss the representation of anterior-posterior (A-P) phase differences in vocal cord oscillations through a numerical biomechanical model involving lumped elements as well as distributed elements, i.e., delay lines. A dynamic glottal source model is illustrated in which the fold displacement along the vertical and the longitudinal dimensions is explicitly modeled by numerical waveguide components representing the propagation on the fold cover tissue. In contrast to other models of the same class, in which the reproduction of longitudinal phase differences are intrinsically impossible (e.g., in two-mass models) or not easy to control explicitely (e.g., in 3D 16-mass and multi-mass models in general), the one proposed here provides direct control over the amount of phase delay between folds oscillations at the posterior and anterior side of the glottis, while keeping the dynamic model simple and computationally efficient. The model is assessed by addressing the reproduction of typical oscillatory patterns observed in high-speed videoendoscopic data, in which A-P phase differences are observed. Experimental results are provided which demonstrate the ability of the approach to effectively reproduce different oscillatory patterns of the vocal folds.

[1]  Terri Treman Gerlach,et al.  Phase asymmetries in normophonic speakers: visual judgments and objective findings. , 2008, American journal of speech-language pathology.

[2]  van Rr René Hassel,et al.  Theoretical and experimental study of quasisteady‐flow separation within the glottis during phonation. Application to a modified two‐mass model , 1994 .

[3]  I. Titze The physics of small-amplitude oscillation of the vocal folds. , 1988, The Journal of the Acoustical Society of America.

[4]  Seiji Niimi,et al.  Vocal Fold Vibration and Voice Quality , 1999, Folia Phoniatrica et Logopaedica.

[5]  Philipp Aichinger,et al.  A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices , 2016, LREC.

[6]  D. Bless,et al.  Vocal fold vibratory characteristics in normal female speakers from high-speed digital imaging. , 2012, Journal of voice : official journal of the Voice Foundation.

[7]  J M Festen,et al.  Deviant vocal fold vibration as observed during videokymography: the effect on voice quality. , 2001, Journal of voice : official journal of the Voice Foundation.

[8]  Rita R. Patel,et al.  Biomechanical simulation of vocal fold dynamics in adults based on laryngeal high-speed videoendoscopy , 2017, PloS one.

[9]  Shigeru Kiritani,et al.  Simultaneous high-speed digital recording of vocal fold vibration and speech signal , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  J. Flanagan,et al.  Synthesis of voiced sounds from a two-mass model of the vocal cords , 1972 .

[11]  H. Larsson,et al.  Vocal Fold Vibrations: High‐Speed Imaging, Kymography, and Acoustic Analysis: A Preliminary Report , 2000, The Laryngoscope.

[12]  Yuling Yan,et al.  Quantitative analysis of diplophonic vocal fold vibratary pattern from high-speed digital imaging of glottis , 2009, MAVEBA.

[13]  N. Tayama,et al.  Quantification of Vocal Fold Vibration in Various Laryngeal Disorders Using High-Speed Digital Imaging. , 2016, Journal of voice : official journal of the Voice Foundation.

[14]  P. Carding,et al.  Vocim analysis of laryngeal images: is breathiness related to the glottic area? , 1998, Clinical otolaryngology and allied sciences.

[15]  J. Lucero,et al.  Self-entrainment of the right and left vocal fold oscillators. , 2015, The Journal of the Acoustical Society of America.

[16]  Dimitar D Deliyski,et al.  Analysis of longitudinal phase differences in vocal-fold vibration using synchronous high-speed videoendoscopy and electroglottography. , 2012, Journal of voice : official journal of the Voice Foundation.

[17]  S Niimi,et al.  [Relation between voice quality and pathological vibratory patterns using high-speed digital imaging]. , 1999, Nihon Jibiinkoka Gakkai kaiho.

[18]  A. Alwan,et al.  Variability in the relationships among voice quality, harmonic amplitudes, open quotient, and glottal area waveform shape in sustained phonation. , 2012, The Journal of the Acoustical Society of America.

[19]  T. Koizumi,et al.  Two-mass models of the vocal cords for natural sounding voice synthesis. , 1987, The Journal of the Acoustical Society of America.

[20]  Carlos Dias Maciel,et al.  Analysis of nonlinear dynamics of vocal folds using high-speed video observation and biomechanical modeling , 2012, Digit. Signal Process..

[21]  Carlo Drioli A flow waveform-matched low-dimensional glottal model based on physical knowledge. , 2005, The Journal of the Acoustical Society of America.

[22]  N. Tayama,et al.  Relationship of Various Open Quotients With Acoustic Property, Phonation Types, Fundamental Frequency, and Intensity. , 2016, Journal of voice : official journal of the Voice Foundation.

[23]  Abeer Alwan,et al.  Acoustic Correlates of Glottal Gaps , 2011, INTERSPEECH.

[24]  Rita R. Patel,et al.  Differential Vibratory Characteristics of Adductor Spasmodic Dysphonia and Muscle Tension Dysphonia on High-Speed Digital Imaging , 2011, The Annals of otology, rhinology, and laryngology.

[25]  D. Chhetri,et al.  Dynamics of phonatory posturing at phonation onset , 2016, The Laryngoscope.

[26]  D. Bless,et al.  Vocal fold vibratory characteristics of healthy geriatric females--analysis of high-speed digital images. , 2012, Journal of voice : official journal of the Voice Foundation.

[27]  Carlo Drioli,et al.  Accurate glottal model parametrization by integrating audio and high-speed endoscopic video data , 2015, Signal Image Video Process..

[28]  Philipp Aichinger,et al.  Comparison of an audio-based and a video-based approach for detecting diplophonia , 2017, Biomed. Signal Process. Control..

[29]  A. Alwan,et al.  Development of a glottal area index that integrates glottal gap size and open quotient. , 2013, The Journal of the Acoustical Society of America.