Pathological Voice Source Analysis System Using a Flow Waveform-Matched Biomechanical Model

Voice production occurs through vocal cord and vibration coupled to glottal airflow. Vocal cord lesions affect the vocal system and lead to voice disorders. In this paper, a pathological voice source analysis system is designed. This study integrates nonlinear dynamics with an optimized asymmetric two-mass model to explore nonlinear characteristics of vocal cord vibration, and changes in acoustic parameters, such as fundamental frequency, caused by distinct subglottal pressure and varying degrees of vocal cord paralysis are analyzed. Various samples of sustained vowel /a/ of normal and pathological voices were extracted from MEEI (Massachusetts Eye and Ear Infirmary) database. A fitting procedure combining genetic particle swarm optimization and a quasi-Newton method was developed to optimize the biomechanical model parameters and match the targeted voice source. Experimental results validate the applicability of the proposed model to reproduce vocal cord vibration with high accuracy, and show that paralyzed vocal cord increases the model coupling stiffness.

[1]  Jack J Jiang,et al.  Chaos in voice, from modeling to measurement. , 2006, Journal of voice : official journal of the Voice Foundation.

[2]  Gaëtan Kerschen,et al.  Vibrational dynamics of vocal folds using nonlinear normal modes. , 2013, Medical engineering & physics.

[3]  Heinz Ulrich Hoppe,et al.  Vibration parameter extraction from endoscopic image series of the vocal folds , 2002, IEEE Transactions on Biomedical Engineering.

[4]  Julio C. Ho,et al.  Observation and analysis of in vivo vocal fold tissue instabilities produced by nonlinear source-filter coupling: a case study. , 2011, The Journal of the Acoustical Society of America.

[5]  R. Mittal,et al.  Computational modeling of phonatory dynamics in a tubular three-dimensional model of the human larynx. , 2012, The Journal of the Acoustical Society of America.

[6]  Carlos Dias Maciel,et al.  Analysis of nonlinear dynamics of vocal folds using high-speed video observation and biomechanical modeling , 2012, Digit. Signal Process..

[7]  Jack J Jiang,et al.  Chaotic vibrations of a vocal fold model with a unilateral polyp. , 2004, The Journal of the Acoustical Society of America.

[8]  Yu Zhang,et al.  Nonlinear detection of disordered voice productions from short time series based on a Volterra–Wiener–Korenberg model , 2011 .

[9]  Christian Soize,et al.  Voice Signals Produced With Jitter Through a Stochastic One-mass Mechanical Model. , 2017, Journal of voice : official journal of the Voice Foundation.

[10]  Abeer Alwan,et al.  A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model. , 2011, The Journal of the Acoustical Society of America.

[11]  M. Döllinger,et al.  Vocal fold vibration amplitude, open quotient, speed quotient and their variability along glottal length: Kymographic data from normal subjects , 2013, Logopedics, phoniatrics, vocology.

[12]  Jody Kreiman,et al.  Acoustic and perceptual effects of changes in body layer stiffness in symmetric and asymmetric vocal fold models. , 2013, The Journal of the Acoustical Society of America.

[13]  David E. Sommer,et al.  Corrected contact dynamics for the Steinecke and Herzel asymmetric two-mass model of the vocal folds. , 2012, The Journal of the Acoustical Society of America.

[14]  M. Sabarimalai Manikandan,et al.  Effective Glottal Instant Detection and Electroglottographic Parameter Extraction for Automated Voice Pathology Assessment , 2018, IEEE Journal of Biomedical and Health Informatics.

[15]  Dong Zhang,et al.  Nonlinear oscillation of pathological vocal folds during vocalization , 2013 .

[16]  Giuseppe De Pietro,et al.  A methodology for voice classification based on the personalized fundamental frequency estimation , 2018, Biomed. Signal Process. Control..

[17]  Chun Lu,et al.  An improved GA and a novel PSO-GA-based hybrid algorithm , 2005, Inf. Process. Lett..

[18]  Leonardo Lancia,et al.  Characterization of laryngealization as irregular vocal fold vibration and interaction with prosodic prominence , 2016, J. Phonetics.

[19]  Paavo Alku,et al.  Parameterisation Methods of the Glottal Flow Estimated by Inverse Filtering , 1985 .

[20]  R Mittal,et al.  A computational study of the effect of vocal-fold asymmetry on phonation. , 2010, The Journal of the Acoustical Society of America.

[21]  Jack J Jiang,et al.  Optimized Nonlinear Dynamic Analysis of Pathologic Voices With Laryngeal Paralysis Based on the Minimum Embedding Dimension. , 2017, Journal of voice : official journal of the Voice Foundation.

[22]  Tobias Riede,et al.  Functional morphology of the Alligator mississippiensis larynx with implications for vocal production , 2015, The Journal of Experimental Biology.

[23]  Yu Zhang,et al.  Extracting Physiologically Relevant Parameters of Vocal Folds From High-Speed Video Image Series , 2007, IEEE Transactions on Biomedical Engineering.

[24]  Jack J Jiang,et al.  Nonlinear dynamic-based analysis of severe dysphonia in patients with vocal fold scar and sulcus vocalis. , 2012, Journal of voice : official journal of the Voice Foundation.

[25]  Edson Cataldo,et al.  Non-stationary Bayesian estimation of parameters from a body cover model of the vocal folds. , 2016, The Journal of the Acoustical Society of America.

[26]  Vincent Visseq,et al.  A numerical strategy for finite element modeling of frictionless asymmetric vocal fold collision. , 2017, International journal for numerical methods in biomedical engineering.

[27]  María Victoria Rodellar Biarge,et al.  Glottal Source biometrical signature for voice pathology detection , 2009, Speech Commun..