Development of a time-dependent numerical model for the assessment of non-stationary pharyngoesophageal tissue vibrations after total laryngectomy

Laryngeal cancer due to, e.g., extensive smoking and/or alcohol consumption can necessitate the excision of the entire larynx. After such a total laryngectomy, the voice generating structures are lost and with that the quality of life of the concerning patients is drastically reduced. However, the vibrations of the remaining tissue in the so called pharyngoesophageal (PE) segment can be applied as alternative sound generator. Tissue, scar, and geometric aspects of the PE-segment determine the postoperative substitute voice characteristic, being highly important for the future live of the patient. So far, PE-dynamics are simulated by a biomechanical model which is restricted to stationary vibrations, i.e., variations in pitch and amplitude cannot be handled. In order to investigate the dynamical range of PE-vibrations, knowledge about the temporal processes during substitute voice production is of crucial interest. Thus, time-dependent model parameters are suggested in order to quantify non-stationary PE-vibrations and drawing conclusions on the temporal characteristics of tissue stiffness, oscillating mass, pressure, and geometric distributions within the PE-segment. To adapt the numerical model to the PE-vibrations, an automatic, block-based optimization procedure is applied, comprising a combined global and local optimization approach. The suggested optimization procedure is validated with 75 synthetic data sets, simulating non-stationary oscillations of differently shaped PE-segments. The application to four high-speed recordings is shown and discussed. The correlation between model and PE-dynamics is $$\ge $$≥ 97 %.

[1]  R. Brent Table errata: Algorithms for minimization without derivatives (Prentice-Hall, Englewood Cliffs, N. J., 1973) , 1975 .

[2]  D. Berry,et al.  Computation of physiological human vocal fold parameters by mathematical optimization of a biomechanical model. , 2011, The Journal of the Acoustical Society of America.

[3]  Britta Hammarberg,et al.  The pharyngoesophageal segment in laryngectomees—videoradiographic, acoustic, and voice quality perceptual data , 2008, Logopedics, phoniatrics, vocology.

[4]  J. Flanagan,et al.  Synthesis of voiced sounds from a two-mass model of the vocal cords , 1972 .

[5]  Mingxi Wan,et al.  Aerodynamic characteristics of laryngectomees breathing quietly and speaking with the electrolarynx. , 2004, Journal of voice : official journal of the Voice Foundation.

[6]  M. Singer,et al.  Voice rehabilitation after total laryngectomy. , 1983, The Journal of otolaryngology.

[7]  Heinz Ulrich Hoppe,et al.  Vibration parameter extraction from endoscopic image series of the vocal folds , 2002, IEEE Transactions on Biomedical Engineering.

[8]  J Lohscheller,et al.  The Laryngectomee Substitute Voice: Image Processing of Endoscopic Recordings by Fusion with Acoustic Signals , 2003, Methods of Information in Medicine.

[9]  Manfred Kaltenbacher,et al.  Three-dimensional biomechanical properties of human vocal folds: parameter optimization of a numerical model to match in vitro dynamics. , 2012, The Journal of the Acoustical Society of America.

[10]  Luigi Bartolomei,et al.  Botulinum toxin type A: an effective treatment to restore phonation in laryngectomized patients unable to voice , 2011, Neurological Sciences.

[11]  M. Döllinger,et al.  Model-based classification of nonstationary vocal fold vibrations. , 2006, The Journal of the Acoustical Society of America.

[12]  Jigeeshu V Divatia,et al.  Pharyngoesophageal segment manometry : Its role in determining post-laryngectomy speech , 1995 .

[13]  Heinz Ulrich Hoppe,et al.  Quantitative investigation of the vibration pattern of the substitute voice generator , 2004, IEEE Transactions on Biomedical Engineering.

[14]  Michael Döllinger,et al.  Spatiotemporal classification of vocal fold dynamics by a multimass model comprising time-dependent parameters. , 2008, The Journal of the Acoustical Society of America.

[15]  U Eysholdt,et al.  High-speed digital imaging of neoglottic vibration after total laryngectomy. , 1999, Archives of otolaryngology--head & neck surgery.

[16]  M. Doellinger,et al.  The Next Step in Voice Assessment: High-Speed Digital Endoscopy and Objective Evaluation , 2009 .

[17]  Raphael Schwarz,et al.  Model-based Quantification of Pathological Voice Production , 2007 .

[18]  Lester Ingber,et al.  Adaptive Simulated Annealing , 2012 .

[19]  E. Alexopoulos,et al.  Survival after laryngectomy: a review of 133 patients with laryngeal carcinoma , 2010, European Archives of Oto-Rhino-Laryngology.

[20]  Alexander Sutor,et al.  Optical Reconstruction of High-Speed Surface Dynamics in an Uncontrollable Environment , 2010, IEEE Transactions on Medical Imaging.

[21]  Elmar Nöth,et al.  Application of Automatic Speech Recognition to Quantitative Assessment of Tracheoesophageal Speech with Different Signal Quality , 2008, Folia Phoniatrica et Logopaedica.

[22]  B Weinberg,et al.  Aerodynamic and myoelastic contributions to tracheoesophageal voice production. , 1987, Journal of speech and hearing research.

[23]  Ulrich Eysholdt,et al.  Spatio-temporal quantification of vocal fold vibrations using high-speed videoendoscopy and a biomechanical model. , 2008, The Journal of the Acoustical Society of America.

[24]  Ulrich Eysholdt,et al.  The Pitch Rise Paradigm: A New Task for Real-Time Endoscopy of Non-Stationary Phonation , 2006, Folia Phoniatrica et Logopaedica.

[25]  Frank Rosanowski,et al.  Quality of Life in Laryngectomees after Prosthetic Voice Restoration , 2003, Folia Phoniatrica et Logopaedica.

[26]  Kathrin Klamroth,et al.  Registration of PE segment contour deformations in digital high-speed videos , 2008, Medical Image Anal..

[27]  Snehal G. Patel,et al.  Cancer of the Head and Neck , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[28]  Frank Rosanowski,et al.  Quantitative detection of substitute voice generator during phonation in patients undergoing laryngectomy. , 2005, Archives of otolaryngology--head & neck surgery.

[29]  Ingo R. Titze,et al.  Principles of voice production , 1994 .

[30]  R. Dantas,et al.  Relation between the dimensions and intraluminal pressure of the pharyngoesophageal segment and tracheoesophageal voice and speech proficiency , 2013, Head & neck.

[31]  J. Lucero,et al.  Simulations of temporal patterns of oral airflow in men and women using a two-mass model of the vocal folds under dynamic control. , 2005, The Journal of the Acoustical Society of America.

[32]  Maria Schuster,et al.  Substitute Voice Production: Quantification of PE Segment Vibrations Using a Biomechanical Model , 2011, IEEE Transactions on Biomedical Engineering.