A new insight into postsurgical objective voice quality evaluation: application to thyroplastic medialization

This paper aims at providing new objective parameters and plots, easily understandable and usable by clinicians and logopaedicians, in order to assess voice quality recovering after vocal fold surgery. The proposed software tool performs presurgical and postsurgical comparison of main voice characteristics (fundamental frequency, noise, formants) by means of robust analysis tools, specifically devoted to deal with highly degraded speech signals as those under study. Specifically, we address the problem of quantifying voice quality, before and after medialization thyroplasty, for patients affected by glottis incompetence. Functional evaluation after thyroplastic medialization is commonly based on several approaches: videolaryngostroboscopy (VLS), for morphological aspects evaluation, GRBAS scale and Voice Handicap Index (VHI), relative to perceptive and subjective voice analysis respectively, and Multi-Dimensional Voice Program (MDVP), that provides objective acoustic parameters. While GRBAS has the drawback to entirely rely on perceptive evaluation of trained professionals, MDVP often fails in performing analysis of highly degraded signals, thus preventing from presurgical/postsurgical comparison in such cases. On the contrary, the new tool, being capable to deal with severely corrupted signals, always allows a complete objective analysis. The new parameters are compared to scores obtained with the GRBAS scale and to some MDVP parameters, suitably modified, showing good correlation with them. Hence, the new tool could successfully replace or integrate existing ones. With the proposed approach, deeper insight into voice recovering and its possible changes after surgery can thus be obtained and easily evaluated by the clinician.

[1]  N. Isshiki,et al.  Anatomic Study for Posterior Medialization Thyroplasty , 1999, The Annals of otology, rhinology, and laryngology.

[2]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[3]  K. Omori,et al.  Quantitative Criteria for Predicting Thyroplasty Type I Outcome , 1996, The Laryngoscope.

[4]  John E. Markel,et al.  Linear Prediction of Speech , 1976, Communication and Cybernetics.

[5]  E Fresnel-Elbaz,et al.  Differentiated perceptual evaluation of pathological voice quality: reliability and correlations with acoustic measurements. , 1996, Revue de laryngologie - otologie - rhinologie.

[6]  Shubha Kadambe,et al.  Application of the wavelet transform for pitch detection of speech signals , 1992, IEEE Trans. Inf. Theory.

[7]  R. Casiano,et al.  Vocal Evaluation of Thyroplasty Type I in the Treatment of Nonparalytic Glottic Incompetence , 1998, The Annals of otology, rhinology, and laryngology.

[8]  Bhaskar D. Rao,et al.  Model based processing of signals: a state space approach , 1992, Proc. IEEE.

[9]  W. Montgomery,et al.  Thyroplasty: A New Approach , 1993, The Annals of otology, rhinology, and laryngology.

[10]  Claudia Manfredi,et al.  Robust techniques for pre- and post-surgical voice analysis , 2003, INTERSPEECH.

[11]  A. Laub,et al.  The singular value decomposition: Its computation and some applications , 1980 .

[12]  W. M. Carey,et al.  Digital spectral analysis: with applications , 1986 .

[13]  Claudia Manfredi,et al.  Adaptive noise energy estimation in pathological speech signals , 2000, IEEE Transactions on Biomedical Engineering.

[14]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  H. Kasuya,et al.  Normalized noise energy as an acoustic measure to evaluate pathologic voice. , 1986, The Journal of the Acoustical Society of America.

[16]  C Manfredi,et al.  A comparative analysis of fundamental frequency estimation methods with application to pathological voices. , 2000, Medical engineering & physics.

[17]  N. Isshiki,et al.  Thyroplasty as a new phonosurgical technique. , 1974, Acta oto-laryngologica.

[18]  C. Piazza,et al.  [Functional results after type I thyroplasty with the Montgomery's prosthesis]. , 2001, Acta otorhinolaryngologica Italica : organo ufficiale della Societa italiana di otorinolaringologia e chirurgia cervico-facciale.

[19]  A Fort,et al.  Parametric and non-parametric estimation of speech formants: application to infant cry. , 1996, Medical engineering & physics.

[20]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[21]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.