To what degree of voice perturbation are jitter measurements valid? A novel approach with synthesized vowels and visuo-perceptual pattern recognition

Abstract Objective measurement of the severity of dysphonia typically requires signal processing algorithms applied to acoustic recordings. Since Lieberman (1963) introduced the concept of perturbation analysis in the area of voice, the best-known acoustic parameter in clinical practice is conventional jitter . However, jitter measurements have some critical limitations. According to a widely accepted guideline, in sustained vowels of dysphonic voices, only perturbation measures less than about 5% are reliable: this is related to period extraction methods. This limit of 5% deserves critical analysis, certainly when there are indications that some acoustic analysis programs can be applied to quite irregular voices such as substitution voices. The present experiment demonstrates that – on signals of synthesized deviant voices (sustained vowel) with moderate additive noise – different raters are able to visually identify in a very consistent way the period durations of successive cycles up to values of about 13% jitter. Furthermore, even for higher values – over 30% – the jitter % computed with the period values rated by visual perception is, for some of the raters, very comparable to the real value. This suggests that improved acoustic programs using more reliable algorithms could validly transgress the traditional limit of 5% if they demonstrate the correspondence of their computations with the true jitter values. This is now made possible by synthesizers generating artificial deviant voices that cannot be distinguished from true dysphonia, and in which the jitter put in is exactly known.

[1]  I. Titze The myoelastic aerodynamic theory of phonation , 2006 .

[2]  J Schoentgen Stochastic models of jitter. , 2001, The Journal of the Acoustical Society of America.

[3]  Jean Schoentgen,et al.  Perceived naturalness of a synthesizer of disordered voices , 2009, INTERSPEECH.

[4]  F. Hilgers,et al.  Acoustical analysis and perceptual evaluation of tracheoesophageal prosthetic voice. , 1998, Journal of voice : official journal of the Voice Foundation.

[5]  Jean Schoentgen,et al.  Shaping function models of the phonatory excitation signal. , 2003, The Journal of the Acoustical Society of America.

[6]  P. Dejonckere,et al.  A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques , 2001, European Archives of Oto-Rhino-Laryngology.

[7]  Jean-Pierre Martens,et al.  Objective evaluation of the quality of substitution voices , 2004, European Archives of Oto-Rhino-Laryngology and Head & Neck.

[8]  Jean Schoentgen,et al.  Synthèse des voix pathologiques , 2010 .

[9]  Jean Schoentgen,et al.  Evaluation of a Synthesizer of Disordered Voices , 2009 .

[10]  A. Schindler,et al.  Vocal improvement after voice therapy in unilateral vocal fold paralysis. , 2008, Journal of voice : official journal of the Voice Foundation.

[11]  Rabab Kreidieh Ward,et al.  Obtaining LIP and Glottal Reflection Coefficients from Vowel Sounds , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[12]  Rick M Roark,et al.  Frequency and voice: perspectives in the time domain. , 2006, Journal of voice : official journal of the Voice Foundation.

[13]  John Nicholas Holmes,et al.  Speech synthesis , 1972 .

[14]  E. Hoffman,et al.  Vocal tract area functions from magnetic resonance imaging. , 1996, The Journal of the Acoustical Society of America.

[15]  Jack J. Jiang,et al.  Nonlinear dynamic analysis of disordered voice: the relationship between the correlation dimension (D2) and pre-/post-treatment change in perceived dysphonia severity. , 2010, Journal of voice : official journal of the Voice Foundation.

[16]  P. Lieberman Some Acoustic Measures of the Fundamental Periodicity of Normal and Pathologic Larynges , 1963 .

[17]  P H Dejonckere,et al.  Documentation of progress in voice therapy: perceptual, acoustic, and laryngostroboscopic findings pretherapy and posttherapy. , 2004, Journal of voice : official journal of the Voice Foundation.

[18]  I. Titze,et al.  Comparison of Fo extraction methods for high-precision voice perturbation measurements. , 1993, Journal of speech and hearing research.

[19]  G Molenberghs,et al.  The dysphonia severity index: an objective measure of vocal quality based on a multiparameter approach. , 2000, Journal of speech, language, and hearing research : JSLHR.