Comparative assessment of electroglottographic and acoustic measures of jitter in pathological voices.

Jitter, or the amount of cycle-to-cycle variation in the fundamental frequency, is a characteristic of the vocal folds' vibration that may affect electroglottographic (EGG) and acoustic signals in similar ways, because the translaryngeal conductance and the oral pressure are modulated by the same physiological mechanism. Despite the apparent simplicity in jitter computation and the relative facility in recording and analyzing EGG signals, only a few studies comparing EGG and acoustic jitter have been reported. This can be attributed to the remarkable sensitivity of measures of acoustic jitter to such features as the type of sound being analyzed, the equipment used for data acquisition, or the algorithms used to identify glottal cycle boundaries. To assess the agreement achievable by measures from these signals in pathological voices, acoustic and EGG waveforms of sustained vowels (/i/, /a/, /u/) produced by 15 dysphonic patients were analyzed by jitter extraction methods on the basis of peaks, zero crossings, and a waveform matching technique. The agreement, expressed as normalized absolute differences between acoustic- and EGG-derived jitter, was poorer for /i/ and /u/ than for /a/ vowels. For /a/ vowels, a method of acoustic jitter estimation is also proposed that combines peaks and zero crossings and resulted in increased consistency with the zero crossing-based EGG measures (mean normalized absolute difference: 10.95%, SD: 6.44%; range: 23.81%). The proposed method, which has a built-in-test intended to reject unreliable estimates, may provide more confidence in acoustic measures in dysphonic speakers and lead to a better understanding of the relationships between acoustic and EGG signals.

[1]  R. Colton,et al.  Problems and pitfalls of electroglottography , 1990 .

[2]  Mervyn A. Jack,et al.  Methodological aspects in a multimedia database of vocal fold pathologies , 1995, EUROSPEECH.

[3]  M. Rossi,et al.  Movements of the hyoid and the larynx and the intrinsic frequency of vowels , 1981 .

[4]  Vicki L. Heiberger,et al.  Jitter and Shimmer in Sustained Phonation , 1982 .

[5]  J. Laver,et al.  Acoustic waveform perturbations and voice disorders , 1992 .

[6]  Wolfgang Hess,et al.  Accurate time-domain pitch determination of speech signals by means of a laryngograph , 1987, Speech Commun..

[7]  P. Lieberman Perturbations in Vocal Pitch , 1960 .

[8]  R. Orlikoff Vocal stability and vocal tract configuration: an acoustic and electroglottographic investigation. , 1995, Journal of voice : official journal of the Voice Foundation.

[9]  P. Lieberman Some Acoustic Measures of the Fundamental Periodicity of Normal and Pathologic Larynges , 1963 .

[10]  E T Doherty,et al.  Tape recorder effects on jitter and shimmer extraction. , 1988, Journal of speech and hearing research.

[11]  Jean Schoentgen,et al.  Jitter in sustained vowels and isolated sentences produced by dysphonic speakers , 1989, Speech Commun..

[12]  G. E. Peterson,et al.  Control Methods Used in a Study of the Vowels , 1951 .

[13]  Y Qi Time normalization in voice analysis. , 1992, The Journal of the Acoustical Society of America.

[14]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[15]  M P Karnell,et al.  Laryngeal perturbation analysis: minimum length of analysis window. , 1991, Journal of speech and hearing research.

[16]  P. Milenkovic,et al.  Least mean square measures of voice perturbation. , 1987, Journal of speech and hearing research.

[17]  W S Winholtz,et al.  Effect of microphone type and placement on voice perturbation measurements. , 1993, Journal of speech and hearing research.

[18]  Jean Schoentgen,et al.  An algorithm for the measurement of jitter , 1991, Speech Commun..

[19]  Eyal Yair,et al.  Super resolution pitch determination of speech signals , 1991, IEEE Trans. Signal Process..

[20]  J. Kormylo,et al.  Two-pass recursive digital filter with zero phase shift , 1974 .

[21]  R. Orlikoff,et al.  The effect of the heartbeat on vocal fundamental frequency perturbation. , 1989, Journal of speech and hearing research.

[22]  R. Czarnach Recursive processing by noncausal digital filters , 1982 .

[23]  I. Titze,et al.  Comparison of Fo extraction methods for high-precision voice perturbation measurements. , 1993, Journal of speech and hearing research.

[24]  K Schäfer-Vincent,et al.  Pitch Period Detection and Chaining: Method and Evaluation , 1983, Phonetica.

[25]  I R Titze,et al.  Some technical considerations in voice perturbation measurements. , 1987, Journal of speech and hearing research.

[26]  Paul C. Bagshaw,et al.  Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching , 1993, EUROSPEECH.

[27]  Fergus McInnes,et al.  Analysis of the Effects of Electroglottographic Baseline Fluctuation on the f0 Estimation in Pathological Voices , 1996 .

[28]  Ronald J. Baken,et al.  Clinical measurement of speech and voice , 1987 .

[29]  Jody Kreiman,et al.  Comparison of Voice Analysis Systems for Perturbation Measurement , 1996 .