Stochastic models of jitter.

This study presents stochastic models of jitter. Jitter designates small, random, involuntary perturbations of the glottal cycle lengths. Jitter is a base-line phenomenon that may be observed in all voiced speech sounds. Knowledge of its properties is therefore relevant to the acoustic modeling, analysis, and synthesis of voice quality. Also, models of jitter are conceptual frameworks that enable experimenters and clinicians to distinguish jitter in particular from aperiodic cycle length patterns in general. Vocal jitter is modeled by means of the ribbon model of the glottal vibration combined with stochastic models of the disturbances of the instantaneous frequency. The disturbance model comprises correlation-free noise and vocal microtremor. Properties of jitter that are simulated are the stochasticity, stationarity, and normality of the decorrelated cycle length perturbations, the size of decorrelated jitter, the correlation between the perturbations of neighboring glottal cycles, the modulation level and modulation frequency owing to microtremor, the asynchrony between external disturbances and glottal cycles, the dependence of the size of jitter on the average glottal cycle length, and the relation between jitter and laryngeal pathologies. Modeled jitter is discussed in the light of measured jitter, as well as the physiological and statistical plausibility of the model parameters.

[1]  J Hillenbrand,et al.  A methodological study of perturbation and additive noise in synthetically generated voice signals. , 1987, Journal of speech and hearing research.

[2]  I R Titze,et al.  Some technical considerations in voice perturbation measurements. , 1987, Journal of speech and hearing research.

[3]  J. Hillenbrand,et al.  Perception of aperiodicities in synthetically generated voices. , 1988, The Journal of the Acoustical Society of America.

[4]  Hideki Kasuya,et al.  Characteristics pf pitch period and amplitude perturbations in pathologic voice , 1983, ICASSP.

[5]  Anton J. Rozsypal,et al.  Perception of jitter and shimmer in synthetic vowels , 1979 .

[6]  S. Imaizumi Acoustic measures of roughness in pathological voice , 1986 .

[7]  Y. Koike,et al.  Acoustic measures for detecting laryngeal pathology. , 1977, Acta oto-laryngologica.

[8]  George E. P. Box,et al.  Statistical Control: By Monitoring and Feedback Adjustment , 1997 .

[9]  Jean Schoentgen,et al.  Searching for nonlinear relations in whitened jitter time series [glottal cycle length fluctuations] , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[10]  I R Titze,et al.  Unification of perturbation measures in speech signals. , 1990, The Journal of the Acoustical Society of America.

[11]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[12]  Ke Wu,et al.  Quality of speech produced by analysis-synthesis , 1990, Speech Commun..

[13]  Ingo Titze,et al.  A four-parameter model of the glottis and vocal fold contact area , 1989, Speech Commun..

[14]  I R Titze,et al.  A model for neurologic sources of aperiodicity in vocal fold vibration. , 1991, Journal of speech and hearing research.

[15]  W S Winholtz,et al.  Vocal tremor analysis with the Vocal Demodulator. , 1992, Journal of speech and hearing research.

[16]  K. Ishizaka,et al.  Computer simulation of pathological vocal‐cord vibration , 1975 .

[17]  P. Milenkovic,et al.  Least mean square measures of voice perturbation. , 1987, Journal of speech and hearing research.

[18]  Paul C. Fletcher,et al.  Statistics in Language Studies: Statistical inference , 1986 .

[19]  J. Laver,et al.  An acoustic screening system for the detection of laryngeal pathology , 1986 .

[20]  I. Titze Parameterization of the glottal area, glottal flow, and vocal fold contact area. , 1984, The Journal of the Acoustical Society of America.

[21]  Jean Schoentgen,et al.  Predictable and random components of jitter , 1997, Speech Commun..

[22]  Y. Horii Fundamental frequency perturbation observed in sustained phonation. , 1979, Journal of speech and hearing research.

[23]  S. K. Mullick,et al.  NONLINEAR DYNAMICAL ANALYSIS OF SPEECH , 1996 .

[24]  P. Lieberman Some Acoustic Measures of the Fundamental Periodicity of Normal and Pathologic Larynges , 1963 .

[25]  R. W. Wendahl,et al.  Laryngeal analog synthesis of jitter and shimmer auditory parameters of harshness. , 1966, Folia phoniatrica.

[26]  T Shipp,et al.  An adaptive method for tracking voicing irregularities. , 1992, The Journal of the Acoustical Society of America.

[27]  Sheldon M. Ross,et al.  Introduction to Probability and Statistics for Engineers and Scientists , 1987 .

[28]  N. Geçkinli,et al.  A set of optimal discrete linear smoothers , 1981 .

[29]  T. Baer,et al.  A pitch-synchronous analysis of hoarseness in running speech. , 1988, The Journal of the Acoustical Society of America.

[30]  J A Molina,et al.  Acoustic voice analysis in patients with essential tremor. , 1998, Journal of voice : official journal of the Voice Foundation.

[31]  Jean Schoentgen,et al.  Time series analysis of jitter , 1995 .

[32]  Philippe H. Dejonckere,et al.  An analysis of the diplophonia phenomenon , 1983, Speech Commun..

[33]  N Isshiki,et al.  Turbulent noise in dysphonia. , 1978, Folia phoniatrica.

[34]  G. E. Peterson,et al.  The elements of an acoustic phonetic theory. , 1966, Journal of speech and hearing research.

[35]  A. Behrman,et al.  Microphone and electroglottographic data from dysphonic patients: type 1, 2 and 3 signals. , 1998, Journal of voice : official journal of the Voice Foundation.

[36]  N. R. Petersen,et al.  Intrinsic Fundamental Frequency of Danish Vowels. , 1978 .

[37]  Jean Schoentgen,et al.  An algorithm for the measurement of jitter , 1991, Speech Commun..

[38]  J. Atkinson,et al.  Inter- and intraspeaker variability in fundamental voice frequency. , 1976, The Journal of the Acoustical Society of America.

[39]  G. P. Moore,et al.  A model for vocal fold vibratory motion, contact area, and the electroglottogram. , 1986, The Journal of the Acoustical Society of America.

[40]  Harry Hollien,et al.  A Method for Analyzing Vocal Jitter in Sustained Phonation. , 1973 .

[41]  Donald G. Childers,et al.  Modeling vocal disorders via formant synthesis , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[42]  C. Gardiner Handbook of Stochastic Methods , 1983 .

[43]  Shigeru Kiritani,et al.  High-speed digital image analysis of vocal cord vibration in diplophonia , 1993, Speech Commun..

[44]  D. Berry,et al.  Analysis of vocal disorders with methods from nonlinear dynamics. , 1994, Journal of speech and hearing research.

[45]  H.-J. Schultz-Coulon,et al.  Zur quantitativen Bewertung der Tonhöhenschwankungen im Rahmen der Stimmfunktionsprüfung , 1979 .

[46]  A. Behrman,et al.  Global and local dimensions of vocal dynamics. , 1999, The Journal of the Acoustical Society of America.

[47]  Satoshi Imaizumi,et al.  Acoustic and perceptual modelling of the voice quality caused by fundamental frequency perturbation , 1992, ICSLP.

[48]  R F Coleman,et al.  Effect of waveform changes upon roughness perception. , 1971, Folia phoniatrica.

[49]  A. Behrman,et al.  Correlation dimension of electroglottographic data from healthy and pathologic subjects. , 1997, The Journal of the Acoustical Society of America.

[50]  Abeer Alwan,et al.  Analysis by synthesis of pathological voices using the Klatt synthesizer , 1997, Speech Commun..