The correlogram: a visual display of periodicity.

Fundamental frequency (F0) extraction is often used in voice quality analysis. In pathological voices with a high degree of instability in F0, it is common for F0 extraction algorithms to fail. In such cases, the faulty F0 values might spoil the possibilities for further data analysis. This paper presents the correlogram, a new method of displaying periodicity. The correlogram is based on the waveform-matching techniques often used in F0 extraction programs, but with no mechanism to select an actual F0 value. Instead, several candidates for F0 are shown as dark bands. The result is presented as a 3D plot with time on the x axis, correlation delay inverted to frequency on the y axis, and correlation on the z axis. The z axis is represented in a gray scale as in a spectrogram. Delays corresponding to integer multiples of the period time will receive high correlation, thus resulting in candidates at F0, F0/2, F0/3, etc. While the correlogram adds little to F0 analysis of normal voices, it is useful for analysis of pathological voices since it illustrates the full complexity of the periodicity in the voice signal. Also, in combination with manual tracing, the correlogram can be used for semimanual F0 extraction. If so, F0 extraction can be performed on many voices that cause problems for conventional F0 extractors. To demonstrate the properties of the method it is applied to synthetic and natural voices, among them six pathological voices, which are characterized by roughness, vocal fry, gratings/scrape, hypofunctional breathiness and voice breaks, or combinations of these.

[1]  M. Ng,et al.  Acoustic, aerodynamic, physiologic, and perceptual properties of modal and vocal fry registers. , 1998, The Journal of the Acoustical Society of America.

[2]  J Kreiman,et al.  The perceptual structure of pathologic voice quality. , 1996, The Journal of the Acoustical Society of America.

[3]  S. Imaizumi Acoustic measures of roughness in pathological voice , 1986 .

[4]  I. Titze,et al.  Comparison of Fo extraction methods for high-precision voice perturbation measurements. , 1993, Journal of speech and hearing research.

[5]  J Pesák,et al.  Vocal breaks from the modal to falsetto register. , 1994, Folia phoniatrica et logopaedica : official organ of the International Association of Logopedics and Phoniatrics.

[6]  J. Laver The phonetic description of voice quality , 1980 .

[7]  J. P. Pabon,et al.  Objective acoustic voice-quality parameters in the computer phonetogram , 1991 .

[8]  B. Hammarberg Voice Research and Clinical Needs , 1999, Folia Phoniatrica et Logopaedica.

[9]  Adrian Fourcin,et al.  Electrolaryngographic assessment of vocal fold function , 1986 .

[10]  A. McAllister,et al.  Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Acoustic, Perceptual and Physiological Studies of Ten-year-old Children's Voices Acoustic, Perceptual and Physiological Studies of Ten-year-old Children's Voices , 2022 .

[11]  Britta Hammarberg,et al.  Perceptual and acoustic analysis of dysphonia , 1986 .

[12]  G. Krom Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. , 1995, Journal of speech and hearing research.

[13]  平野 実,et al.  Vocal fold physiology : voice quality control , 1995 .

[14]  Anders Friberg,et al.  A method for extracting vibrato parameters applied to violin performance , 1998 .

[15]  J. Kreiman,et al.  The multidimensional nature of pathologic vocal quality. , 1994, The Journal of the Acoustical Society of America.

[16]  J. Hillenbrand Perception of aperiodicities in synthetically generated voices. , 1988, The Journal of the Acoustical Society of America.

[17]  C R Rabinov,et al.  Comparing reliability of perceptual ratings of roughness and acoustic measure of jitter. , 1995, Journal of speech and hearing research.

[18]  K. Omori,et al.  Acoustic characteristics of rough voice: subharmonics. , 1997, Journal of voice : official journal of the Voice Foundation.

[19]  M P Karnell,et al.  Comparison of acoustic voice perturbation measures among three independent voice laboratories. , 1991, Journal of speech and hearing research.

[20]  Wolfgang Hess,et al.  Pitch Determination of Speech Signals , 1983 .

[21]  W J Hess Determination of glottal excitation cycles in running speech. , 1995, Phonetica.

[22]  M. Rothenberg A new inverse-filtering technique for deriving the glottal air flow waveform during voicing. , 1970, The Journal of the Acoustical Society of America.

[23]  N. Isshiki,et al.  Differential diagnosis of hoarseness. , 1969, Folia phoniatrica.

[24]  J. Sundberg,et al.  The Science of Singing Voice , 1987 .

[25]  G. de Krom,et al.  Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. , 1995, Journal of speech and hearing research.