Spectral-Cepstral Estimation of Dysphonia Severity: External Validation

Objectives: This study applied the Cepstral Spectral Index of Dysphonia (CSID), an acoustic algorithm that combines measures from cepstral and spectral analyses, to test whether the CSID can be externally validated as an acoustic estimate of dysphonia severity. Methods: Pearson's r was calculated between CSID values and the severity perceived by trained listeners, as rated on the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V), using sentence and sustained vowel samples from 56 patients before or after they underwent thyroid surgery. Results: The mean CSID values calculated across CAPE-V sentences and vowels correlated strongly with the median rating of perceived overall severity (r = 0.82; p < 0.001). The CSID values did not differ significantly from the corresponding auditory-perceptual ratings of dysphonia severity for these samples (CSID: mean = 15.54, SD = 16.63; CAPE-V severity: mean = 17.33, SD = 13.61; p = 0.16). Conclusions: Independent testing of the CSID confirmed a strong correlation between the CSID and perceptual ratings of overall voice quality. This study provides external validation of the CSID as a robust correlate of dysphonia severity as rated by trained listeners.
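The two comparisons reported above (a correlation between paired scores and a test of whether the paired scores differ) can be illustrated with a minimal sketch. The abstract does not name the test behind p = 0.16, so the paired t statistic below is an assumption, as are the function names and the five synthetic score pairs, which are placeholders and not study data.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations about the means
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def paired_t_statistic(x, y):
    """t statistic for the mean of the paired differences x - y (n - 1 df)."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    # Sample standard deviation of the differences
    sd_d = math.sqrt(sum((d - mean_d) ** 2 for d in diffs) / (n - 1))
    return mean_d / (sd_d / math.sqrt(n))


# Synthetic placeholder scores on a 0-100 scale (NOT data from the study)
csid_scores = [5.0, 12.0, 20.0, 33.0, 48.0]
capev_severity = [8.0, 10.0, 24.0, 30.0, 45.0]

r = pearson_r(csid_scores, capev_severity)
t = paired_t_statistic(csid_scores, capev_severity)
print(f"Pearson r = {r:.2f}, paired t = {t:.2f}")
```

A high r alone shows only that the two measures rank samples similarly; the paired comparison addresses the separate question of whether the CSID values sit at the same level as the perceptual ratings.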
