Use of spectral/cepstral analyses for differentiating normal from hypofunctional voices in sustained vowel and continuous speech contexts.

PURPOSE In this study, the authors evaluated the diagnostic value of spectral/cepstral measures to differentiate dysphonic from nondysphonic voices using sustained vowels and continuous speech samples. METHODOLOGY Thirty-two age- and gender-matched individuals (16 participants with dysphonia and 16 controls) were recorded reading a standard passage (The Rainbow Passage; Fairbanks, 1960) and sustaining the vowel /α/. Recorded voices were analyzed with custom software that calculated 4 spectral/cepstral measures. RESULTS Measures of cepstral peak prominence (CPP) and low-high spectral ratio (L/H ratio) were significantly different between groups in both speaking conditions; the standard deviation of the CPP was significantly different between groups in continuous speech only. In differentiating dysphonic individuals with a hypofunctional etiology from nondysphonic individuals, receiver operating characteristic (ROC) analyses demonstrated (a) high sensitivity and high specificity for the CPP in the sustained vowel condition and (b) high sensitivity and moderate specificity for the CPP in the speech condition. CONCLUSIONS In a sample of dysphonic speakers (hypofunctional etiologies) versus typical speakers, spectral/cepstral measures of CPP and L/H ratio were able to differentiate these groups from one another in both vowel prolongation and continuous speech contexts with high sensitivity and specificity. The results of this study support the growing body of literature documenting the significant value of cepstral and other spectral-based acoustic measures to the clinical evaluation and management processes.

[1]  Y. Qi,et al.  The estimation of signal-to-noise ratio in continuous speech for disordered voices. , 1999, The Journal of the Acoustical Society of America.

[2]  Shaheen N Awan,et al.  Toward the development of an objective index of dysphonia severity: A four‐factor acoustic model , 2006, Clinical linguistics & phonetics.

[3]  C R Rabinov,et al.  Comparing reliability of perceptual ratings of roughness and acoustic measure of jitter. , 1995, Journal of speech and hearing research.

[4]  P. Van cauwenberge,et al.  Acoustic measurement of overall voice quality: a meta-analysis. , 2009, The Journal of the Acoustical Society of America.

[5]  Maria Markaki,et al.  Using modulation spectra for voice pathology detection and classification , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[6]  William J. Barry,et al.  Instrumental dimensioning of normal and pathological phonation using acoustic measurements , 2008, Clinical linguistics & phonetics.

[7]  J. B. Laflen,et al.  Pitch Deviation Analysis of Pathological Voice in Connected Speech , 2008, The Annals of otology, rhinology, and laryngology.

[8]  Ulrich Eysholdt,et al.  Classification of functional voice disorders based on phonovibrograms , 2010, Artif. Intell. Medicine.

[9]  D. Klatt,et al.  Analysis, synthesis, and perception of voice quality variations among female and male talkers. , 1990, The Journal of the Acoustical Society of America.

[10]  C. Watts,et al.  An investigation of voice quality in individuals with inherited elastin gene abnormalities , 2008, Clinical linguistics & phonetics.

[11]  Jack J. Jiang,et al.  Nonlinear dynamic analysis of disordered voice: the relationship between the correlation dimension (D2) and pre-/post-treatment change in perceived dysphonia severity. , 2010, Journal of voice : official journal of the Voice Foundation.

[12]  R. Hillman,et al.  Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol. , 2009, American journal of speech-language pathology.

[13]  Milton Orlando Sarria Paja,et al.  Feature selection in pathological voice classification using dinamyc of component analysis , 2008 .

[14]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[15]  Jayashree S Bhat,et al.  Cepstral analysis of voice in persons with vocal nodules. , 2010, Journal of voice : official journal of the Voice Foundation.

[16]  Benjamin Halberstam Acoustic and Perceptual Parameters Relating to Connected Speech Are More Reliable Measures of Hoarseness than Parameters Relating to Sustained Vowels , 2004, ORL.

[17]  Y. Heman-Ackah,et al.  The relationship between cepstral peak prominence and selected parameters of dysphonia. , 2002, Journal of voice : official journal of the Voice Foundation.

[18]  G. Fairbanks Voice and articulation drillbook , 1960 .

[19]  Jack J. Jiang,et al.  Perturbation and nonlinear dynamic analyses of voices from patients with unilateral laryngeal paralysis. , 2005, Journal of voice : official journal of the Voice Foundation.

[20]  E. Mendoza,et al.  Differences in voice quality between men and women: use of the long-term average spectrum (LTAS). , 1996, Journal of voice : official journal of the Voice Foundation.

[21]  B. Reiser,et al.  Estimation of the Youden Index and its Associated Cutoff Point , 2005, Biometrical journal. Biometrische Zeitschrift.

[22]  Guus de Krom,et al.  A Cepstrum-Based Technique for Determining a Harmonics-to-Noise Ratio in Speech Signals , 1993 .

[23]  9. Cepstra of normal and pathological voices in correlation to acoustical, aerodynamic and perceptual data , 1996 .

[24]  N. Roy,et al.  Acoustic prediction of voice type in women with functional dysphonia. , 2005, Journal of voice : official journal of the Voice Foundation.

[25]  Geoffrey S. Meltzner,et al.  Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: Comparisons with auditory-perceptual judgements from the CAPE-V , 2010, Clinical linguistics & phonetics.

[26]  Antanas Verikas,et al.  Categorizing normal and pathological voices: automated and perceptual categorization. , 2011, Journal of voice : official journal of the Voice Foundation.

[27]  M. E. Torres,et al.  A pattern recognition approach to spasmodic dysphonia and muscle tension dysphonia automatic classification. , 2010, Journal of voice : official journal of the Voice Foundation.

[28]  Viv Bewick,et al.  Statistics review 13: Receiver operating characteristic curves , 2004, Critical care.

[29]  D. Jamieson,et al.  Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. , 2001, Journal of speech, language, and hearing research : JSLHR.

[30]  R. Klich Relationships of vowel characteristics to listener ratings of breathiness. , 1982, Journal of speech and hearing research.

[31]  P. Van cauwenberge,et al.  Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. , 2010, Journal of voice : official journal of the Voice Foundation.

[32]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. , 1996, Journal of speech and hearing research.

[33]  J. Hillenbrand,et al.  Acoustic correlates of breathy vocal quality. , 1994, Journal of speech and hearing research.

[34]  J. Hillenbrand,et al.  Cepstral Peak Prominence: A More Reliable Measure of Dysphonia , 2003, The Annals of otology, rhinology, and laryngology.

[35]  Christopher Dromey,et al.  Estimating dysphonia severity in continuous speech: Application of a multi-parameter spectral/cepstral model , 2009, Clinical linguistics & phonetics.

[36]  Rick M Roark,et al.  Frequency and voice: perspectives in the time domain. , 2006, Journal of voice : official journal of the Voice Foundation.

[37]  Jody Kreiman,et al.  Comparison of Voice Analysis Systems for Perturbation Measurement , 1996 .

[38]  Jacqueline Vaissière,et al.  Objective acoustic and aerodynamic measures of breathiness in paralytic dysphonia , 2003, European Archives of Oto-Rhino-Laryngology.

[39]  Soren Y Lowell,et al.  Spectral- and cepstral-based measures during continuous speech: capacity to distinguish dysphonia and consistency within a speaker. , 2010, Journal of voice : official journal of the Voice Foundation.

[40]  V. Wolfe,et al.  Acoustic correlates of dysphonia: type and severity. , 1997, Journal of communication disorders.

[41]  D. Sackett Evidence-based medicine: how to practice and teach EBM: 2nd ed , 2000 .