Relation of perceived breathiness to laryngeal kinematics and acoustic measures based on computational modeling.

PURPOSE: In this study, the authors sought to determine (a) how specific vocal fold structural and vibratory features relate to breathy voice quality and (b) how perceived breathiness relates to four acoustic correlates of breathiness.

METHOD: A computational, kinematic model of the vocal fold medial surfaces was used to specify features of vocal fold structure and vibration in a manner consistent with breathy voice. Four model parameters were altered: vocal process separation, surface bulging, vibratory nodal point, and epilaryngeal constriction. Twelve naïve listeners rated the breathiness of 364 samples relative to a reference. The degree of breathiness was then compared to (a) the underlying kinematic profile and (b) four acoustic measures: cepstral peak prominence (CPP), harmonics-to-noise ratio, and two measures of spectral slope.

RESULTS: Vocal process separation alone accounted for 61.4% of the variance in perceptual ratings. Adding nodal point ratio and bulging to the regression equation increased the explained variance to 88.7%. The acoustic measure CPP accounted for 86.7% of the variance in perceived breathiness, and the explained variance increased to 92.6% with the addition of one spectral slope measure.

CONCLUSION: Breathiness ratings were best explained kinematically by the degree of vocal process separation and acoustically by CPP.
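The stepwise gain in explained variance reported in the results can be illustrated with nested least-squares regressions. The sketch below is not the authors' analysis: the data are synthetic, the coefficients and noise level are invented, and every function and variable name is hypothetical. It only shows how R² is computed for a model with one predictor and then recomputed as further predictors are added.

```python
import random


def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def r_squared(predictors, y):
    """R^2 of an ordinary least-squares fit of y on the predictor columns."""
    n = len(y)
    rows = [[1.0] + [col[i] for col in predictors] for i in range(n)]  # intercept
    k = len(rows[0])
    xtx = [[sum(r[p] * r[q] for r in rows) for q in range(k)] for p in range(k)]
    xty = [sum(r[p] * yi for r, yi in zip(rows, y)) for p in range(k)]
    beta = solve_linear_system(xtx, xty)  # normal equations
    fitted = [sum(bj * rj for bj, rj in zip(beta, r)) for r in rows]
    mean_y = sum(y) / n
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Synthetic stand-ins for three kinematic parameters and a breathiness rating.
# The weights (2.0, 1.0, 0.5) and the noise level are invented for illustration.
rng = random.Random(0)
n_samples = 50
separation = [rng.uniform(0.0, 1.0) for _ in range(n_samples)]
nodal_point = [rng.uniform(0.0, 1.0) for _ in range(n_samples)]
bulging = [rng.uniform(0.0, 1.0) for _ in range(n_samples)]
rating = [
    2.0 * s + 1.0 * p + 0.5 * b + rng.gauss(0.0, 0.2)
    for s, p, b in zip(separation, nodal_point, bulging)
]

r2_sep = r_squared([separation], rating)
r2_sep_nodal = r_squared([separation, nodal_point], rating)
r2_all = r_squared([separation, nodal_point, bulging], rating)

print(f"separation only:           R^2 = {r2_sep:.3f}")
print(f"+ nodal point:             R^2 = {r2_sep_nodal:.3f}")
print(f"+ nodal point and bulging: R^2 = {r2_all:.3f}")
```

Because each model is nested within the next, R² cannot decrease as predictors are added. The size of each increase, not its direction, is what indicates how much a parameter contributes beyond those already in the equation.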
