Acoustic Analysis and Detection of Hypernasality Using a Group Delay Function

In this paper, we describe a group delay-based signal processing technique for the analysis and detection of hypernasal speech. Our preliminary acoustic analysis on nasalized vowels shows that, even though additional resonances are introduced at various frequency locations, the introduction of a new resonance in the low-frequency region (around 250 Hz) is found to be consistent. This observation is further confirmed by a perceptual analysis carried out on vowel sounds that are modified by introducing different nasal resonances, and an acoustic analysis on hypernasal speech. Based on this, for subsequent experiments the focus is given only to the low-frequency region. The additive property of the group delay function can be exploited to resolve two closely spaced formants. However, when the formants are very close with considerably wider bandwidths as in hypernasal speech, the group delay function also fails to resolve. To overcome this, we suggest a band-limited approach to estimate the locations of the formants. Using the band-limited group delay spectrum, we define a new acoustic measure for the detection of hypernasality. Experiments are carried out on the phonemes /a/, /i/, and /u/ uttered by 33 hypernasal speakers and 30 normal speakers. Using the group delay-based acoustic measure, the performance on a hypernasality detection task is found to be 100% for /a/, 88.78% for /i/ and 86.66% for /u/. The effectiveness of this acoustic measure is further cross-verified on a speech data collected in an entirely different recording environment

[1]  M. Ramasubba Reddy,et al.  The analysis on band-limited hypernasal speech using group delay based formant extraction technique , 2005, INTERSPEECH.

[2]  S Hawkins,et al.  Acoustic and perceptual correlates of the non-nasal--nasal distinction for vowels. , 1985, The Journal of the Acoustical Society of America.

[3]  Hema A. Murthy,et al.  The modified group delay function and its application to phoneme recognition , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[4]  D C SPRIESTERSBACH,et al.  Assessing nasal quality in cleft palate speech of children. , 1955, The Journal of speech and hearing disorders.

[5]  Chulhee Lee,et al.  A Noninvasive Estimation of Hypernasality Using a Linear Predictive Model , 2004, Annals of Biomedical Engineering.

[6]  Y Horii,et al.  An accelerometric measure as a physical correlate of perceived hypernasality in speech. , 1983, Journal of speech and hearing research.

[7]  A. Reich,et al.  Correspondence between an accelerometric nasal/voice amplitude ratio and listeners' direct magnitude estimations of hypernasality. , 1985, Journal of speech and hearing research.

[8]  A Boothroyd,et al.  Assessment of nasalization in the speech of deaf children. , 1976, Journal of speech and hearing research.

[9]  David P Kuehn,et al.  Speech evaluation and treatment for patients with cleft palate. , 2003, American journal of speech-language pathology.

[10]  Bayya Yegnanarayana,et al.  Formant extraction from group delay function , 1991, Speech Commun..

[11]  James Glass,et al.  Detection of nasalized vowels in American English , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  J.H.L. Hansen,et al.  A noninvasive technique for detecting hypernasal speech using a nonlinear operator , 1996, IEEE Transactions on Biomedical Engineering.

[13]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[14]  M. Ramasubba Reddy,et al.  Analysis of hypernasality by synthesis , 2004, INTERSPEECH.

[15]  M. Ramasubba Reddy,et al.  Detection of hypernasality using statistical pattern classifiers , 2005, INTERSPEECH.

[16]  M. Y. Chen,et al.  Acoustic parameters of nasalized vowels in hearing-impaired and normal-hearing speakers. , 1995, The Journal of the Acoustical Society of America.