Measurement of vowel nasalization by multi-dimensional acoustic analysis by Michael A. Berger Submitted in Partial Fulfillment of the Requirements for the Degree Master of Arts

When the velum lowers during vowels, the velopharyngeal port is opened, and there is acoustic coupling between the nasal cavity and the main vocal tract, giving rise to a distinct acoustic quality which we call nasality. Nasal coupling results in energy losses at low frequencies, damping of oral formants (especially F1), and introduction of nasal formants corresponding to the resonances of the nasal cavity and sinuses. These spectral modifications are gradient, increasing with degree of velar lowering; this relationship suggests that velar position may be recovered from the acoustic signal by measuring the degree of nasality in the vowel. However, the acoustic effects of nasalization vary not only with velar position, but also across different speakers and vowels, making it difficult to isolate an acoustic dimension corresponding to nasalization. This thesis presents a methodology for acoustic measurement of nasality in vowels which attempts to overcome this contextual variability by a normalization procedure. The measuring technique is implemented in an automated measuring system, which is trained on a phonetically balanced set of words recorded from a particular speaker to generate a speaker-specific model. The model specifies the parameters with which to measure nasality for the speaker, the contextual variability of those parameters which may be subtracted for normalization, and the contributions of the parameters to an integrated measure of nasality. The system performs high-frequency nasality measurements over the time course of vowels to generate temporally detailed nasality contours. These measurements may be interpreted as a form of articulatory inference, or as a representation of perceptual input. The system is trained and tested on recordings of 17 native speakers of three languages—English, Spanish and Bengali—speaking three types of vowels: oral (CV(C)), contextually nasalized (CṼN), and contrastively nasalized (CṼ(C)) (Bengali only). The measuring system is evaluated using various performance metrics.

[1]  T R Willemain,et al.  A miniature accelerometer for detecting glottal waveforms and nasalization. , 1975, Journal of speech and hearing research.

[2]  Ronald J. Baken,et al.  Clinical measurement of speech and voice , 1987 .

[3]  Marie K. Huffman,et al.  The role of F1 amplitude in producing nasal percepts , 1990 .

[4]  James Glass,et al.  Detection of nasalized vowels in American English , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  S. Blumstein,et al.  The effect of subphonetic differences on lexical access , 1994, Cognition.

[6]  Shinji Maeda The role of the sinus cavities in the production of nasal vowels , 1982, ICASSP.

[7]  M. Karnell,et al.  Variations in velar position over time: a nasal videoendoscopic study. , 1988, Journal of speech and hearing research.

[8]  Richard S. McGowan,et al.  Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests , 1994, Speech Commun..

[9]  A Boothroyd,et al.  Assessment of nasalization in the speech of deaf children. , 1976, Journal of speech and hearing research.

[10]  C. M. Cobb,et al.  MEASURING PALATOPHARYNGEAL COMPETENCE WITH THE NASAL ANEMOMETER. , 1964, The Cleft palate journal.

[11]  John J. Ohala,et al.  Phonology and Phonetic Evidence: Speech perception and lexical representation: the role of vowel nasalization in Hindi and English , 1995 .

[12]  Tarun Pruthi Analysis, vocal-tract modeling and automatic detection of vowel nasalization , 2007 .

[13]  John J. Ohala,et al.  Monitoring Soft Palate Movements in Speech , 1971 .

[14]  Y Horii,et al.  An accelerometric approach to nasality measurement: a preliminary report. , 1980, The Cleft palate journal.

[15]  Harold Clumeck,et al.  Patterns of soft palate movements in six languages , 1976 .

[16]  A. Cohn Phonetic and phonological rules of nasalization , 1990 .

[17]  M. Solé,et al.  Spatio-Temporal Patterns of Velopharyngeal Action in Phonetic and Phonological Nasalization , 1995, Language and speech.

[18]  Hajime Hirose,et al.  Electromyographic study of the velum during speech , 1974 .

[19]  K. Moll,et al.  Cinefluorgraphic techniques in speech research. , 1960, Journal of speech and hearing research.

[20]  C A Fowler,et al.  Coarticulatory influences on the perceived height of nasal vowels. , 1988, The Journal of the Acoustical Society of America.

[21]  K. Stevens Toward a Model for Speech Recognition , 1960 .

[22]  Osamu Fujimura,et al.  Nasalization of Vowels in Relation to Nasals , 1958 .

[23]  Michael K. Tanenhaus,et al.  Gradient sensitivity to acoustic detail and temporal integration of phonetic cues , 2004 .

[24]  C. Fowler An event approach to the study of speech perception from a direct realist perspective , 1986 .

[25]  A. I. Weiss ORAL AND NASAL SOUND PRESSURE LEVELS AS RELATED TO JUDGED SEVERITY OF NASALITY , 1954 .

[26]  S Hawkins,et al.  Acoustic and perceptual correlates of the non-nasal--nasal distinction for vowels. , 1985, The Journal of the Acoustical Society of America.

[27]  S Horiguchi,et al.  The Velotrace: a device for monitoring velar position. , 1987, The Cleft palate journal.

[28]  M. Solé Phonetic and Phonological Processes: The Case of Nasalization , 1992 .

[29]  G. Sloan Posterior pharyngeal flap and sphincter pharyngoplasty: the state of the art. , 2000, The Cleft palate-craniofacial journal : official publication of the American Cleft Palate-Craniofacial Association.

[30]  Walter B. Swift,et al.  Speech Correction , 1920 .

[31]  S G Fletcher,et al.  Quantitative and graphic analysis of prosthetic treatment for "nasalance" in speech. , 1974, The Journal of prosthetic dentistry.

[32]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[33]  K. Stevens,et al.  Analog studies of the nasalization of vowels. , 1956, The Journal of speech and hearing disorders.

[34]  Marilyn Y. Chen,et al.  Acoustic correlates of English and French nasalized vowels. , 1997, The Journal of the Acoustical Society of America.

[35]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[36]  M. Y. Chen,et al.  Acoustic parameters of nasalized vowels in hearing-impaired and normal-hearing speakers. , 1995, The Journal of the Acoustical Society of America.

[37]  B. Atal,et al.  Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique. , 1978, The Journal of the Acoustical Society of America.

[38]  Mark Hasegawa-Johnson,et al.  Landmark-based speech recognition: report of the 2004 Johns Hopkins summer workshop , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[39]  W. B. Arndt,et al.  The relationship between nasality score values and oral and nasal sound pressure level. , 1967, Journal of speech and hearing research.