Audio–vocal responses of vocal fundamental frequency and formant during sustained vowel vocalizations in different noises

Sustained vocalizations of vowels [a], [i], and syllable [mə] were collected in twenty normal-hearing individuals. On vocalizations, five conditions of different audio-vocal feedback were introduced separately to the speakers including no masking, wearing supra-aural headphones only, speech-noise masking, high-pass noise masking, and broad-band-noise masking. Power spectral analysis of vocal fundamental frequency (F0) was used to evaluate the modulations of F0 and linear-predictive-coding was used to acquire first two formants. The results showed that while the formant frequencies were not significantly shifted, low-frequency modulations (<3 Hz) of F0 significantly increased with reduced audio-vocal feedback across speech sounds and were significantly correlated with auditory awareness of speakers' own voices. For sustained speech production, the motor speech controls on F0 may depend on a feedback mechanism while articulation should rely more on a feedforward mechanism. Power spectral analysis of F0 might be applied to evaluate audio-vocal control for various hearing and neurological disorders in the future.

[1]  C. Larson,et al.  Voice F0 responses to pitch-shifted voice feedback during English speech. , 2007, The Journal of the Acoustical Society of America.

[2]  C. Larson,et al.  Effects of simultaneous perturbations of voice pitch and loudness feedback on voice F0 and amplitude control. , 2007, The Journal of the Acoustical Society of America.

[3]  Arnold E. Aronson,et al.  Rapid Voice Tremor, or “Flutter,” in Amyotrophic Lateral Sclerosis , 1992, The Annals of otology, rhinology, and laryngology.

[4]  Janne Sinkkonen,et al.  The auditory transient 40‐Hz response is insensitive to changes in stimulus features , 1994, Neuroreport.

[5]  Emily Q. Wang,et al.  Effect of tonal native language on voice fundamental frequency responses to pitch feedback perturbations during sustained vocalizations. , 2010, The Journal of the Acoustical Society of America.

[6]  R. Näätänen,et al.  Early selective-attention effect on evoked potential reinterpreted. , 1978, Acta psychologica.

[7]  Satrajit S. Ghosh,et al.  Neural modeling and imaging of the cortical interactions underlying syllable production , 2006, Brain and Language.

[8]  T C Hain,et al.  Effects of delayed auditory feedback (DAF) on the pitch-shift reflex. , 2001, The Journal of the Acoustical Society of America.

[9]  Jason A. Tourville,et al.  Neural mechanisms underlying auditory feedback control of speech , 2008, NeuroImage.

[10]  T A Burnett,et al.  Comparison of voice F0 responses to pitch-shift onset and offset conditions. , 2001, The Journal of the Acoustical Society of America.

[11]  I R Titze,et al.  A model for neurologic sources of aperiodicity in vocal fold vibration. , 1991, Journal of speech and hearing research.

[12]  I R Titze,et al.  Modulation of fundamental frequency by laryngeal muscles during vibrato. , 1994, Journal of voice : official journal of the Voice Foundation.

[13]  F. Guenther Cortical interactions underlying the production of speech sounds. , 2006, Journal of communication disorders.

[14]  Shao-Hsuan Lee,et al.  Effects of hearing aid amplification on voice F0 variability in speakers with prelingual hearing loss , 2013, Hearing Research.

[15]  D W Massaro,et al.  Perceptual processing in dichotic listening. , 1976, Journal of experimental psychology. Human learning and memory.

[17]  Cheryl C. H. Yang,et al.  Effects of Speech Noise on Vocal Fundamental Frequency Using Power Spectral Analysis , 2007, Ear and hearing.

[18]  Carole T Ferrand Relationship between masking levels and phonatory stability in normal-speaking women. , 2006, Journal of voice : official journal of the Voice Foundation.

[19]  Ciara Leydon,et al.  The role of auditory feedback in sustaining vocal vibrato. , 2003, The Journal of the Acoustical Society of America.

[20]  C. Larson,et al.  Instructing subjects to make a voluntary response reveals the presence of two components to the audio-vocal reflex , 1999, Experimental Brain Research.

[21]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[22]  R. Näätänen Attention and brain function , 1992 .

[23]  James L. McClelland Explorations In Parallel Distributed Processing , 1988 .

[24]  C. Larson,et al.  Voice F0 responses to manipulations in pitch feedback. , 1998, The Journal of the Acoustical Society of America.