Auditory spectral integration in the perception of static vowels.

PURPOSE To evaluate potential contributions of broadband spectral integration in the perception of static vowels. Specifically, can the auditory system infer formant frequency information from changes in the intensity weighting across harmonics when the formant itself is missing? Does this type of integration produce the same results in the lower (first formant [F1]) and higher (second formant [F2]) regions? Does the spacing between the spectral components affect a listener's ability to integrate the acoustic cues? METHOD Twenty young listeners with normal hearing identified synthesized vowel-like stimuli created for adjustments in the F1 region (/Λ/-/α/, /i/-/ε/) and in the F2 region (/Λ/-/æ/). There were 2 types of stimuli: (a) 2-formant tokens and (b) tokens in which 1 formant was removed and 2 pairs of sine waves were inserted below and above the missing formant; the intensities of these harmonics were modified to cause variations in their spectral center of gravity (COG). The COG effects were tested over a wide range of frequencies. RESULTS Obtained patterns were consistent with calculated changes to the spectral COG, in both the F1 and F2 regions. The spacing of the sine waves did not affect listeners' responses. CONCLUSION The auditory system may perform broadband integration as a type of auditory wideband spectral analysis.

[1]  J. Sawusch Acoustic Analysis and Synthesis of Speech , 2008 .

[2]  A. Liberman,et al.  An Experimental Study of the Acoustic Determinants of Vowel Color; Observations on One- and Two-Formant Vowels Synthesized from Spectrographic Patterns , 1952 .

[3]  Robert Allen Fox,et al.  Spectral Integration of Dynamic Cues in the Perception of Syllable-Initial Stops , 2008, Phonetica.

[4]  O. Aaltonen The effect of relative amplitude levels of F2 and F3 on the categorization of synthetic vowels , 1985 .

[5]  S. S. Stevens,et al.  Critical Band Width in Loudness Summation , 1957 .

[6]  B. Lindblom,et al.  Modeling the judgment of vowel quality differences. , 1981, The Journal of the Acoustical Society of America.

[7]  H. Traunmüller Analytical expressions for the tonotopic sensory scale , 1990 .

[8]  Hartrnut Traunmiiller,et al.  Paralinguistic Variation and Invariance in the Characteristic Frequencies of Vowels , 2007 .

[9]  L. Feth,et al.  Two-tone auditory spectral resolution. , 1977, The Journal of the Acoustical Society of America.

[10]  Dennis H. Klatt,et al.  Prediction of perceived phonetic distance from critical-band spectra: A first step , 1982, ICASSP.

[11]  Eric W Healy,et al.  Effect of spectral frequency range and separation on the perception of asynchronous speech. , 2007, The Journal of the Acoustical Society of America.

[12]  E. Jacewicz Listener sensitivity to variations in the relative amplitude of vowel formants , 2005 .

[13]  M. Yano,et al.  On the effectiveness of whole spectral shape for vowel perception. , 2001, The Journal of the Acoustical Society of America.

[14]  D M Green,et al.  Phase independence of pitch produced by narrow-band sounds. , 1996, The Journal of the Acoustical Society of America.

[15]  S. Zahorian,et al.  Spectral-shape features versus formants as acoustic correlates for vowels. , 1993, The Journal of the Acoustical Society of America.

[16]  J. B. Pickering,et al.  Vowel Perception and Production , 1994 .

[17]  G. E. Peterson,et al.  Control Methods Used in a Study of the Vowels , 1951 .

[18]  J. D. Miller,et al.  Auditory-perceptual interpretation of the vowel. , 1989, The Journal of the Acoustical Society of America.

[19]  Hideki Kawahara,et al.  Multiple period estimation and pitch perception model , 1999, Speech Commun..

[20]  J. Hillenbrand,et al.  A narrow band pattern-matching model of vowel perception. , 2003, The Journal of the Acoustical Society of America.

[21]  D. Pisoni,et al.  Speech perception without traditional speech cues. , 1981, Science.

[22]  L. A. Chistovich Central auditory processing of peripheral vowel spectra. , 1985, The Journal of the Acoustical Society of America.

[23]  Harvey M. Sussman,et al.  A neuronal model of vowel normalization and representation , 1986, Brain and Language.

[24]  J B Millar,et al.  The Effect of Relative Formant Amplitude on the Perceived Identity of Synthetic Vowels , 1972, Language and speech.

[25]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[26]  Hermann L. F. Helmholtz,et al.  The sensations of tone: As a physiological basis for the theory of music (6th ed.). , 1948 .

[27]  H. Helmholtz,et al.  On the Sensations of Tone as a Physiological Basis for the Theory of Music , 2005 .

[28]  B. Moore,et al.  Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. , 1983, The Journal of the Acoustical Society of America.

[29]  Robert Allen Fox,et al.  Auditory spectral integration in the perception of diphthongal vowels. , 2010, The Journal of the Acoustical Society of America.

[30]  P F Assmann,et al.  Perception of front vowels: the role of harmonics in the first formant region. , 1987, The Journal of the Acoustical Society of America.

[31]  P F Assmann,et al.  The Perception of Back Vowels: Centre of Gravity Hypothesis , 1991, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[32]  S Hawkins,et al.  The influence of spectral prominence on perceived vowel quality. , 1990, The Journal of the Acoustical Society of America.

[33]  Michael Kiefte,et al.  The relative importance of spectral tilt in monophthongs and diphthongs. , 2005, The Journal of the Acoustical Society of America.

[34]  Joshua G. W. Bernstein,et al.  Pitch discrimination of diotic and dichotic tone complexes: harmonic resolvability or harmonic number? , 2003, The Journal of the Acoustical Society of America.

[35]  Robert Allen Fox,et al.  Amplitude variations in coarticulated vowels. , 2008, The Journal of the Acoustical Society of America.

[36]  M. Kiefte,et al.  The role of formant amplitude in the perception of /i/ and /u/. , 2006, The Journal of the Acoustical Society of America.

[37]  L. Chistovich,et al.  The ‘center of gravity’ effect in vowel spectra and critical distance between the formants: Psychoacoustical study of the perception of vowel-like stimuli , 1979, Hearing Research.

[38]  A. Oxenham,et al.  Sequential F0 comparisons between resolved and unresolved harmonics: no evidence for translation noise between two pitch mechanisms. , 2004, The Journal of the Acoustical Society of America.

[39]  A K Krishnamurthy,et al.  Intensity-weighted average of instantaneous frequency as a model for frequency discrimination. , 1993, The Journal of the Acoustical Society of America.

[40]  Jean-Luc Schwartz,et al.  A strong evidence for the existence of a large-scale integrated spectral representation in vowel perception , 1989, Speech Commun..

[41]  Jennifer S. Pardo,et al.  On the Bistability of Sine Wave Analogues of Speech , 2001, Psychological science.

[42]  L. Feth,et al.  Bandwidth of spectral resolution for two-formant synthetic vowels and two-tone complex signals. , 2004, The Journal of the Acoustical Society of America.

[43]  L. Feth Frequency discrimination of complex periodic tones , 1974 .

[44]  R. P. Fahey,et al.  Perception of back vowels: effects of varying F1 - F0 Bark distance. , 1994, The Journal of the Acoustical Society of America.

[45]  A. M. Mimpen,et al.  The ear as a frequency analyzer. II. , 1964, The Journal of the Acoustical Society of America.

[46]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[47]  R L Diehl,et al.  Perception of vowel height: the role of F1-F0 distance. , 1994, The Journal of the Acoustical Society of America.

[48]  B. Granström,et al.  Music and Hearing Quarterly Progress and Status Report Some studies concerning perception of isolated vowels , 2007 .

[49]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[50]  Eric W Healy,et al.  The role of contrasting temporal amplitude patterns in the perception of speech. , 2003, The Journal of the Acoustical Society of America.

[51]  H. Traunmüller Some aspects of the sound of speech sounds , 1987 .

[52]  C. J. Darwin,et al.  Which harmonics contribute to the estimation of first formant frequency? , 1985, Speech Commun..

[53]  H. S. Gopal,et al.  A perceptual model of vowel recognition based on the auditory representation of American English vowels. , 1986, The Journal of the Acoustical Society of America.

[54]  R B Gardner,et al.  Mistuning a harmonic of a vowel: grouping and phase effects on vowel quality. , 1986, The Journal of the Acoustical Society of America.

[55]  J. Schwartz,et al.  The Dispersion-Focalization Theory of vowel systems , 1997 .