Acoustic and visual phonetic features in the mcgurk effect - an audiovisual speech illusion

Information from the acoustic speech signal and the talking face is integrated into a unified percept. This is demonstrated in the McGurk effect, in which discrepant visual articulation changes the auditory perception of a consonant. We studied acoustic (A) and visual (V) phonetic features that contribute to audiovisual speech perception by measuring the McGurk effect in two vowel contexts, [a] and [e], at various levels of acoustic noise. The McGurk stimuli consisted of an acoustic [p] presented with a visual [k]. This combination is generally heard as [t] (called fusion) or as [k] (visually dominant percept). The stimulus A[apa]V[aka] was most often heard as [aka], and these percepts increased with noise level. The stimulus A[epe]V[eke] was heard mostly as a fusion [ete], but in high noise also as [eke]. A phonetic analysis showed that, in [e] context, A[p] and V[k] stimulus features were close to those of [t], explaining why fusions were frequent. In [a] context, the visual stimulus had clear features of [k], while the features of the acoustic component were less distinctive, resulting in visual dominance particularly in noise. These results show how audiovisual integration depends on the features of acoustic and visual speech.

[1]  H. Bülthoff,et al.  Merging the senses into a robust percept , 2004, Trends in Cognitive Sciences.

[2]  W. H. Sumby,et al.  Visual contribution to speech intelligibility in noise , 1954 .

[3]  A. Meltzoff,et al.  Integrating speech information across talkers, gender, and sensory modality: Female faces and male voices in the McGurk effect , 1991, Perception & psychophysics.

[4]  K. Green The Use of Auditory and Visual Information in Phonetic Perception , 1996 .

[5]  Sumi Shigeno Influence of vowel context on the audio‐visual speech perception of voiced stop consonants , 2000 .

[6]  K. Green,et al.  Acoustic cues to place of articulation and the McGurk effect: the role of release bursts, aspiration, and formant transitions. , 1997, Journal of speech, language, and hearing research : JSLHR.

[7]  A. Liberman,et al.  Acoustic Loci and Transitional Cues for Consonants , 1954 .

[8]  Mikko Sams,et al.  The Mcgurk effect at various auditory signal-to-noise ratios in american and Finnish listeners , 2010, AVSP.

[9]  G. A. Miller,et al.  An Analysis of Perceptual Confusions Among Some English Consonants , 1955 .

[10]  Tobias S. Andersen,et al.  Visual attention modulates audiovisual speech perception , 2004 .

[11]  Mikko Sams,et al.  Sound location can influence audiovisual speech perception when spatial attention is manipulated. , 2011, Seeing and perceiving.

[12]  L. Bernstein,et al.  Psychophysics of the McGurk and other audiovisual speech integration effects. , 2011, Journal of experimental psychology. Human perception and performance.

[13]  F. Guenther,et al.  Changes in the McGurk Effect Across Phonetic Contexts , 1999 .

[14]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[15]  M. Pichora-Fuller,et al.  Coarticulation effects in lipreading. , 1982, Journal of speech and hearing research.