Effects of phonetic context on audio-visual intelligibility of French.

Bimodal perception leads to better speech understanding than auditory perception alone. We evaluated the overall benefit of lip-reading on natural utterances of French produced by a single speaker. Eighteen French subjects with good hearing and vision were administered a closed-set identification test of VCVCV nonsense words built from three vowels [i, a, y] and six consonants [b, v, z, ʒ, R, l]. Stimuli were presented under both auditory and audio-visual conditions, with white noise added at various signal-to-noise ratios. Identification scores were higher in the bimodal condition than in the auditory-alone condition, especially when acoustic information was reduced. The auditory and audio-visual intelligibility of the three vowels [i, a, y], averaged over the six consonantal contexts, was evaluated as well. Two different hierarchies of intelligibility were found. Auditorily, [a] was most intelligible, followed by [i] and then by [y], whereas visually, [y] was most intelligible, followed by [a] and [i]. We also quantified the contextual effects of the three vowels on the auditory and audio-visual intelligibility of the consonants. Both the auditory and the audio-visual intelligibility of surrounding consonants were highest in the [a] context, followed by the [i] context and lastly the [y] context.
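The abstract does not say how the noise was mixed or which signal-to-noise ratios were used. As a minimal sketch of the general technique, and not the authors' procedure, the following shows one common way to add Gaussian white noise to a signal at a target SNR in decibels. The function names `add_white_noise` and `measured_snr_db`, the synthetic tone, and the SNR values in the usage lines are all illustrative assumptions.

```python
import math
import random


def add_white_noise(signal, snr_db, seed=0):
    """Return the signal plus Gaussian white noise scaled to a target SNR (dB).

    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    signal_power = sum(s * s for s in signal) / len(signal)
    noise_power = sum(n * n for n in noise) / len(noise)
    # SNR_dB = 10 * log10(P_signal / P_noise), solved for the noise power.
    target_noise_power = signal_power / (10 ** (snr_db / 10))
    gain = math.sqrt(target_noise_power / noise_power)
    return [s + gain * n for s, n in zip(signal, noise)]


def measured_snr_db(clean, noisy):
    """Compute the SNR (dB) of a noisy signal given its clean version."""
    noise = [y - x for x, y in zip(clean, noisy)]
    clean_power = sum(x * x for x in clean) / len(clean)
    noise_power = sum(n * n for n in noise) / len(noise)
    return 10 * math.log10(clean_power / noise_power)


# Usage: a synthetic 440 Hz tone stands in for a recorded utterance.
# The SNR values below are assumed for the example only.
tone = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(1600)]
for snr in (6, 0, -6, -12):
    noisy = add_white_noise(tone, snr)
    print(snr, round(measured_snr_db(tone, noisy), 3))
```

Scaling the noise rather than the speech keeps the speech level constant across conditions, which is one possible design choice; an actual experiment might equally fix the noise level and attenuate the speech.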
