Auditory and Visual Lexical Neighborhoods in Audiovisual Speech Perception

Much evidence suggests that the mental lexicon is organized into auditory neighborhoods, with words that are phonologically similar belonging to the same neighborhood. In this investigation, we considered the existence of visual neighborhoods. When a receiver watches someone speak a word, a neighborhood of homophenes (ie, words that look alike on the face, such as pat and bat) is activated. The simultaneous activation of a word's auditory and visual neighborhoods may, in part, account for why individuals recognize speech better in an auditory–visual condition than what would be predicted by their performance in audition-only and vision-only conditions. A word test was administered to 3 groups of participants in audition-only, vision-only, and auditory–visual conditions, in the presence of 6-talker babble. Test words with sparse visual neighborhoods were recognized more accurately than words with dense neighborhoods in a vision-only condition. Densities of both the acoustic and visual neighborhoods as well as their intersection overlap were predictive of how well the test words were recognized in the auditory–visual condition. These results suggest that visual neighborhoods exist and that they affect auditory–visual speech perception. One implication is that in the presence of dual sensory impairment, the boundaries of both acoustic and visual neighborhoods may shift, adversely affecting speech recognition.

[1]  S. Lesner Deafness in the Adult: What Hearing Loss Means and What Can Be Done to Help , 1987 .

[2]  L. Bernstein,et al.  Stimulus-based lexical distinctiveness as a general word-recognition mechanism , 2002, Perception & psychophysics.

[3]  R. Fox Modularity and the Motor Theory of Speech Perception , 1994 .

[4]  K. Grant,et al.  Auditory-visual speech recognition by hearing-impaired subjects: consonant recognition, sentence recognition, and auditory-visual integration. , 1998, The Journal of the Acoustical Society of America.

[5]  W. H. Sumby,et al.  Visual contribution to speech intelligibility in noise , 1954 .

[6]  M. Sommers,et al.  Auditory-Visual Speech Perception and Auditory-Visual Enhancement in Normal-Hearing Younger and Older Adults , 2005, Ear and hearing.

[7]  K. Grant,et al.  Measures of auditory-visual integration for speech understanding: a theoretical perspective. , 2002, The Journal of the Acoustical Society of America.

[8]  Slim Ouni,et al.  Visual Contribution to Speech Perception: Measuring the Intelligibility of Animated Talking Heads , 2007, EURASIP J. Audio Speech Music. Process..

[9]  The effects of signal‐to‐noise ratio on auditory‐visual integration: Integration and encoding are not independent , 2005 .

[10]  Time-Compressed Visual Speech and Age: A First Report , 2004, Ear and hearing.

[11]  David B Pisoni,et al.  Talker and lexical effects on audiovisual word recognition by adults with cochlear implants. , 2003, Journal of speech, language, and hearing research : JSLHR.

[12]  L. Braida Crossmodal Integration in the Identification of Consonant Segments , 1991, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[13]  S A Lesner,et al.  Training influences on visual consonant and sentence recognition. , 1987, Ear and hearing.

[14]  Arlene Earley Carney,et al.  Auditory-Visual Speech Perception and Aging , 2002, Ear and hearing.

[15]  D. Pisoni,et al.  Recognizing Spoken Words: The Neighborhood Activation Model , 1998, Ear and hearing.

[16]  G. Plant Perceiving Talking Faces: From Speech Perception to a Behavioral Principle , 1999 .

[17]  W. H. Sumby,et al.  Erratum: Visual Contribution to Speech Intelligibility in Noise [J. Acoust. Soc. Am. 26, 212 (1954)] , 1954 .

[18]  P J Blamey,et al.  Speech perception using combinations of auditory, visual, and tactile information. , 1989, Journal of rehabilitation research and development.

[19]  Nancy Tye-Murray,et al.  Audiovisual integration and lipreading abilities of older adults with normal and impaired hearing. , 2006, Ear and hearing.