The Role of Facial Colour and Luminance in Visual and Audiovisual Speech Perception

We conducted four experiments to investigate the role of colour and luminance information in visual and audiovisual speech perception. In experiments 1a (stimuli presented in quiet conditions) and 1b (stimuli presented in auditory noise), face display types comprised naturalistic colour (NC), grey-scale (GS), and luminance-inverted (LI) faces. In experiments 2a (quiet) and 2b (noise), face display types comprised NC, colour-inverted (CI), LI, and colour- and luminance-inverted (CLI) faces. Six syllables and twenty-two words were used to produce auditory and visual speech stimuli. Auditory and visual signals were combined to produce congruent and incongruent audiovisual speech stimuli. Experiments 1a and 1b showed that perception of visual speech, and its influence on identifying the auditory components of congruent and incongruent audiovisual speech, were weaker for LI faces than for either NC or GS faces, which produced identical results. Experiments 2a and 2b showed that perception of visual speech, and its influence on perception of incongruent auditory speech, were weaker for LI and CLI faces than for NC and CI faces, which produced identical patterns of performance. Our findings for NC and CI faces suggest that colour is not critical for perception of visual and audiovisual speech. The effect of luminance inversion on performance accuracy was relatively small (5%), which suggests that the luminance information preserved in LI faces is important for the processing of visual and audiovisual speech.
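
The five display types can be illustrated with a minimal per-pixel sketch. The abstract does not say how the images were transformed, so everything here is an assumption made for illustration: the choice of YIQ colour space, the definition of luminance inversion as Y -> 1 - Y, the definition of colour inversion as negating the two chrominance channels, and the hypothetical function name `to_display_type`.

```python
import colorsys


def to_display_type(rgb, display_type):
    """Transform one RGB pixel (floats in [0, 1]) into a face display type.

    Illustrative assumption only: the transforms are defined in YIQ space,
    where Y carries luminance and I, Q carry chrominance.
    """
    r, g, b = rgb
    y, i, q = colorsys.rgb_to_yiq(r, g, b)

    if display_type == "NC":      # naturalistic colour: unchanged
        pass
    elif display_type == "GS":    # grey-scale: discard chrominance
        i, q = 0.0, 0.0
    elif display_type == "LI":    # luminance inverted, chrominance kept
        y = 1.0 - y
    elif display_type == "CI":    # colour inverted, luminance kept
        i, q = -i, -q
    elif display_type == "CLI":   # both colour and luminance inverted
        y, i, q = 1.0 - y, -i, -q
    else:
        raise ValueError(f"unknown display type: {display_type}")

    # yiq_to_rgb clamps each channel to [0, 1], so saturated pixels
    # may not preserve the requested luminance exactly.
    return colorsys.yiq_to_rgb(y, i, q)


def luminance(rgb):
    """Return the Y (luminance) component of an RGB pixel."""
    return colorsys.rgb_to_yiq(*rgb)[0]
```

Under these assumptions, CI and NC versions of a pixel share the same luminance, while LI and CLI versions share the inverted luminance, which mirrors the NC/CI versus LI/CLI grouping reported in experiments 2a and 2b.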
