Audiovisual integration in the human perception of materials.

Interest in the perception of the material of objects has been growing. While material perception is a critical ability for animals to properly regulate behavioral interactions with surrounding objects (e.g., eating), little is known about its underlying processing. Vision and audition provide useful information for material perception; using only its visual appearance or impact sound, we can infer what an object is made from. However, what material is perceived when the visual appearance of one material is combined with the impact sound of another, and what are the rules that govern cross-modal integration of material information? We addressed these questions by asking 16 human participants to rate how likely it was that audiovisual stimuli (48 combinations of visual appearances of six materials and impact sounds of eight materials) along with visual-only stimuli and auditory-only stimuli fell into each of 13 material categories. The results indicated strong interactions between audiovisual material perceptions; for example, the appearance of glass paired with a pepper sound is perceived as transparent plastic. Rating material-category likelihoods follow a multiplicative integration rule in that the categories judged to be likely are consistent with both visual and auditory stimuli. On the other hand, rating-material properties, such as roughness and hardness, follow a weighted average rule. Despite a difference in their integration calculations, both rules can be interpreted as optimal Bayesian integration of independent audiovisual estimations for the two types of material judgment, respectively.

[1]  L. N. Solomon Semantic Approach to the Perception of Complex Sounds , 1958 .

[2]  A. Gabrielsson,et al.  Perceived sound quality of sound-reproducing systems. , 1979, The Journal of the Acoustical Society of America.

[3]  Edward H. Adelson,et al.  Recognizing Materials Using Perceptually Inspired Features , 2013, International Journal of Computer Vision.

[4]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[5]  D. Massaro Speech Perception By Ear and Eye: A Paradigm for Psychological Inquiry , 1989 .

[6]  Guillaume Lemaitre,et al.  Auditory perception of material is fragile while action is strikingly robust. , 2012, The Journal of the Acoustical Society of America.

[7]  Robert W. Kentridge,et al.  Separate processing of texture and form in the ventral stream: evidence from FMRI and visual agnosia. , 2010, Cerebral cortex.

[8]  Sylvia C Pont,et al.  Illusory gloss on Lambertian surfaces. , 2010, Journal of vision.

[9]  David G. Stork,et al.  Speech Recognition and Sensory Integration A 240-year-old theorem helps explain how people and machines can integrate auditory and visual information to understand speech , 2016 .

[10]  Naokazu Goda,et al.  Transformation from image-based to perceptual representation of materials along the human ventral visual pathway , 2011, NeuroImage.

[11]  Wolfgang Straßer,et al.  Perceptual Reparameterization of Material Properties , 2007, CAe.

[12]  Huseyin Boyaci,et al.  Estimating the glossiness transfer function induced by illumination change and testing its transitivity. , 2010, Journal of vision.

[13]  R. Lutfi,et al.  Auditory discrimination of material changes in a struck-clamped bar. , 1997, The Journal of the Acoustical Society of America.

[14]  Naokazu Goda,et al.  Neural Selectivity and Representation of Gloss in the Monkey Inferior Temporal Cortex , 2012, The Journal of Neuroscience.

[15]  D. Massaro From Multisensory Integration to Talking Heads and Language Learning , 2002 .

[16]  Susan J. Lederman,et al.  Multisensory Texture Perception , 2010 .

[17]  M. Ernst,et al.  Humans integrate visual and haptic information in a statistically optimal fashion , 2002, Nature.

[18]  M. Landy,et al.  Measurement and modeling of depth cue combination: in defense of weak fusion , 1995, Vision Research.

[19]  Richard Kronland-Martinet,et al.  Controlling the Perceived Material in an Impact Sound Synthesizer , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  Marc O. Ernst Optimal Multisensory Integration: Assumptions and Limits , 2012 .

[21]  D. Burr,et al.  The Ventriloquist Effect Results from Near-Optimal Bimodal Integration , 2004, Current Biology.

[22]  Barton L Anderson,et al.  The dark side of gloss , 2012, Nature Neuroscience.

[23]  Dominic W. Massaro,et al.  SPEECH RECOGNITION AND SENSORY INTEGRATION , 1998 .

[24]  Bruno L. Giordano,et al.  Material identification of real impact sounds: effects of size variation in steel, glass, wood, and plexiglass plates. , 2006, The Journal of the Acoustical Society of America.

[25]  M. Shiffrar,et al.  Human Body Perception From The Inside Out , 2005 .

[26]  S. Nishida,et al.  Use of image-based information in judgments of surface-reflectance properties. , 1998, Journal of the Optical Society of America. A, Optics, image science, and vision.

[27]  Edward H. Adelson,et al.  Material perception: What can you see in a brief glance? , 2010 .

[28]  Jonathan S. Cant,et al.  Crinkling and crumpling: An auditory fMRI study of material properties , 2008, NeuroImage.

[29]  Christiane B. Wiebel,et al.  Perceptual qualities and material classes. , 2013, Journal of vision.

[30]  David H Brainard,et al.  Color and material perception: achievements and challenges. , 2010, Journal of vision.

[31]  Qasim Zaidi,et al.  Visual inferences of material changes: color as clue and distraction. , 2011, Wiley interdisciplinary reviews. Cognitive science.

[32]  Marc O. Ernst,et al.  A Bayesian view on multimodal cue integration , 2006 .

[33]  G. Von Bismarck,et al.  Sharpness as an attribute of the timbre of steady sounds , 1974 .

[34]  B. Stein The new handbook of multisensory processes , 2012 .

[35]  Jonathan S. Cant,et al.  Cerebral Cortex Advance Access published April 28, 2006 Attention to Form or Surface Properties Modulates Different Regions of Human , 2022 .

[36]  W. G. Cochran Problems arising in the analysis of a series of similar experiments , 1937 .

[37]  Dinesh K. Pai,et al.  Perception of Material from Contact Sounds , 2000, Presence: Teleoperators & Virtual Environments.

[38]  Jonathan S. Cant,et al.  Scratching Beneath the Surface: New Insights into the Functional Properties of the Lateral Occipital Area and Parahippocampal Place Area , 2011, The Journal of Neuroscience.

[39]  Barton L Anderson,et al.  Visual perception of materials and surfaces , 2011, Current Biology.

[40]  Gouki Okazawa,et al.  Representation of the Material Properties of Objects in the Visual Cortex of Nonhuman Primates , 2014, The Journal of Neuroscience.

[41]  S. Lederman,et al.  Perception of texture by vision and touch: multidimensionality and intersensory integration. , 1986, Journal of experimental psychology. Human perception and performance.

[42]  R. Fleming Visual perception of materials and their properties , 2014, Vision Research.

[43]  Robert W. Kentridge,et al.  Separate channels for processing form, texture, and color: evidence from FMRI adaptation and visual object agnosia. , 2010, Cerebral cortex.

[44]  Lavanya Sharan,et al.  Image statistics and the perception of surface reflectance , 2005 .

[45]  E. Adelson,et al.  Image statistics and the perception of surface qualities , 2007, Nature.

[46]  Edward H. Adelson,et al.  On seeing stuff: the perception of materials by humans and machines , 2001, IS&T/SPIE Electronic Imaging.

[47]  C. Osgood,et al.  Certain relations among experienced contingencies, associative structure, and contingencies in encoded messages. , 1957, The American journal of psychology.

[48]  Karl R Gegenfurtner,et al.  Categorical sensitivity to color differences. , 2013, Journal of vision.