Time to wave good-bye to phase scrambling: creating controlled scrambled images using diffeomorphic transformations.

To isolate the neural mechanisms associated with recognizing objects from those processing basic visual properties, control stimuli are required that contain the same perceptual properties as the objects but are unrecognizable. We demonstrate that conventional methods for generating control stimuli (phase scrambling, box scrambling, texture scrambling) yield poor controls because they dramatically distort the basic visual properties (e.g., spatial frequency, perceptual organization) to which even the earliest stages of visual processing are sensitive. We developed a new scrambling method, using a diffeomorphic transformation that preserves the basic perceptual properties of the image while removing meaning. We acquired perceptual ratings to determine the least amount of scrambling necessary to remove recognition. We hypothesized that our "diffeomorphic" images would produce neural activity at the earliest stages of the visual system that more closely matched activity in response to intact images relative to the other scrambling methods. To test this hypothesis, we used the HMAX computational model of object recognition and compared the simulated neural activity at the earliest stages of the visual system (layers S1, C1, and S2) between a set of 149 images scrambled using each distortion method to their intact version. We found that scrambled "diffeomorphed" images were indistinguishable to intact images in each layer of the model, but all of the other distortion methods yielded quite different patterns. Our results indicate that "diffeomorphed" images serve as more appropriate control stimuli in neuroimaging studies that aim to disentangle the representations of perceptual and semantic object properties.

[1]  O. Reiser,et al.  Principles Of Gestalt Psychology , 1936 .

[2]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[3]  A.V. Oppenheim,et al.  The importance of phase in signals , 1980, Proceedings of the IEEE.

[4]  D. Burr,et al.  Feature detection in human vision: a phase-dependent energy model , 1988, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[5]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[6]  Keiji Tanaka,et al.  Neuronal selectivities to complex object features in the ventral visual pathway of the macaque cerebral cortex. , 1994, Journal of neurophysiology.

[7]  R. Malach,et al.  Object-related activity revealed by functional magnetic resonance imaging in human occipital cortex. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Keiji Tanaka,et al.  Inferotemporal cortex and object vision. , 1996, Annual review of neuroscience.

[9]  N. Kanwisher,et al.  The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception , 1997, The Journal of Neuroscience.

[10]  S. Edelman,et al.  Human Brain Mapping 6:316–328(1998) � A Sequence of Object-Processing Stages Revealed by fMRI in the Human Occipital Lobe , 2022 .

[11]  Nancy Kanwisher,et al.  A cortical representation of the local visual environment , 1998, Nature.

[12]  M. Thomson,et al.  Visual coding and the phase structure of natural scenes. , 1999, Network.

[13]  R. Vogels,et al.  Effect of image scrambling on inferior temporal cortical responses. , 1999, Neuroreport.

[14]  Leslie G. Ungerleider,et al.  The Representation of Objects in the Human Occipital and Temporal Cortex , 2000, Journal of Cognitive Neuroscience.

[15]  N. Kanwisher,et al.  Cortical Regions Involved in Perceiving Object Shape , 2000, The Journal of Neuroscience.

[16]  A. T. Smith,et al.  Spatiotemporal Frequency and Direction Sensitivities of Human Visual Areas Measured Using fMRI , 2000, NeuroImage.

[17]  T. Hendler,et al.  A hierarchical axis of object processing stages in the human visual cortex. , 2001, Cerebral cortex.

[18]  T. Poggio,et al.  Neural mechanisms of object recognition , 2002, Current Opinion in Neurobiology.

[19]  R. Hess,et al.  What causes non-monotonic tuning of fMRI response to noisy images? , 2002, Current Biology.

[20]  Nikos K. Logothetis,et al.  The Effect of Image Scrambling on Visual Cortical BOLD Activity in the Anesthetized Monkey , 2002, NeuroImage.

[21]  Zhou Wang,et al.  Local Phase Coherence and the Perception of Blur , 2003, NIPS.

[22]  Peter Kovesi,et al.  Phase Congruency Detects Corners and Edges , 2003, DICTA.

[23]  Eero P. Simoncelli,et al.  A Parametric Texture Model Based on Joint Statistics of Complex Wavelet Coefficients , 2000, International Journal of Computer Vision.

[24]  Rhodri Cusack,et al.  The Intraparietal Sulcus and Perceptual Organization , 2005, Journal of Cognitive Neuroscience.

[25]  Thomas Serre,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  D. Field,et al.  Does spatial invariance result from insensitivity to change? , 2016, Journal of vision.

[27]  Keiji Tanaka,et al.  Matching Categorical Object Representations in Inferior Temporal Cortex of Man and Monkey , 2008, Neuron.

[28]  G. Rousselet,et al.  Parametric study of EEG sensitivity to phase noise during face processing , 2008, BMC Neuroscience.

[29]  Frédéric Gosselin,et al.  Information processing algorithms in the brain , 2009, Trends in Cognitive Sciences.

[30]  R. Rosenholtz,et al.  A summary-statistic representation in peripheral vision explains visual crowding. , 2009, Journal of vision.

[31]  Michelle R. Greene,et al.  PSYCHOLOGICAL SCIENCE Research Article The Briefest of Glances The Time Course of Natural Scene Understanding , 2022 .

[32]  Nicole C. Rust,et al.  Selectivity and Tolerance (“Invariance”) Both Increase as Visual Information Propagates from Cortical Area V4 to IT , 2010, The Journal of Neuroscience.

[33]  John Duncan,et al.  Discrete Object Representation, Attention Switching, and Task Difficulty in the Parietal Lobe , 2010, Journal of Cognitive Neuroscience.

[34]  Jason Bell,et al.  Local motion effects on form in radial frequency patterns. , 2010, Journal of vision.

[35]  Peter J Bex,et al.  (In) sensitivity to spatial distortion in natural scenes. , 2010, Journal of vision.

[36]  P. Schyns,et al.  Visual Object Categorization in the Brain: What Can We Really Learn from ERP Peaks? , 2011, Front. Hum. Neurosci..

[37]  Adam P. Morris,et al.  Weighted integration of visual position information. , 2011, Journal of vision.

[38]  Jun Saiki,et al.  Higher-order image statistics is a cue for animal detection. , 2011 .

[39]  Richard F Murray,et al.  Classification images: A review. , 2011, Journal of vision.

[40]  R. Rosenholtz,et al.  A summary statistic representation in peripheral vision explains visual search. , 2009, Journal of vision.

[41]  Bruno Rossion,et al.  An objective method for measuring face detection thresholds using the sweep steady-state visual evoked response. , 2012, Journal of vision.

[42]  Matthias Bethge,et al.  How Sensitive Is the Human Visual System to the Local Statistics of Natural Images , 2012 .

[43]  A. Caramazza,et al.  Conceptual Object Representations in Human Anterior Temporal Cortex , 2012, The Journal of Neuroscience.

[44]  J. Kulikowski,et al.  Phases of daylight and the stability of color perception in the near peripheral human retina. , 2012, Journal of vision.

[45]  Eero P. Simoncelli,et al.  A functional and perceptual signature of the second visual area in primates , 2013, Nature Neuroscience.

[46]  Michelle E. Costanzo,et al.  Spatial and temporal features of superordinate semantic processing studied with fMRI and EEG , 2013, Front. Hum. Neurosci..

[47]  Rufin van Rullen,et al.  SWIFT: A novel method to track the neural correlates of recognition , 2013, NeuroImage.