Simple line drawings suffice for functional MRI decoding of natural scene categories

Humans are remarkably efficient at categorizing natural scenes. In fact, scene categories can be decoded from functional MRI (fMRI) data throughout the ventral visual cortex, including the primary visual cortex, the parahippocampal place area (PPA), and the retrosplenial cortex (RSC). Here we ask whether, and where, we can still decode scene category if we reduce the scenes to mere lines. We collected fMRI data while participants viewed photographs and line drawings of beaches, city streets, forests, highways, mountains, and offices. Despite the marked difference in scene statistics, we were able to decode scene category from fMRI activity evoked by line drawings just as well as from activity evoked by color photographs, from primary visual cortex through PPA and RSC. Even more remarkably, in PPA and RSC, error patterns for decoding from line drawings were very similar to those from color photographs. These data suggest that, in these regions, the information used to distinguish scene category is similar for line drawings and photographs. To determine the relative contributions of local and global structure to the human ability to categorize scenes, we selectively removed long or short contours from the line drawings. In a category-matching task, participants performed significantly worse when long contours were removed than when short contours were removed. We conclude that global scene structure, which is preserved in line drawings, plays an integral part in representing scene categories.
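One common way to compare decoding error patterns between two stimulus types is to correlate the off-diagonal entries of their confusion matrices. The sketch below illustrates that idea only. The abstract does not state the exact procedure used, so the function names, the choice of Pearson correlation, and the toy 3-category matrices are all assumptions made for illustration, and none of the numbers come from the study.

```python
import math


def off_diagonal(confusion):
    """Return the off-diagonal (error) entries of a square matrix, row by row."""
    n = len(confusion)
    return [confusion[i][j] for i in range(n) for j in range(n) if i != j]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (norm_x * norm_y)


def error_pattern_similarity(conf_a, conf_b):
    """Correlate the error (off-diagonal) entries of two confusion matrices."""
    return pearson(off_diagonal(conf_a), off_diagonal(conf_b))


# Made-up confusion matrices (rows: true category, columns: predicted category).
# These values are hypothetical and are not data from the study.
photos = [
    [0.60, 0.30, 0.10],
    [0.25, 0.65, 0.10],
    [0.05, 0.15, 0.80],
]
line_drawings = [
    [0.55, 0.35, 0.10],
    [0.30, 0.60, 0.10],
    [0.10, 0.15, 0.75],
]

similarity = error_pattern_similarity(photos, line_drawings)
print(f"error-pattern correlation: {similarity:.3f}")
```

The diagonal is excluded so that the comparison reflects which categories get confused with which, rather than overall decoding accuracy.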
