Understanding rapid category detection via multiply degraded images.

Rapid category detection, as discovered by S. Thorpe, D. Fize, and C. Marlot (1996), demonstrated that the human visual system can detect object categories in natural images in as little as 150 ms. To gain insight into this phenomenon and to determine its relevance to naturally occurring conditions, we degrade the stimulus set along various image dimensions and investigate the effects on perception. To investigate how well modern-day computer vision algorithms cope with degradations, we conduct an analog of this same experiment with state-of-the-art object recognition algorithms. We discover that rapid category detection in humans is quite robust to naturally occurring degradations and is mediated by a non-linear interaction of visual features. In contrast, modern-day object recognition algorithms are not as robust.

[1]  M. Tarr,et al.  Mental rotation and orientation-dependence in shape recognition , 1989, Cognitive Psychology.

[2]  S. Thorpe,et al.  Speed of processing in the human visual system , 1996, Nature.

[3]  C. Liu,et al.  Lighting direction affects recognition of untextured faces in photographic positive and negative , 1999, Vision Research.

[4]  B Julesz,et al.  Masking in Visual Recognition: Effects of Two-Dimensional Filtered Noise , 1973, Science.

[5]  T. Bachmann Identification of spatially quantised tachistoscopic images of faces: How many pixels does it take to carry identity? , 1991 .

[6]  D. Foster Color constancy , 2011, Vision Research.

[7]  Simon J. Thorpe,et al.  Ultra-rapid object detection with saccadic eye movements: Visual processing speed revisited , 2006, Vision Research.

[8]  C. J. McGrath,et al.  Effect of exchange rate return on volatility spill-over across trading regions , 2012 .

[9]  L. Chalupa,et al.  The visual neurosciences , 2004 .

[10]  Thomas Serre,et al.  A feedforward architecture accounts for rapid categorization , 2007, Proceedings of the National Academy of Sciences.

[11]  Simon J Thorpe,et al.  Animals roll around the clock: the rotation invariance of ultrarapid visual processing. , 2006, Journal of vision.

[12]  Antonio Torralba,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition , 2022 .

[13]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.

[14]  Minami Ito,et al.  Size and position invariance of neuronal responses in monkey inferotemporal cortex. , 1995, Journal of neurophysiology.

[15]  R. Yin Looking at Upside-down Faces , 1969 .

[16]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[17]  Guy Richards Is it a bird, is it a plane? , 2009 .

[18]  I. Rock,et al.  The effect on form perception of change of orientation in the third dimension. , 1981, Journal of experimental psychology. Human perception and performance.

[19]  A. Torralba,et al.  Detecting faces in impoverished images , 2010 .

[20]  K. Parker IS IT A BIRD? IS IT A PLANE? , 1999 .

[21]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[22]  Bruno A Olshausen,et al.  Timecourse of neural signatures of object recognition. , 2003, Journal of vision.

[23]  John M. Findlay,et al.  Representations of Vision: Trends and Tacit Assumptions in Vision Research , 2009 .

[24]  C. M. Mooney Age in the development of closure ability in children. , 1957, Canadian journal of psychology.

[25]  Jitendra Malik,et al.  Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[26]  Patrick Cavanagh,et al.  What's up in top-down processing? , 1991 .

[27]  R VanRullen,et al.  Is it a Bird? Is it a Plane? Ultra-Rapid Visual Categorisation of Natural and Artifactual Objects , 2001, Perception.

[28]  Simon J. Thorpe,et al.  Ultra-Rapid Scene Categorization with a Wave of Spikes , 2002, Biologically Motivated Computer Vision.

[29]  D. V. van Essen,et al.  A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.