Superstitious perception by humans and convolutional neural networks

The advent of complex Hierarchical Convolutional Neural Networks (HCNNs) has led to great progress in the field of computer vision, with modern implementations of HCNNs rivalling human performance in object recognition tasks. The design of HCNNs was inspired by current understanding of how the neurons of the human visual system are organized to support object recognition. There are researchers who claim that the computations undertaken by HCNNs are approximating those of the human visual system, because of their high accuracy in predicting the neural activity of regions of the brain involved in object classification (Cichy, Khosla, Pantazis, Torralba, & Oliva, 2016; Khaligh-Razavi & Kriegeskorte, 2014; Yamins et al., 2014). However, there has been little investigation of how HCNNs and humans compare on other tasks that HCNNs have not been trained on. Our study compared the similarity of one HCNN, AlexNet, and humans on a superstitious perception task that involves falsely recognizing a learned object in the absence of strong evidence for its presence. We began by validating a new technique that quantifies human performance on the superstitious perception task. The first phase of the research revealed that human behaviour in the task is dependent on whether participants employed an active or passive task strategy. Next, the responses of our HCNN to the same images were analyzed in a similar manner. The results showed that HCNNs behaved similarly to humans in some ways and differently in others. Specifically, the classification images generated for the HCNN were similar to those derived from human participants, but the HCNN was also more consistent in its responses than humans. A second finding was that the differences in human participants classification images (created by adopting active versus passive strategies) could not be accounted for by simply altering the proportion of false alarm

[1]  Philip M. Merikle,et al.  Perception below the Objective Threshold? , 1993, Consciousness and Cognition.

[2]  I. Ohzawa,et al.  Spatiotemporal organization of simple-cell receptive fields in the cat's striate cortex. II. Linearity of temporal and spatial summation. , 1993, Journal of neurophysiology.

[3]  J. P. Jones,et al.  The two-dimensional spatial structure of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[4]  Thomas L. Griffiths,et al.  Adapting Deep Network Features to Capture Psychological Representations: An Abridged Report , 2017, IJCAI.

[5]  Nikolaus Kriegeskorte,et al.  Deep Supervised, but Not Unsupervised, Models May Explain IT Cortical Representation , 2014, PLoS Comput. Biol..

[6]  Timothée Masquelier,et al.  Deep Networks Can Resemble Human Feed-forward Vision in Invariant Object Recognition , 2015, Scientific Reports.

[7]  James T. Enns,et al.  Relax! Cognitive strategy influences visual search , 2006 .

[8]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[9]  Jacob Cohen,et al.  Applied multiple regression/correlation analysis for the behavioral sciences , 1979 .

[10]  O. Mason,et al.  Apophenia, theory of mind and schizotypy: Perceiving meaning and intentionality in randomness , 2008, Cortex.

[11]  C Brock Kirwan,et al.  Visual object pattern separation varies in older adults , 2013, Learning & memory.

[12]  Antonio Torralba,et al.  Deep Neural Networks predict Hierarchical Spatio-temporal Cortical Dynamics of Human Visual Object Recognition , 2016, ArXiv.

[13]  P. Schyns,et al.  Superstitious Perceptions Reveal Properties of Internal Representations , 2003, Psychological science.

[14]  Frédéric Gosselin,et al.  Bubbles: a technique to reveal the use of information in recognition tasks , 2001, Vision Research.

[15]  Johannes J. Fahrenfort,et al.  Masking Disrupts Reentrant Processing in Human Visual Cortex , 2007, Journal of Cognitive Neuroscience.

[16]  Daniel Smilek,et al.  A methodological note on evaluating performance in a sustained-attention-to-response task , 2013, Behavior research methods.

[17]  Jie Tian,et al.  Faces in the mist: illusory face and letter detection , 2011, i-Perception.

[18]  L. Bottou Stochastic Gradient Learning in Neural Networks , 1991 .

[19]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[20]  J. DiCarlo,et al.  Using goal-driven deep learning models to understand sensory cortex , 2016, Nature Neuroscience.

[21]  D H Brainard,et al.  The Psychophysics Toolbox. , 1997, Spatial vision.

[22]  L. Brooks,et al.  Nonanalytic Cognition: Memory, Perception, and Concept Learning , 1984 .

[23]  Guillermo Sapiro,et al.  A subspace reverse-correlation technique for the study of visual neurons , 1997, Vision Research.

[24]  D. Levi,et al.  Receptive versus perceptive fields from the reverse-correlation viewpoint , 2006, Vision Research.

[25]  Jie Tian,et al.  Seeing Jesus in toast: Neural and behavioral correlates of face pareidolia , 2014, Cortex.

[26]  Dario L. Ringach,et al.  Reverse correlation in neurophysiology , 2004, Cogn. Sci..

[27]  R. Dotsch,et al.  The Relationship Between Mental Representations of Welfare Recipients and Attitudes Toward Welfare , 2017, Psychological science.

[28]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[29]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[30]  Nikolaus Kriegeskorte,et al.  Frontiers in Systems Neuroscience Systems Neuroscience , 2022 .

[31]  A. Marcel Conscious and unconscious perception: Experiments on visual masking and word recognition , 1983, Cognitive Psychology.

[32]  Michael Lifshitz,et al.  Using suggestion to modulate automatic processes: From Stroop to McGurk and beyond , 2013, Cortex.

[33]  Ha Hong,et al.  Performance-optimized hierarchical models predict neural responses in higher visual cortex , 2014, Proceedings of the National Academy of Sciences.

[34]  D G Pelli,et al.  The VideoToolbox software for visual psychophysics: transforming numbers into movies. , 1997, Spatial vision.

[35]  C W Eriksen,et al.  The use of a visual mask may seriously confound your experiment , 1980, Perception & psychophysics.

[36]  Stephanie M. Stalinski,et al.  Journal of Experimental Psychology: Learning, Memory, and Cognition , 2012 .