论文信息 - Do Neural Networks Show Gestalt Phenomena? An Exploration of the Law of Closure

Do Neural Networks Show Gestalt Phenomena? An Exploration of the Law of Closure

One characteristic of human visual perception is the presence of `Gestalt phenomena,' that is, that the whole is something other than the sum of its parts. A natural question is whether image-recognition networks show similar effects. Our paper investigates one particular type of Gestalt phenomenon, the law of closure, in the context of a feedforward image classification neural network (NN). This is a robust effect in human perception, but experiments typically rely on measurements (e.g., reaction time) that are not available for artificial neural nets. We describe a protocol for identifying closure effect in NNs, and report on the results of experiments with simple visual stimuli. Our findings suggest that NNs trained with natural images do exhibit closure, in contrast to networks with randomized weights or networks that have been trained on visually random data. Furthermore, the closure effect reflects something beyond good feature extraction; it is correlated with the network's higher layer features and ability to generalize.

[1] Joseph L. Sanguinetti,et al. Increased alpha band activity indexes inhibitory competition across a border during figure assignment , 2014, Vision Research.

[2] Albert S. Bregman,et al. Asking the “What For” Question in Auditory Perception , 2017 .

[3] S. Palmer,et al. A century of Gestalt psychology in visual perception: I. Perceptual grouping and figure-ground organization. , 2012, Psychological bulletin.

[4] Daniel L. K. Yamins,et al. Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition , 2014, PLoS Comput. Biol..

[5] G. Westheimer. Gestalt Theory Reconfigured: Max Wertheimer's Anticipation of Recent Developments in Visual Neuroscience , 1999, Perception.

[6] M. Wertheimer. Laws of organization in perceptual forms. , 1938 .

[7] O. Reiser,et al. Principles Of Gestalt Psychology , 1936 .

[8] James Elder,et al. The effect of contour closure on the rapid discrimination of two-dimensional shapes , 1993, Vision Research.

[9] Duane Schultz,et al. A History of Modern Psychology , 1969 .

[10] Andrea Vedaldi,et al. Deep Image Prior , 2017, International Journal of Computer Vision.

[11] Welch Bl. THE GENERALIZATION OF ‘STUDENT'S’ PROBLEM WHEN SEVERAL DIFFERENT POPULATION VARLANCES ARE INVOLVED , 1947 .

[12] J. Zinker. Creative process in Gestalt therapy , 1977 .

[13] Matthias M. Müller,et al. Human Gamma Band Activity and Perception of a Gestalt , 1999, The Journal of Neuroscience.

[14] Bolei Zhou,et al. Network Dissection: Quantifying Interpretability of Deep Visual Representations , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Johan Wagemans,et al. Quantifying density cues in grouping displays , 2016, Vision Research.

[16] W. Härdle,et al. Applied Multivariate Statistical Analysis , 2003 .

[17] Marina Schmid,et al. An Introduction To The Event Related Potential Technique , 2016 .

[18] Max Tegmark,et al. Why Does Deep and Cheap Learning Work So Well? , 2016, Journal of Statistical Physics.

[19] W. Line,et al. A Visual Motor Gestalt Test and Its Clinical Use , 1940 .

[20] P. König,et al. Combining EEG and eye tracking: identification, characterization, and correction of eye movement artifacts in electroencephalographic data , 2012, Front. Hum. Neurosci..

[21] Matthias Bethge,et al. Comparing deep neural networks against humans: object recognition when the signal gets weaker , 2017, ArXiv.

[22] Nikolaus Kriegeskorte,et al. Frontiers in Systems Neuroscience Systems Neuroscience , 2022 .

[23] James J DiCarlo,et al. Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks , 2018, The Journal of Neuroscience.

[24] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Shlomit Yuval-Greenberg,et al. Saccadic spike potentials in gamma-band EEG: Characterization, detection and suppression , 2010, NeuroImage.

[26] R. Behrens. Art, Design and Gestalt Theory , 2017 .

[27] R. von der Heydt,et al. Illusory contours and cortical neuron responses. , 1984, Science.

[28] Jitendra Malik,et al. Learning a classification model for segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[29] M. Wertheimer,et al. A source book of Gestalt psychology. , 1939 .

[30] R. W. Ditchburn. Seeing is Deceiving: The Psychology of Visual Illusions , 1979 .

[31] W. Ma. Organizing probabilistic models of perception , 2012, Trends in Cognitive Sciences.

[32] F. Jäkel,et al. An overview of quantitative approaches in Gestalt perception , 2016, Vision Research.

[33] Samuel Ritter,et al. Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study , 2017, ICML.

[34] R. Shapley,et al. Spatial and Temporal Properties of Illusory Contours and Amodal Boundary Completion , 1996, Vision Research.

[35] Antonio Torralba,et al. Deep Neural Networks predict Hierarchical Spatio-temporal Cortical Dynamics of Human Visual Object Recognition , 2016, ArXiv.

[36] Quoc V. Le,et al. Do Better ImageNet Models Transfer Better? , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[37] Ha Hong,et al. Performance-optimized hierarchical models predict neural responses in higher visual cortex , 2014, Proceedings of the National Academy of Sciences.

[38] W. Geisler. Sequential ideal-observer analysis of visual discriminations. , 1989 .

[39] E. Schröger,et al. High-pass filters and baseline correction in M/EEG analysis. Commentary on: “How inappropriate high-pass filters can produce artefacts and incorrect conclusions in ERP studies of language and cognition” , 2016, Journal of Neuroscience Methods.

[40] Philip J. Kellman,et al. Modeling spatiotemporal boundary formation , 2012, Vision Research.

[41] R. Kimchi,et al. Perceptual organization, visual attention, and objecthood , 2016, Vision Research.

[42] J. Feldman. What is a visual object? , 2003, Trends in Cognitive Sciences.

[43] Jascha Sohl-Dickstein,et al. Adversarial Examples that Fool both Computer Vision and Time-Limited Humans , 2018, NeurIPS.

[44] Jean-Michel Morel,et al. From Gestalt Theory to Image Analysis: A Probabilistic Approach , 2007 .

[45] Rich Caruana,et al. Do Deep Nets Really Need to be Deep? , 2013, NIPS.

[46] J. R. Pomerantz,et al. A century of Gestalt psychology in visual perception: II. Conceptual and theoretical foundations. , 2012, Psychological bulletin.

[47] T. Poggio,et al. Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[48] Lauren E. Welbourne,et al. Humans, but Not Deep Neural Networks, Often Miss Giant Targets in Scenes , 2017, Current Biology.

[49] Arnaud Delorme,et al. Single-Trial Normalization for Event-Related Spectral Decomposition Reduces Sensitivity to Noisy Trials , 2011, Front. Psychology.

[50] R. Kimchi. Primacy of wholistic processing and global/local paradigm: a critical review. , 1992, Psychological bulletin.

[51] Felix Hill,et al. Measuring abstract reasoning in neural networks , 2018, ICML.

[52] Steven J. Luck,et al. On high-pass filter artifacts (they’re real) and baseline correction (it's a good idea) in ERP/ERMF analysis , 2016, Journal of Neuroscience Methods.