Cognitive Deficit of Deep Learning in Numerosity

Subitizing, or the sense of small natural numbers, is an innate cognitive function of humans and primates; it responds to visual stimuli prior to the development of any symbolic skills, language or arithmetic. Given successes of deep learning (DL) in tasks of visual intelligence and given the primitivity of number sense, a tantalizing question is whether DL can comprehend numbers and perform subitizing. But somewhat disappointingly, extensive experiments of the type of cognitive psychology demonstrate that the examples-driven black box DL cannot see through superficial variations in visual representations and distill the abstract notion of natural number, a task that children perform with high accuracy and confidence. The failure is apparently due to the learning method not the CNN computational machinery itself. A recurrent neural network capable of subitizing does exist, which we construct by encoding a mechanism of mathematical morphology into the CNN convolutional kernels. Also, we investigate, using subitizing as a test bed, the ways to aid the black box DL by cognitive priors derived from human insight. Our findings are mixed and interesting, pointing to both cognitive deficit of pure DL, and some measured successes of boosting DL by predetermined cognitive implements. This case study of DL in cognitive computing is meaningful for visual numerosity represents a minimum level of human intelligence.

[2]  E. Miller,et al.  Coding of Cognitive Magnitude Compressed Scaling of Numerical Information in the Primate Prefrontal Cortex , 2003, Neuron.

[3]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Midori Tokita,et al.  How might the discrepancy in the effects of perceptual variables on numerosity judgment be reconciled? , 2010, Attention, perception & psychophysics.

[5]  Olaf Sporns,et al.  The small world of the cerebral cortex , 2007, Neuroinformatics.

[6]  Danna Zhou,et al.  d. , 1934, Microbial pathogenesis.

[7]  Tairui Chen,et al.  Going Deeper with Convolutional Neural Network for Intelligent Transportation , 2016 .

[8]  Elizabeth M. Brannon,et al.  Nonverbal representations of time and number in animals and human infants. , 2003 .

[9]  C. Packer,et al.  Roaring and numerical assessment in contests between groups of female lions, Panthera leo , 1994, Animal Behaviour.

[10]  E. L. Kaufman,et al.  The discrimination of visual number. , 1949, The American journal of psychology.

[11]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  B. P. Klein,et al.  Topographic Representation of Numerosity in the Human Parietal Cortex , 2013, Science.

[13]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[14]  Nikola T. Markov,et al.  Anatomy of hierarchy: Feedforward and feedback pathways in macaque visual cortex , 2013, The Journal of comparative neurology.

[15]  Fei Xu,et al.  Numerosity discrimination in infants: Evidence for two systems of representations , 2003, Cognition.

[16]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[17]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[18]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Marco Zorzi,et al.  Emergence of a 'visual number sense' in hierarchical generative models , 2012, Nature Neuroscience.

[20]  Xiaogang Wang,et al.  Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.

[21]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[22]  S. Dehaene,et al.  Cultural Recycling of Cortical Maps , 2007, Neuron.

[23]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[24]  Andreas Nieder,et al.  A parieto-frontal network for visual numerical information in the monkey. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Xiaoou Tang,et al.  Surpassing Human-Level Face Verification Performance on LFW with GaussianFace , 2014, AAAI.

[26]  E. Spelke,et al.  Sources of mathematical thinking: behavioral and brain-imaging evidence. , 1999, Science.

[27]  Samuel Ritter,et al.  Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study , 2017, ICML.

[28]  Andrew Zisserman,et al.  Microscopy cell counting and detection with fully convolutional regression networks , 2018, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[29]  Philippe Pinel,et al.  Tuning Curves for Approximate Numerosity in the Human Intraparietal Sulcus , 2004, Neuron.

[30]  Xiaogang Wang,et al.  Cross-scene crowd counting via deep convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  S. Dehaene,et al.  The Number Sense: How the Mind Creates Mathematics. , 1998 .

[32]  Xiaogang Wang,et al.  DeepID3: Face Recognition with Very Deep Neural Networks , 2015, ArXiv.

[33]  Bruce E. Lyon,et al.  Egg recognition and counting reduce costs of avian conspecific brood parasitism , 2003, Nature.