Controlled-rearing studies of newborn chicks and deep neural networks

Convolutional neural networks (CNNs) can now achieve human-level performance on challenging object recognition tasks. CNNs are also the leading quantitative models in terms of predicting neural and behavioral responses in visual recognition tasks. However, there is a widely accepted critique of CNN models: unlike newborn animals, which learn rapidly and efficiently, CNNs are thought to be “data hungry,” requiring massive amounts of training data to develop accurate models for object recognition. This critique challenges the promise of using CNNs as models of visual development. Here, we directly examined whether CNNs are more data hungry than newborn animals by performing parallel controlled-rearing experiments on newborn chicks and CNNs. We raised newborn chicks in strictly controlled visual environments, then simulated the training data available in that environment by constructing a virtual animal chamber in a video game engine. We recorded the visual images acquired by an agent moving through the virtual chamber and used those images to train CNNs. When CNNs received similar visual training data as chicks, the CNNs successfully solved the same challenging view-invariant object recognition tasks as the chicks. Thus, the CNNs were not more data hungry than animals: both CNNs and chicks successfully developed robust object models from training data of a single object.

[1]  Justin N Wood,et al.  Measuring the speed of newborn object recognition in controlled visual worlds. , 2017, Developmental science.

[2]  M. Livingstone,et al.  On the relationship between maps and domains in inferotemporal cortex , 2021, Nature Reviews Neuroscience.

[3]  Nikolaus Kriegeskorte,et al.  Cognitive computational neuroscience , 2018, Nature Neuroscience.

[4]  Justin N Wood,et al.  Newly Hatched Chicks Solve the Visual Binding Problem , 2014, Psychological science.

[5]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[6]  Jonas Kubilius,et al.  Integrative Benchmarking to Advance Neurally Mechanistic Models of Human Intelligence , 2020, Neuron.

[7]  J. DiCarlo,et al.  Using goal-driven deep learning models to understand sensory cortex , 2016, Nature Neuroscience.

[8]  Justin N. Wood,et al.  Using automated controlled rearing to explore the origins of object permanence. , 2019, Developmental science.

[9]  R. Held,et al.  MOVEMENT-PRODUCED STIMULATION IN THE DEVELOPMENT OF VISUALLY GUIDED BEHAVIOR. , 1963, Journal of comparative and physiological psychology.

[10]  E. Gibson,et al.  Behavior of light- and dark-reared rats on a visual cliff. , 1957, Science.

[11]  Marwan Mattar,et al.  Unity: A General Platform for Intelligent Agents , 2018, ArXiv.

[12]  James J DiCarlo,et al.  Neural population control via deep image synthesis , 2018, Science.

[13]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[14]  Michael C. Frank,et al.  Unsupervised neural network models of the ventral visual stream , 2020, Proceedings of the National Academy of Sciences.

[15]  Alexei A. Efros,et al.  What makes ImageNet good for transfer learning? , 2016, ArXiv.

[16]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[17]  John Rohrlich,et al.  Deep Predictive Learning in Neocortex and Pulvinar , 2020, Journal of Cognitive Neuroscience.

[18]  Linda B. Smith,et al.  The Developing Infant Creates a Curriculum for Statistical Learning , 2018, Trends in Cognitive Sciences.

[19]  Justin N. Wood Newborn chickens generate invariant object representations at the onset of visual object experience , 2013, Proceedings of the National Academy of Sciences.

[20]  Lawrence D. Jackel,et al.  Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[21]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Joshua C. Peterson,et al.  Capturing human categorization of natural images by combining deep networks and cognitive models , 2020, Nature Communications.

[23]  Jason G. Goldman,et al.  An automated controlled-rearing method for studying the origins of movement recognition in newly hatched chicks , 2015, Animal Cognition.

[24]  Justin N. Wood,et al.  One-shot object parsing in newborn chicks. , 2021, Journal of experimental psychology. General.

[25]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[26]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[27]  Justin N. Wood,et al.  A chicken model for studying the emergence of invariant object recognition , 2015, Front. Neural Circuits.

[28]  Linda B. Smith,et al.  Toddler-Inspired Visual Object Learning , 2018, NeurIPS.

[29]  Justin N Wood,et al.  Face recognition in newly hatched chicks at the onset of vision. , 2015, Journal of experimental psychology. Animal learning and cognition.

[30]  Michal Valko,et al.  Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning , 2020, NeurIPS.