Efficient coding explains the universal law of generalization in human perception

Balancing costs and performance Deciding whether a novel object is another instance of something already known or an example of something different is an easily solved problem. Empirical mapping of human performance across a wide range of domains has established an exponential relationship between the generalization gradient and interstimuli distance. Sims now shows that this relationship can be derived from a consideration of the costs of optimal information coding. Science, this issue p. 652 Learning from experience is an inevitable outcome of any error-minimizing information processing system. Perceptual generalization and discrimination are fundamental cognitive abilities. For example, if a bird eats a poisonous butterfly, it will learn to avoid preying on that species again by generalizing its past experience to new perceptual stimuli. In cognitive science, the “universal law of generalization” seeks to explain this ability and states that generalization between stimuli will follow an exponential function of their distance in “psychological space.” Here, I challenge existing theoretical explanations for the universal law and offer an alternative account based on the principle of efficient coding. I show that the universal law emerges inevitably from any information processing system (whether biological or artificial) that minimizes the cost of perceptual error subject to constraints on the ability to process or transmit information.

[1]  R. Shepard Stimulus and response generalization: A stochastic model relating generalization to distance in psychological space , 1957 .

[2]  R. Nosofsky Attention and learning processes in the identification and categorization of integral stimuli. , 1987, Journal of experimental psychology. Learning, memory, and cognition.

[3]  Chris R Sims,et al.  The cost of misremembering: Inferring the loss function in visual working memory. , 2015, Journal of vision.

[4]  C. Sims Rate–distortion theory and human perception , 2016, Cognition.

[5]  R. Jacobs,et al.  An ideal observer analysis of visual working memory. , 2012, Psychological review.

[6]  Richard D. Morey,et al.  Learning in a unidimensional absolute identification task , 2004, Psychonomic bulletin & review.

[7]  K Cheng,et al.  Shepard's Universal Law Supported by Honeybees in Spatial Generalization , 2000, Psychological science.

[8]  Charles Kemp,et al.  How to Grow a Mind: Statistics, Structure, and Abstraction , 2011, Science.

[9]  Toby Berger,et al.  Rate distortion theory : a mathematical basis for data compression , 1971 .

[10]  R. Nosofsky,et al.  Predicting similarity and categorization from identification. , 1991, Journal of experimental psychology. General.

[11]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[12]  D. Kornbrot,et al.  Theoretical and empirical comparison of Luce’s choice model and logistic Thurstone model of categorical judgment , 1978, Perception & psychophysics.

[13]  A. Stocker,et al.  Lawful relation between perceptual bias and discriminability , 2017, Proceedings of the National Academy of Sciences.

[14]  J. Tenenbaum,et al.  Generalization, similarity, and Bayesian inference. , 2001, The Behavioral and brain sciences.

[15]  A. Tversky Features of Similarity , 1977 .

[16]  Sarah Marzen,et al.  The evolution of lossy compression , 2015, Journal of The Royal Society Interface.

[17]  Eero P. Simoncelli,et al.  Noise characteristics and prior expectations in human visual speed perception , 2006, Nature Neuroscience.

[18]  R. Shepard,et al.  Toward a universal law of generalization for psychological science. , 1987, Science.

[19]  M. Myles-Worsley,et al.  The influence of expertise on X-ray image processing. , 1988, Journal of experimental psychology. Learning, memory, and cognition.

[20]  Nick Chater,et al.  The Generalized Universal Law of Generalization , 2001, ArXiv.

[21]  Jane Van Zandt Brower,et al.  EXPERIMENTAL STUDIES OF MIMICRY IN SOME NORTH AMERICAN BUTTERFLIES: PART I. THE MONARCH, DANAUS PLEXIPPUS, AND VICEROY, LIMENITIS ARCHIPPUS ARCHIPPUS , 1958 .

[22]  Lynette A. Jones,et al.  Identification of vibrotactile patterns: building blocks for tactons , 2013, 2013 World Haptics Conference (WHC).

[23]  S. Ghirlanda,et al.  A century of generalization , 2003, Animal Behaviour.

[24]  J. Grey Multidimensional perceptual scaling of musical timbres. , 1977, The Journal of the Acoustical Society of America.

[25]  Joshua B. Tenenbaum,et al.  Human-level concept learning through probabilistic program induction , 2015, Science.

[26]  Jeffrey S. Perry,et al.  Statistics for optimal point prediction in natural images. , 2011, Journal of vision.

[27]  G. A. Miller,et al.  An Analysis of Perceptual Confusions Among Some English Consonants , 1955 .

[28]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[29]  L. Marks,et al.  study of taste perception , 1999 .

[30]  Richard E. Blahut,et al.  Computation of channel capacity and rate-distortion functions , 1972, IEEE Trans. Inf. Theory.

[31]  Hong Z. Tan,et al.  Haptic stiffness identification by veterinarians and novices: A comparison , 2009, World Haptics 2009 - Third Joint EuroHaptics conference and Symposium on Haptic Interfaces for Virtual Environment and Teleoperator Systems.